Mercurial > cortex
view org/vision.org @ 183:446b115bddc2
renamed sense-util.org to sense.org.
author | Robert McIntyre <rlm@mit.edu> |
---|---|
date | Sat, 04 Feb 2012 07:49:46 -0700 |
parents | 04a39e6bb695 |
children | 22548d48cc85 |
line wrap: on
line source
1 #+title: Simulated Sense of Sight2 #+author: Robert McIntyre3 #+email: rlm@mit.edu4 #+description: Simulated sight for AI research using JMonkeyEngine3 and clojure5 #+keywords: computer vision, jMonkeyEngine3, clojure6 #+SETUPFILE: ../../aurellem/org/setup.org7 #+INCLUDE: ../../aurellem/org/level-0.org8 #+babel: :mkdirp yes :noweb yes :exports both10 * COMMENT Vision12 I want to make creatures with eyes. Each eye can be independely moved13 and should see its own version of the world depending on where it is.15 Here's how vision will work.17 Make the continuation in scene-processor take FrameBuffer,18 byte-buffer, BufferedImage already sized to the correct19 dimensions. the continuation will decide wether to "mix" them20 into the BufferedImage, lazily ignore them, or mix them halfway21 and call c/graphics card routines.23 (vision creature) will take an optional :skip argument which will24 inform the continuations in scene processor to skip the given25 number of cycles 0 means that no cycles will be skipped.27 (vision creature) will return [init-functions sensor-functions].28 The init-functions are each single-arg functions that take the29 world and register the cameras and must each be called before the30 corresponding sensor-functions. Each init-function returns the31 viewport for that eye which can be manipulated, saved, etc. Each32 sensor-function is a thunk and will return data in the same33 format as the tactile-sensor functions the structure is34 [topology, sensor-data]. Internally, these sensor-functions35 maintain a reference to sensor-data which is periodically updated36 by the continuation function established by its init-function.37 They can be queried every cycle, but their information may not38 necessairly be different every cycle.40 Each eye in the creature in blender will work the same way as41 joints -- a zero dimensional object with no geometry whose local42 coordinate system determines the orientation of the resulting43 eye. All eyes will have a parent named "eyes" just as all joints44 have a parent named "joints". The resulting camera will be a45 ChaseCamera or a CameraNode bound to the geo that is closest to46 the eye marker. The eye marker will contain the metadata for the47 eye, and will be moved by it's bound geometry. The dimensions of48 the eye's camera are equal to the dimensions of the eye's "UV"49 map.51 #+name: eyes52 #+begin_src clojure53 (ns cortex.vision54 "Simulate the sense of vision in jMonkeyEngine3. Enables multiple55 eyes from different positions to observe the same world, and pass56 the observed data to any arbitray function. Automatically reads57 eye-nodes from specially prepared blender files and instanttiates58 them in the world as actual eyes."59 {:author "Robert McIntyre"}60 (:use (cortex world sense util))61 (:use clojure.contrib.def)62 (:import com.jme3.post.SceneProcessor)63 (:import (com.jme3.util BufferUtils Screenshots))64 (:import java.nio.ByteBuffer)65 (:import java.awt.image.BufferedImage)66 (:import (com.jme3.renderer ViewPort Camera))67 (:import com.jme3.math.ColorRGBA)68 (:import com.jme3.renderer.Renderer)69 (:import com.jme3.app.Application)70 (:import com.jme3.texture.FrameBuffer)71 (:import (com.jme3.scene Node Spatial)))73 (defn vision-pipeline74 "Create a SceneProcessor object which wraps a vision processing75 continuation function. The continuation is a function that takes76 [#^Renderer r #^FrameBuffer fb #^ByteBuffer b #^BufferedImage bi],77 each of which has already been appropiately sized."78 [continuation]79 (let [byte-buffer (atom nil)80 renderer (atom nil)81 image (atom nil)]82 (proxy [SceneProcessor] []83 (initialize84 [renderManager viewPort]85 (let [cam (.getCamera viewPort)86 width (.getWidth cam)87 height (.getHeight cam)]88 (reset! renderer (.getRenderer renderManager))89 (reset! byte-buffer90 (BufferUtils/createByteBuffer91 (* width height 4)))92 (reset! image (BufferedImage.93 width height94 BufferedImage/TYPE_4BYTE_ABGR))))95 (isInitialized [] (not (nil? @byte-buffer)))96 (reshape [_ _ _])97 (preFrame [_])98 (postQueue [_])99 (postFrame100 [#^FrameBuffer fb]101 (.clear @byte-buffer)102 (continuation @renderer fb @byte-buffer @image))103 (cleanup []))))105 (defn frameBuffer->byteBuffer!106 "Transfer the data in the graphics card (Renderer, FrameBuffer) to107 the CPU (ByteBuffer)."108 [#^Renderer r #^FrameBuffer fb #^ByteBuffer bb]109 (.readFrameBuffer r fb bb) bb)111 (defn byteBuffer->bufferedImage!112 "Convert the C-style BGRA image data in the ByteBuffer bb to the AWT113 style ABGR image data and place it in BufferedImage bi."114 [#^ByteBuffer bb #^BufferedImage bi]115 (Screenshots/convertScreenShot bb bi) bi)117 (defn BufferedImage!118 "Continuation which will grab the buffered image from the materials119 provided by (vision-pipeline)."120 [#^Renderer r #^FrameBuffer fb #^ByteBuffer bb #^BufferedImage bi]121 (byteBuffer->bufferedImage!122 (frameBuffer->byteBuffer! r fb bb) bi))124 (defn add-camera!125 "Add a camera to the world, calling continuation on every frame126 produced."127 [#^Application world camera continuation]128 (let [width (.getWidth camera)129 height (.getHeight camera)130 render-manager (.getRenderManager world)131 viewport (.createMainView render-manager "eye-view" camera)]132 (doto viewport133 (.setClearFlags true true true)134 (.setBackgroundColor ColorRGBA/Black)135 (.addProcessor (vision-pipeline continuation))136 (.attachScene (.getRootNode world)))))138 (defn retina-sensor-profile139 "Return a map of pixel selection functions to BufferedImages140 describing the distribution of light-sensitive components of this141 eye. Each function creates an integer from the rgb values found in142 the pixel. :red, :green, :blue, :gray are already defined as143 extracting the red, green, blue, and average components144 respectively."145 [#^Spatial eye]146 (if-let [eye-map (meta-data eye "eye")]147 (map-vals148 load-image149 (eval (read-string eye-map)))))151 (defn eye-dimensions152 "Returns [width, height] specified in the metadata of the eye"153 [#^Spatial eye]154 (let [dimensions155 (map #(vector (.getWidth %) (.getHeight %))156 (vals (retina-sensor-profile eye)))]157 [(apply max (map first dimensions))158 (apply max (map second dimensions))]))160 (defvar161 ^{:arglists '([creature])}162 eyes163 (sense-nodes "eyes")164 "Return the children of the creature's \"eyes\" node.")166 (defn add-eye!167 "Create a Camera centered on the current position of 'eye which168 follows the closest physical node in 'creature and sends visual169 data to 'continuation."170 [#^Node creature #^Spatial eye]171 (let [target (closest-node creature eye)172 [cam-width cam-height] (eye-dimensions eye)173 cam (Camera. cam-width cam-height)]174 (.setLocation cam (.getWorldTranslation eye))175 (.setRotation cam (.getWorldRotation eye))176 (.setFrustumPerspective177 cam 45 (/ (.getWidth cam) (.getHeight cam))178 1 1000)179 (bind-sense target cam)180 cam))182 (defvar color-channel-presets183 {:all 0xFFFFFF184 :red 0xFF0000185 :blue 0x0000FF186 :green 0x00FF00}187 "Bitmasks for common RGB color channels")189 (defn vision-fn190 "Returns a list of functions, each of which will return a color191 channel's worth of visual information when called inside a running192 simulation."193 [#^Node creature #^Spatial eye & {skip :skip :or {skip 0}}]194 (let [retinal-map (retina-sensor-profile eye)195 camera (add-eye! creature eye)196 vision-image197 (atom198 (BufferedImage. (.getWidth camera)199 (.getHeight camera)200 BufferedImage/TYPE_BYTE_BINARY))201 register-eye!202 (runonce203 (fn [world]204 (add-camera!205 world camera206 (let [counter (atom 0)]207 (fn [r fb bb bi]208 (if (zero? (rem (swap! counter inc) (inc skip)))209 (reset! vision-image210 (BufferedImage! r fb bb bi))))))))]211 (vec212 (map213 (fn [[key image]]214 (let [whites (white-coordinates image)215 topology (vec (collapse whites))216 mask (color-channel-presets key)]217 (fn [world]218 (register-eye! world)219 (vector220 topology221 (vec222 (for [[x y] whites]223 (bit-and224 mask (.getRGB @vision-image x y))))))))225 retinal-map))))228 ;; TODO maybe should add a viewport-manipulation function to229 ;; automatically change viewport settings, attach shadow filters, etc.231 (defn vision!232 "Returns a function which returns visual sensory data when called233 inside a running simulation"234 [#^Node creature & {skip :skip :or {skip 0}}]235 (reduce236 concat237 (for [eye (eyes creature)]238 (vision-fn creature eye))))240 #+end_src243 Note the use of continuation passing style for connecting the eye to a244 function to process the output. You can create any number of eyes, and245 each of them will see the world from their own =Camera=. Once every246 frame, the rendered image is copied to a =BufferedImage=, and that247 data is sent off to the continuation function. Moving the =Camera=248 which was used to create the eye will change what the eye sees.250 * Example252 #+name: test-vision253 #+begin_src clojure254 (ns cortex.test.vision255 (:use (cortex world util vision))256 (:import java.awt.image.BufferedImage)257 (:import javax.swing.JPanel)258 (:import javax.swing.SwingUtilities)259 (:import java.awt.Dimension)260 (:import javax.swing.JFrame)261 (:import com.jme3.math.ColorRGBA)262 (:import com.jme3.scene.Node)263 (:import com.jme3.math.Vector3f))265 (defn test-two-eyes266 "Testing vision:267 Tests the vision system by creating two views of the same rotating268 object from different angles and displaying both of those views in269 JFrames.271 You should see a rotating cube, and two windows,272 each displaying a different view of the cube."273 []274 (let [candy275 (box 1 1 1 :physical? false :color ColorRGBA/Blue)]276 (world277 (doto (Node.)278 (.attachChild candy))279 {}280 (fn [world]281 (let [cam (.clone (.getCamera world))282 width (.getWidth cam)283 height (.getHeight cam)]284 (add-camera! world cam285 ;;no-op286 (comp (view-image) BufferedImage!)287 )288 (add-camera! world289 (doto (.clone cam)290 (.setLocation (Vector3f. -10 0 0))291 (.lookAt Vector3f/ZERO Vector3f/UNIT_Y))292 ;;no-op293 (comp (view-image) BufferedImage!))294 ;; This is here to restore the main view295 ;; after the other views have completed processing296 (add-camera! world (.getCamera world) no-op)))297 (fn [world tpf]298 (.rotate candy (* tpf 0.2) 0 0)))))299 #+end_src301 #+results: test-vision302 : #'cortex.test.vision/test-two-eyes304 The example code will create two videos of the same rotating object305 from different angles. It can be used both for stereoscopic vision306 simulation or for simulating multiple creatures, each with their own307 sense of vision.309 - As a neat bonus, this idea behind simulated vision also enables one310 to [[../../cortex/html/capture-video.html][capture live video feeds from jMonkeyEngine]].313 * COMMENT code generation314 #+begin_src clojure :tangle ../src/cortex/vision.clj315 <<eyes>>316 #+end_src318 #+begin_src clojure :tangle ../src/cortex/test/vision.clj319 <<test-vision>>320 #+end_src