org/vision.org @ changeset 213:319963720179

fleshing out vision
author Robert McIntyre <rlm@mit.edu>
date Thu, 09 Feb 2012 08:11:10 -0700
parents 8e9825c38941
children 01d3e9855ef9
and then projecting it back onto a surface in the 3D world.

#+caption: jMonkeyEngine supports multiple views to enable split-screen games, like GoldenEye
[[../images/goldeneye-4-player.png]]

* Brief Description of jMonkeyEngine's Rendering Pipeline

jMonkeyEngine allows you to create a =ViewPort=, which represents a
view of the simulated world. You can create as many of these as you
want. Every frame, the =RenderManager= iterates through each
=ViewPort=, rendering the scene on the GPU. For each =ViewPort= there
is a =FrameBuffer= which represents the rendered image in the GPU.

Each =ViewPort= can have any number of attached =SceneProcessor=
objects, which are called every time a new frame is rendered. A
=SceneProcessor= receives a =FrameBuffer= and can do whatever it
wants with the data. Often this consists of invoking GPU-specific
operations on the rendered image. The =SceneProcessor= can also copy
the GPU image data to RAM and process it with the CPU.

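To make this concrete, here is a minimal sketch, not from the
original text, of how a =ViewPort= and =SceneProcessor= fit together
in raw jMonkeyEngine. The function name =observe!= is made up, and
=world= is assumed to be a =SimpleApplication=:

#+begin_src clojure
;; Minimal sketch: render the existing scene from an extra camera and
;; hand every frame to a SceneProcessor. Assumes `world` is a
;; com.jme3.app.SimpleApplication and `processor` implements
;; com.jme3.post.SceneProcessor.
(defn observe!
  [world camera processor]
  (let [view-port (.createMainView (.getRenderManager world)
                                   "extra-view" camera)]
    (doto view-port
      (.setClearFlags true true true)      ; clear color, depth, stencil
      (.attachScene (.getRootNode world))  ; watch the same scene graph
      (.addProcessor processor))))         ; called every rendered frame
#+end_src
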
* The Vision Pipeline

Each eye in the simulated creature needs its own =ViewPort= so that
it can see the world from its own perspective. To this =ViewPort=, I
add a =SceneProcessor= that feeds the visual data to any arbitrary
continuation function for further processing. That continuation
function may perform both CPU and GPU operations on the data. To make
this easy for the continuation function, the =SceneProcessor=
maintains appropriately sized buffers in RAM to hold the data. It
does not do any copying from the GPU to the CPU itself.
#+name: pipeline-1
#+begin_src clojure
(defn vision-pipeline
  "Create a SceneProcessor object which wraps a vision processing
  continuation function. The continuation is a function that takes
  [#^Renderer r #^FrameBuffer fb #^ByteBuffer b #^BufferedImage bi],
  each of which has already been appropriately sized."
  ;; ... unchanged middle of this function elided in the comparison
  ;; view: it sets up the byte-buffer, renderer, and image references
  ;; and the SceneProcessor proxy whose postFrame method follows ...
    (postFrame
     [#^FrameBuffer fb]
     (.clear @byte-buffer)
     (continuation @renderer fb @byte-buffer @image))
    (cleanup []))))
#+end_src

The continuation function given to =(vision-pipeline)= above will be
given a =Renderer= and three containers for image data. The
=FrameBuffer= references the GPU image data, but it cannot be used
directly on the CPU. The =ByteBuffer= and =BufferedImage= are
initially "empty" but are sized to hold the data in the
=FrameBuffer=. I call transferring the GPU image data to the CPU
structures "mixing" the image data. I have provided three functions
to do this mixing.

#+name: pipeline-2
#+begin_src clojure
(defn frameBuffer->byteBuffer!
  "Transfer the data in the graphics card (Renderer, FrameBuffer) to
  the CPU (ByteBuffer)."
  [#^Renderer r #^FrameBuffer fb #^ByteBuffer bb]
  (.readFrameBuffer r fb bb) bb)

;; (this helper was elided in the comparison view)
(defn byteBuffer->bufferedImage!
  "Convert the C-style BGRA image data in the ByteBuffer bb to the
  AWT-style ABGR image data and place it in the BufferedImage bi."
  [#^ByteBuffer bb #^BufferedImage bi]
  (Screenshots/convertScreenShot bb bi) bi)

(defn BufferedImage!
  "Continuation which will grab the buffered image from the materials
  provided by (vision-pipeline)."
  [#^Renderer r #^FrameBuffer fb #^ByteBuffer bb #^BufferedImage bi]
  (byteBuffer->bufferedImage!
   (frameBuffer->byteBuffer! r fb bb) bi))
#+end_src

Note that it is possible to write vision processing algorithms
entirely in terms of =BufferedImage= inputs. Just compose that
=BufferedImage= algorithm with =(BufferedImage!)=. However, a vision
processing algorithm that is entirely hosted on the GPU does not have
to pay for this convenience.
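
For instance, here is a sketch of such a composition. =mean-red= is a
made-up =BufferedImage= statistic, not a function from this library:

#+begin_src clojure
;; A BufferedImage-only algorithm: mean red-channel intensity.
(defn mean-red
  "Average red-channel value (0-255) over a BufferedImage."
  [#^BufferedImage bi]
  (let [w (.getWidth bi) h (.getHeight bi)]
    (/ (reduce +
               (for [x (range w) y (range h)]
                 ;; red byte of the packed ARGB pixel
                 (bit-and 0xFF (bit-shift-right (.getRGB bi x y) 16))))
       (float (* w h)))))

;; Composed with BufferedImage!, it becomes a vision continuation
;; acceptable to (vision-pipeline).
(defn mean-red-vision
  [r fb bb bi]
  (println "mean red:" (mean-red (BufferedImage! r fb bb bi))))
#+end_src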

* Physical Eyes

The vision pipeline described above only deals with moving rendered
image data out of a =ViewPort=; each eye also needs a physical
presence in the simulated world.

Each eye in the creature in blender will work the same way as
joints: a zero-dimensional object with no geometry whose local
coordinate system determines the orientation of the resulting
eye. All eyes will have a parent named "eyes" just as all joints
have a parent named "joints". The resulting camera will be a
ChaseCamera or a CameraNode bound to the geometry that is closest
to the eye marker. The eye marker will contain the metadata for the
eye, and will be moved by its bound geometry. The dimensions of
the eye's camera are equal to the dimensions of the eye's "UV"
map.

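A sketch of what locating these markers might look like; the helper
=eye-nodes= is hypothetical and only illustrates the naming
convention:

#+begin_src clojure
;; Hypothetical helper: collect the eye markers, i.e. the children of
;; the creature's node named "eyes", mirroring how joints are found.
(defn eye-nodes
  [#^Node creature]
  (if-let [#^Node eyes (.getChild creature "eyes")]
    (seq (.getChildren eyes))
    []))
#+end_src
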
=(vision creature)= will take an optional =:skip= argument which will
inform the continuations in scene processor to skip the given number
of cycles; 0 means that no cycles will be skipped.

=(vision creature)= will return =[init-functions sensor-functions]=.
The init-functions are each single-arg functions that take the
world and register the cameras and must each be called before the
corresponding sensor-functions. Each init-function returns the
viewport for that eye, which can be manipulated, saved, etc. Each
sensor-function is a thunk and will return data in the same
format as the tactile-sensor functions; the structure is
=[topology, sensor-data]=. Internally, these sensor-functions
maintain a reference to sensor-data which is periodically updated
by the continuation function established by its init-function.
They can be queried every cycle, but their information may not
necessarily be different every cycle.

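Here is a hypothetical usage sketch of that planned interface,
wrapped in a =comment= form since =(vision creature)= does not exist
yet:

#+begin_src clojure
(comment
  (let [[init-fns sensor-fns] (vision creature)]
    ;; every init-function must run once, before its sensor-function
    (doseq [init! init-fns]
      (init! world))
    ;; every sensor-function is a thunk returning [topology sensor-data]
    (doseq [vision-sense sensor-fns]
      (let [[topology sensor-data] (vision-sense)]
        (println (count sensor-data) "vision activation values")))))
#+end_src
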
#+begin_src clojure
(defn add-camera!
  "Add a camera to the world, calling continuation on every frame
  produced."
  [#^Application world camera continuation]
  (let [width (.getWidth camera)
;; ... a long stretch of unchanged lines (the rest of add-camera! and
;; the test code that binds `world`, `candy`, and `no-op`) is elided
;; in the comparison view; the listing resumes mid-test below ...
       (add-camera! world (.getCamera world) no-op)))
     (fn [world tpf]
       (.rotate candy (* tpf 0.2) 0 0)))))
#+end_src

#+name: vision-header
#+begin_src clojure
(ns cortex.vision
  "Simulate the sense of vision in jMonkeyEngine3. Enables multiple
  eyes from different positions to observe the same world, and pass
  the observed data to any arbitrary function. Automatically reads
  eye-nodes from specially prepared blender files and instantiates
  them in the world as actual eyes."
  {:author "Robert McIntyre"}
  (:use (cortex world sense util))
  (:use clojure.contrib.def)
  (:import com.jme3.post.SceneProcessor)
  (:import (com.jme3.util BufferUtils Screenshots))
  (:import java.nio.ByteBuffer)
  (:import java.awt.image.BufferedImage)
  (:import (com.jme3.renderer ViewPort Camera))
  (:import com.jme3.math.ColorRGBA)
  (:import com.jme3.renderer.Renderer)
  (:import com.jme3.app.Application)
  (:import com.jme3.texture.FrameBuffer)
  (:import (com.jme3.scene Node Spatial)))
#+end_src

The example code will create two videos of the same rotating object
from different angles. It can be used both for stereoscopic vision
simulation and for simulating multiple creatures, each with their own
sense of vision.
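
To tie the pieces together, here is a hypothetical wiring of
=add-camera!= with a =BufferedImage=-based continuation, assuming
=add-camera!= hands the continuation to =vision-pipeline= internally;
=save-frame!= is a made-up sink standing in for any =BufferedImage=
consumer:

#+begin_src clojure
(comment
  ;; watch the world through its default camera, pulling every frame
  ;; into a BufferedImage on the CPU
  (add-camera!
   world (.getCamera world)
   (fn [r fb bb bi]
     (save-frame! (BufferedImage! r fb bb bi)))))
#+end_src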