Mercurial > cortex
view org/sense.org @ 213:319963720179
fleshing out vision
author | Robert McIntyre <rlm@mit.edu> |
---|---|
date | Thu, 09 Feb 2012 08:11:10 -0700 |
parents | 97b8caf66824 |
children | f283c62bd212 |
line wrap: on
line source
1 #+title: Helper Functions / Motivations2 #+author: Robert McIntyre3 #+email: rlm@mit.edu4 #+description: sensory utilities5 #+keywords: simulation, jMonkeyEngine3, clojure, simulated senses6 #+SETUPFILE: ../../aurellem/org/setup.org7 #+INCLUDE: ../../aurellem/org/level-0.org10 * Blender Utilities11 In blender, any object can be assigned an arbitray number of key-value12 pairs which are called "Custom Properties". These are accessable in13 jMonkyeEngine when blender files are imported with the14 =BlenderLoader=. =(meta-data)= extracts these properties.16 #+name: blender-117 #+begin_src clojure18 (defn meta-data19 "Get the meta-data for a node created with blender."20 [blender-node key]21 (if-let [data (.getUserData blender-node "properties")]22 (.findValue data key) nil))23 #+end_src25 Blender uses a different coordinate system than jMonkeyEngine so it26 is useful to be able to convert between the two. These only come into27 play when the meta-data of a node refers to a vector in the blender28 coordinate system.30 #+name: blender-231 #+begin_src clojure32 (defn jme-to-blender33 "Convert from JME coordinates to Blender coordinates"34 [#^Vector3f in]35 (Vector3f. (.getX in) (- (.getZ in)) (.getY in)))37 (defn blender-to-jme38 "Convert from Blender coordinates to JME coordinates"39 [#^Vector3f in]40 (Vector3f. (.getX in) (.getZ in) (- (.getY in))))41 #+end_src43 * Sense Topology45 Human beings are three-dimensional objects, and the nerves that46 transmit data from our various sense organs to our brain are47 essentially one-dimensional. This leaves up to two dimensions in which48 our sensory information may flow. For example, imagine your skin: it49 is a two-dimensional surface around a three-dimensional object (your50 body). It has discrete touch sensors embedded at various points, and51 the density of these sensors corresponds to the sensitivity of that52 region of skin. Each touch sensor connects to a nerve, all of which53 eventually are bundled together as they travel up the spinal cord to54 the brain. Intersect the spinal nerves with a guillotining plane and55 you will see all of the sensory data of the skin revealed in a roughly56 circular two-dimensional image which is the cross section of the57 spinal cord. Points on this image that are close together in this58 circle represent touch sensors that are /probably/ close together on59 the skin, although there is of course some cutting and rerangement60 that has to be done to transfer the complicated surface of the skin61 onto a two dimensional image.63 Most human senses consist of many discrete sensors of various64 properties distributed along a surface at various densities. For65 skin, it is Pacinian corpuscles, Meissner's corpuscles, Merkel's66 disks, and Ruffini's endings, which detect pressure and vibration of67 various intensities. For ears, it is the stereocilia distributed68 along the basilar membrane inside the cochlea; each one is sensitive69 to a slightly different frequency of sound. For eyes, it is rods70 and cones distributed along the surface of the retina. In each case,71 we can describe the sense with a surface and a distribution of sensors72 along that surface.74 ** UV-maps76 Blender and jMonkeyEngine already have support for exactly this sort77 of data structure because it is used to "skin" models for games. It is78 called [[http://wiki.blender.org/index.php/Doc:2.6/Manual/Textures/Mapping/UV][UV-mapping]]. The three-dimensional surface of a model is cut79 and smooshed until it fits on a two-dimensional image. You paint80 whatever you want on that image, and when the three-dimensional shape81 is rendered in a game the smooshing and cutting us reversed and the82 image appears on the three-dimensional object.84 To make a sense, interpret the UV-image as describing the distribution85 of that senses sensors. To get different types of sensors, you can86 either use a different color for each type of sensor, or use multiple87 UV-maps, each labeled with that sensor type. I generally use a white88 pixel to mean the presense of a sensor and a black pixel to mean the89 absense of a sensor, and use one UV-map for each sensor-type within a90 given sense. The paths to the images are not stored as the actual91 UV-map of the blender object but are instead referenced in the92 meta-data of the node.94 #+CAPTION: The UV-map for an enlongated icososphere. The white dots each represent a touch sensor. They are dense in the regions that describe the tip of the finger, and less dense along the dorsal side of the finger opposite the tip.95 #+ATTR_HTML: width="300"96 [[../images/finger-UV.png]]98 #+CAPTION: Ventral side of the UV-mapped finger. Notice the density of touch sensors at the tip.99 #+ATTR_HTML: width="300"100 [[../images/finger-1.png]]102 #+CAPTION: Side view of the UV-mapped finger.103 #+ATTR_HTML: width="300"104 [[../images/finger-2.png]]106 #+CAPTION: Head on view of the finger. In both the head and side views you can see the divide where the touch-sensors transition from high density to low density.107 #+ATTR_HTML: width="300"108 [[../images/finger-3.png]]110 The following code loads images and gets the locations of the white111 pixels so that they can be used to create senses. =(load-image)= finds112 images using jMonkeyEngine's asset-manager, so the image path is113 expected to be relative to the =assets= directory. Thanks to Dylan114 for the beautiful version of =(filter-pixels)=.116 #+name: topology-1117 #+begin_src clojure118 (defn load-image119 "Load an image as a BufferedImage using the asset-manager system."120 [asset-relative-path]121 (ImageToAwt/convert122 (.getImage (.loadTexture (asset-manager) asset-relative-path))123 false false 0))125 (def white 0xFFFFFF)127 (defn white? [rgb]128 (= (bit-and white rgb) white))130 (defn filter-pixels131 "List the coordinates of all pixels matching pred, within the bounds132 provided. If bounds are not specified then the entire image is133 searched.134 bounds -> [x0 y0 width height]"135 {:author "Dylan Holmes"}136 ([pred #^BufferedImage image]137 (filter-pixels pred image [0 0 (.getWidth image) (.getHeight image)]))138 ([pred #^BufferedImage image [x0 y0 width height]]139 ((fn accumulate [x y matches]140 (cond141 (>= y (+ height y0)) matches142 (>= x (+ width x0)) (recur 0 (inc y) matches)143 (pred (.getRGB image x y))144 (recur (inc x) y (conj matches [x y]))145 :else (recur (inc x) y matches)))146 x0 y0 [])))148 (defn white-coordinates149 "Coordinates of all the white pixels in a subset of the image."150 ([#^BufferedImage image bounds]151 (filter-pixels white? image bounds))152 ([#^BufferedImage image]153 (filter-pixels white? image)))154 #+end_src156 ** Topology158 Information from the senses is transmitted to the brain via bundles of159 axons, whether it be the optic nerve or the spinal cord. While these160 bundles more or less perserve the overall topology of a sense's161 two-dimensional surface, they do not perserve the percise euclidean162 distances between every sensor. =(collapse)= is here to smoosh the163 sensors described by a UV-map into a contigous region that still164 perserves the topology of the original sense.166 #+name: topology-2167 #+begin_src clojure168 (defn average [coll]169 (/ (reduce + coll) (count coll)))171 (defn collapse-1d172 "One dimensional analogue of collapse."173 [center line]174 (let [length (count line)175 num-above (count (filter (partial < center) line))176 num-below (- length num-above)]177 (range (- center num-below)178 (+ center num-above))))180 (defn collapse181 "Take a set of pairs of integers and collapse them into a182 contigous bitmap with no \"holes\"."183 [points]184 (if (empty? points) []185 (let186 [num-points (count points)187 center (vector188 (int (average (map first points)))189 (int (average (map first points))))190 flattened191 (reduce192 concat193 (map194 (fn [column]195 (map vector196 (map first column)197 (collapse-1d (second center)198 (map second column))))199 (partition-by first (sort-by first points))))200 squeezed201 (reduce202 concat203 (map204 (fn [row]205 (map vector206 (collapse-1d (first center)207 (map first row))208 (map second row)))209 (partition-by second (sort-by second flattened))))210 relocated211 (let [min-x (apply min (map first squeezed))212 min-y (apply min (map second squeezed))]213 (map (fn [[x y]]214 [(- x min-x)215 (- y min-y)])216 squeezed))]217 relocated)))218 #+end_src219 * Viewing Sense Data221 It's vital to /see/ the sense data to make sure that everything is222 behaving as it should. =(view-sense)= and its helper, =(view-image)=223 are here so that each sense can define its own way of turning224 sense-data into pictures, while the actual rendering of said pictures225 stays in one central place. =(points->image)= helps senses generate a226 base image onto which they can overlay actual sense data.228 #+name: view-senses229 #+begin_src clojure230 (in-ns 'cortex.sense)232 (defn view-image233 "Initailizes a JPanel on which you may draw a BufferedImage.234 Returns a function that accepts a BufferedImage and draws it to the235 JPanel. If given a directory it will save the images as png files236 starting at 0000000.png and incrementing from there."237 ([#^File save]238 (let [idx (atom -1)239 image240 (atom241 (BufferedImage. 1 1 BufferedImage/TYPE_4BYTE_ABGR))242 panel243 (proxy [JPanel] []244 (paint245 [graphics]246 (proxy-super paintComponent graphics)247 (.drawImage graphics @image 0 0 nil)))248 frame (JFrame. "Display Image")]249 (SwingUtilities/invokeLater250 (fn []251 (doto frame252 (-> (.getContentPane) (.add panel))253 (.pack)254 (.setLocationRelativeTo nil)255 (.setResizable true)256 (.setVisible true))))257 (fn [#^BufferedImage i]258 (reset! image i)259 (.setSize frame (+ 8 (.getWidth i)) (+ 28 (.getHeight i)))260 (.repaint panel 0 0 (.getWidth i) (.getHeight i))261 (if save262 (ImageIO/write263 i "png"264 (File. save (format "%07d.png" (swap! idx inc))))))))265 ([] (view-image nil)))267 (defn view-sense268 "Take a kernel that produces a BufferedImage from some sense data269 and return a function which takes a list of sense data, uses the270 kernel to convert to images, and displays those images, each in271 its own JFrame."272 [sense-display-kernel]273 (let [windows (atom [])]274 (fn [data]275 (if (> (count data) (count @windows))276 (reset!277 windows (map (fn [_] (view-image)) (range (count data)))))278 (dorun279 (map280 (fn [display datum]281 (display (sense-display-kernel datum)))282 @windows data)))))284 (defn points->image285 "Take a collection of points and visuliaze it as a BufferedImage."286 [points]287 (if (empty? points)288 (BufferedImage. 1 1 BufferedImage/TYPE_BYTE_BINARY)289 (let [xs (vec (map first points))290 ys (vec (map second points))291 x0 (apply min xs)292 y0 (apply min ys)293 width (- (apply max xs) x0)294 height (- (apply max ys) y0)295 image (BufferedImage. (inc width) (inc height)296 BufferedImage/TYPE_INT_RGB)]297 (dorun298 (for [x (range (.getWidth image))299 y (range (.getHeight image))]300 (.setRGB image x y 0xFF0000)))301 (dorun302 (for [index (range (count points))]303 (.setRGB image (- (xs index) x0) (- (ys index) y0) -1)))304 image)))306 (defn gray307 "Create a gray RGB pixel with R, G, and B set to num. num must be308 between 0 and 255."309 [num]310 (+ num311 (bit-shift-left num 8)312 (bit-shift-left num 16)))313 #+end_src315 * Building a Sense from Nodes316 My method for defining senses in blender is the following:318 Senses like vision and hearing are localized to a single point319 and follow a particular object around. For these:321 - Create a single top-level empty node whose name is the name of the sense322 - Add empty nodes which each contain meta-data relevant323 to the sense, including a UV-map describing the number/distribution324 of sensors if applicipable.325 - Make each empty-node the child of the top-level326 node. =(sense-nodes)= below generates functions to find these children.328 For touch, store the path to the UV-map which describes touch-sensors in the329 meta-data of the object to which that map applies.331 Each sense provides code that analyzes the Node structure of the332 creature and creates sense-functions. They also modify the Node333 structure if necessary.335 Empty nodes created in blender have no appearance or physical presence336 in jMonkeyEngine, but do appear in the scene graph. Empty nodes that337 represent a sense which "follows" another geometry (like eyes and338 ears) follow the closest physical object. =(closest-node)= finds this339 closest object given the Creature and a particular empty node.341 #+name: node-1342 #+begin_src clojure343 (defn sense-nodes344 "For some senses there is a special empty blender node whose345 children are considered markers for an instance of that sense. This346 function generates functions to find those children, given the name347 of the special parent node."348 [parent-name]349 (fn [#^Node creature]350 (if-let [sense-node (.getChild creature parent-name)]351 (seq (.getChildren sense-node))352 (do (println-repl "could not find" parent-name "node") []))))354 (defn closest-node355 "Return the physical node in creature which is closest to the given356 node."357 [#^Node creature #^Node empty]358 (loop [radius (float 0.01)]359 (let [results (CollisionResults.)]360 (.collideWith361 creature362 (BoundingBox. (.getWorldTranslation empty)363 radius radius radius)364 results)365 (if-let [target (first results)]366 (.getGeometry target)367 (recur (float (* 2 radius)))))))369 (defn world-to-local370 "Convert the world coordinates into coordinates relative to the371 object (i.e. local coordinates), taking into account the rotation372 of object."373 [#^Spatial object world-coordinate]374 (.worldToLocal object world-coordinate nil))376 (defn local-to-world377 "Convert the local coordinates into world relative coordinates"378 [#^Spatial object local-coordinate]379 (.localToWorld object local-coordinate nil))380 #+end_src382 ** Sense Binding384 =(bind-sense)= binds either a Camera or a Listener object to any385 object so that they will follow that object no matter how it386 moves. It is used to create both eyes and ears.388 #+name: node-2389 #+begin_src clojure390 (defn bind-sense391 "Bind the sense to the Spatial such that it will maintain its392 current position relative to the Spatial no matter how the spatial393 moves. 'sense can be either a Camera or Listener object."394 [#^Spatial obj sense]395 (let [sense-offset (.subtract (.getLocation sense)396 (.getWorldTranslation obj))397 initial-sense-rotation (Quaternion. (.getRotation sense))398 base-anti-rotation (.inverse (.getWorldRotation obj))]399 (.addControl400 obj401 (proxy [AbstractControl] []402 (controlUpdate [tpf]403 (let [total-rotation404 (.mult base-anti-rotation (.getWorldRotation obj))]405 (.setLocation406 sense407 (.add408 (.mult total-rotation sense-offset)409 (.getWorldTranslation obj)))410 (.setRotation411 sense412 (.mult total-rotation initial-sense-rotation))))413 (controlRender [_ _])))))414 #+end_src416 Here is some example code which shows how a camera bound to a blue box417 with =(bind-sense)= moves as the box is buffeted by white cannonballs.419 #+name: test420 #+begin_src clojure421 (defn test-bind-sense422 "Show a camera that stays in the same relative position to a blue423 cube."424 []425 (let [eye-pos (Vector3f. 0 30 0)426 rock (box 1 1 1 :color ColorRGBA/Blue427 :position (Vector3f. 0 10 0)428 :mass 30)429 table (box 3 1 10 :color ColorRGBA/Gray :mass 0430 :position (Vector3f. 0 -3 0))]431 (world432 (nodify [rock table])433 standard-debug-controls434 (fn init [world]435 (let [cam (doto (.clone (.getCamera world))436 (.setLocation eye-pos)437 (.lookAt Vector3f/ZERO438 Vector3f/UNIT_X))]439 (bind-sense rock cam)440 (.setTimer world (RatchetTimer. 60))441 (Capture/captureVideo442 world (File. "/home/r/proj/cortex/render/bind-sense0"))443 (add-camera!444 world cam445 (comp (view-image446 (File. "/home/r/proj/cortex/render/bind-sense1"))447 BufferedImage!))448 (add-camera! world (.getCamera world) no-op)))449 no-op)))450 #+end_src452 #+begin_html453 <video controls="controls" width="755">454 <source src="../video/bind-sense.ogg" type="video/ogg"455 preload="none" poster="../images/aurellem-1280x480.png" />456 </video>457 #+end_html459 With this, eyes are easy --- you just bind the camera closer to the460 desired object, and set it to look outward instead of inward as it461 does in the video.463 (nb : the video was created with the following commands)465 *** Combine Frames with ImageMagick466 #+begin_src clojure :results silent467 (in-ns 'user)468 (import java.io.File)469 (use 'clojure.contrib.shell-out)470 (let471 [idx (atom -1)472 left (rest473 (sort474 (file-seq (File. "/home/r/proj/cortex/render/bind-sense0/"))))475 right (rest476 (sort477 (file-seq478 (File. "/home/r/proj/cortex/render/bind-sense1/"))))479 sub (rest480 (sort481 (file-seq482 (File. "/home/r/proj/cortex/render/bind-senseB/"))))483 sub* (concat sub (repeat 1000 (last sub)))]484 (dorun485 (map486 (fn [im-1 im-2 sub]487 (sh "convert" (.getCanonicalPath im-1)488 (.getCanonicalPath im-2) "+append"489 (.getCanonicalPath sub) "-append"490 (.getCanonicalPath491 (File. "/home/r/proj/cortex/render/bind-sense/"492 (format "%07d.png" (swap! idx inc))))))493 left right sub*)))494 #+end_src496 *** Encode Frames with ffmpeg498 #+begin_src sh :results silent499 cd /home/r/proj/cortex/render/500 ffmpeg -r 60 -b 9000k -i bind-sense/%07d.png bind-sense.ogg501 #+end_src503 * Headers504 #+name: sense-header505 #+begin_src clojure506 (ns cortex.sense507 "Here are functions useful in the construction of two or more508 sensors/effectors."509 {:author "Robert McInytre"}510 (:use (cortex world util))511 (:import ij.process.ImageProcessor)512 (:import jme3tools.converters.ImageToAwt)513 (:import java.awt.image.BufferedImage)514 (:import com.jme3.collision.CollisionResults)515 (:import com.jme3.bounding.BoundingBox)516 (:import (com.jme3.scene Node Spatial))517 (:import com.jme3.scene.control.AbstractControl)518 (:import (com.jme3.math Quaternion Vector3f))519 (:import javax.imageio.ImageIO)520 (:import java.io.File)521 (:import (javax.swing JPanel JFrame SwingUtilities)))522 #+end_src524 #+name: test-header525 #+begin_src clojure526 (ns cortex.test.sense527 (:use (cortex world util sense vision))528 (:import529 java.io.File530 (com.jme3.math Vector3f ColorRGBA)531 (com.aurellem.capture RatchetTimer Capture)))532 #+end_src534 * Source Listing535 - [[../src/cortex/sense.clj][cortex.sense]]536 - [[../src/cortex/test/sense.clj][cortex.test.sense]]537 - [[../assets/Models/subtitles/subtitles.blend][subtitles.blend]]538 - [[../assets/Models/subtitles/Lake_CraterLake03_sm.hdr][subtitles reflection map]]539 #+html: <ul> <li> <a href="../org/sense.org">This org file</a> </li> </ul>541 * Next542 Now that some of the preliminaries are out of the way, in the [[./body.org][next543 post]] I'll create a simulated body.546 * COMMENT generate source547 #+begin_src clojure :tangle ../src/cortex/sense.clj548 <<sense-header>>549 <<blender-1>>550 <<blender-2>>551 <<topology-1>>552 <<topology-2>>553 <<node-1>>554 <<node-2>>555 <<view-senses>>556 #+end_src558 #+begin_src clojure :tangle ../src/cortex/test/sense.clj559 <<test-header>>560 <<test>>561 #+end_src