comparison thesis/cortex.org @ 440:b01c070b03d4

save for tonight.
author Robert McIntyre <rlm@mit.edu>
date Sun, 23 Mar 2014 23:43:20 -0400
parents 97dc719fd1ac
children c20de2267d39
comparison
equal deleted inserted replaced
439:97dc719fd1ac 440:b01c070b03d4
39 hard problem! What features can you describe in terms of low level 39 hard problem! What features can you describe in terms of low level
40 functions of pixels that can even begin to describe what is 40 functions of pixels that can even begin to describe what is
41 happening here? 41 happening here?
42 42
43 Or suppose that you are building a program that recognizes 43 Or suppose that you are building a program that recognizes
44 chairs. How could you ``see'' the chair in the following picture? 44 chairs. How could you ``see'' the chair in the following pictures?
45 45
46 #+caption: When you look at this, do you think ``chair''? I certainly do. 46 #+caption: When you look at this, do you think ``chair''? I certainly do.
47 #+ATTR_LaTeX: :width 10cm 47 #+ATTR_LaTeX: :width 10cm
48 [[./images/invisible-chair.png]] 48 [[./images/invisible-chair.png]]
49 49
50 #+caption: The chair in this image is quite obvious to humans, but I 50 #+caption: The chair in this image is quite obvious to humans, but I
51 #+caption: doubt that any computer program can find it. 51 #+caption: doubt that any computer program can find it.
52 #+ATTR_LaTeX: :width 10cm 52 #+ATTR_LaTeX: :width 10cm
53 [[./images/fat-person-sitting-at-desk.jpg]] 53 [[./images/fat-person-sitting-at-desk.jpg]]
54 54
55 Finally, how is it that you can easily tell the difference between
56 how the girls /muscles/ are working in \ref{girl}?
55 57
56 I think humans are able to label 58 #+caption: The mysterious ``common sense'' appears here as you are able
57 such video as "drinking" because they imagine /themselves/ as the 59 #+caption: to ``see'' the difference in how the girl's arm muscles
58 cat, and imagine putting their face up against a stream of water and 60 #+caption: are activated differently in the two images.
59 sticking out their tongue. In that imagined world, they can feel the 61 #+name: girl
60 cool water hitting their tongue, and feel the water entering their 62 #+ATTR_LaTeX: :width 10cm
61 body, and are able to recognize that /feeling/ as drinking. So, the 63 [[./images/wall-push.png]]
62 label of the action is not really in the pixels of the image, but is 64
63 found clearly in a simulation inspired by those pixels. An 65
64 imaginative system, having been trained on drinking and non-drinking 66 These problems are difficult because the language of pixels is far
65 examples and learning that the most important component of drinking 67 removed from what we would consider to be an acceptable description
66 is the feeling of water sliding down one's throat, would analyze a 68 of the events in these images. In order to process them, we must
67 video of a cat drinking in the following manner: 69 raise the images into some higher level of abstraction where their
70 descriptions become more similar to how we would describe them in
71 English. The question is, how can we raise
72
73
74 I think humans are able to label such video as "drinking" because
75 they imagine /themselves/ as the cat, and imagine putting their face
76 up against a stream of water and sticking out their tongue. In that
77 imagined world, they can feel the cool water hitting their tongue,
78 and feel the water entering their body, and are able to recognize
79 that /feeling/ as drinking. So, the label of the action is not
80 really in the pixels of the image, but is found clearly in a
81 simulation inspired by those pixels. An imaginative system, having
82 been trained on drinking and non-drinking examples and learning that
83 the most important component of drinking is the feeling of water
84 sliding down one's throat, would analyze a video of a cat drinking
85 in the following manner:
68 86
69 - Create a physical model of the video by putting a "fuzzy" model 87 - Create a physical model of the video by putting a "fuzzy" model
70 of its own body in place of the cat. Also, create a simulation of 88 of its own body in place of the cat. Also, create a simulation of
71 the stream of water. 89 the stream of water.
72 90
78 - The action is now easily identified as drinking by the sense of 96 - The action is now easily identified as drinking by the sense of
79 taste alone. The other senses (such as the tongue moving in and 97 taste alone. The other senses (such as the tongue moving in and
80 out) help to give plausibility to the simulated action. Note that 98 out) help to give plausibility to the simulated action. Note that
81 the sense of vision, while critical in creating the simulation, 99 the sense of vision, while critical in creating the simulation,
82 is not critical for identifying the action from the simulation. 100 is not critical for identifying the action from the simulation.
83
84
85
86
87
88
89 101
90 cat drinking, mimes, leaning, common sense 102 cat drinking, mimes, leaning, common sense
91 103
92 ** =EMPATH= neatly solves recognition problems 104 ** =EMPATH= neatly solves recognition problems
93 105
117 129
118 ** Hearing is hard; =CORTEX= does it right 130 ** Hearing is hard; =CORTEX= does it right
119 131
120 ** Touch uses hundreds of hair-like elements 132 ** Touch uses hundreds of hair-like elements
121 133
122 ** Proprioception is the force that makes everything ``real'' 134 ** Proprioception is the sense that makes everything ``real''
123 135
124 ** Muscles are both effectors and sensors 136 ** Muscles are both effectors and sensors
125 137
126 ** =CORTEX= brings complex creatures to life! 138 ** =CORTEX= brings complex creatures to life!
127 139
137 149
138 ** \Phi-space describes the worm's experiences 150 ** \Phi-space describes the worm's experiences
139 151
140 ** Empathy is the process of tracing though \Phi-space 152 ** Empathy is the process of tracing though \Phi-space
141 153
142 ** Efficient action recognition via empathy 154 ** Efficient action recognition =EMPATH=
143 155
144 * Contributions 156 * Contributions
145 - Built =CORTEX=, a comprehensive platform for embodied AI 157 - Built =CORTEX=, a comprehensive platform for embodied AI
146 experiments. Has many new features lacking in other systems, such 158 experiments. Has many new features lacking in other systems, such
147 as sound. Easy to model/create new creatures. 159 as sound. Easy to model/create new creatures.