cortex: thesis/org/roadmap.org annotate

annotate thesis/org/roadmap.org @ 553:20f64a70f8c5

saving image and caption mods suggested by winston.

author	Robert McIntyre <rlm@mit.edu>
date	Fri, 02 May 2014 14:08:09 -0400
parents	8e52a2802821
children	6a61b637a4c5

rev	line source
rlm@401	1 In order for this to be a reasonable thesis that I can be proud of,
rlm@401	2 what is the /minimum/ number of things I need to get done?
rlm@401	3
rlm@401	4
rlm@401	5 * worm OR hand registration
rlm@401	6 - training from a few examples (2 to start out)
rlm@401	7 - aligning the body with the scene
rlm@401	8 - generating sensory data
rlm@401	9 - matching previous labeled examples using dot-products or some
rlm@401	10 other basic thing
rlm@401	11 - showing that it works with different views
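
A minimal sketch of the dot-product matching step above, assuming each labeled example has already been reduced to a flat vector of sensor values. The names =normalize=, =best_match= and the example vectors are hypothetical illustrations, not code from cortex.

```python
import math

def normalize(v):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0:
        return list(v)
    return [x / norm for x in v]

def best_match(query, labeled_examples):
    """Return (label, score) for the example with the highest
    normalized dot product against the query vector."""
    q = normalize(query)
    best_label, best_score = None, -math.inf
    for label, vec in labeled_examples:
        score = sum(a * b for a, b in zip(q, normalize(vec)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score

# Two hypothetical training examples, as in "2 to start out".
examples = [
    ("curled",   [1.0, 0.9, 0.0, 0.1]),
    ("straight", [0.0, 0.1, 1.0, 0.9]),
]
label, score = best_match([0.9, 1.0, 0.1, 0.0], examples)
```

Normalizing before the dot product makes the score depend on the pattern of activation rather than its overall magnitude, which may or may not be what the real sensory data calls for.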
rlm@401	12
rlm@401	13 * first draft
rlm@401	14 - draft of thesis without bibliography or formatting
rlm@401	15 - should have basic experiment and have full description of
rlm@401	16 framework with code
rlm@401	17 - review with Winston
rlm@401	18
rlm@401	19 * final draft
rlm@401	20 - implement stretch goals from Winston if possible
rlm@401	21 - complete final formatting and submit
rlm@401	22
rlm@401	23 * CORTEX
rlm@401	24 DEADLINE: <2014-05-09 Fri>
rlm@401	25 SHIT THAT'S IN 67 DAYS!!!
rlm@401	26
rlm@403	27 ** program simple feature matching code for the worm's segments
rlm@403	28
rlm@401	29 Subgoals:
rlm@401	30 *** DONE Get cortex working again, run tests, no jmonkeyengine updates
rlm@401	31 CLOSED: [2014-03-03 Mon 22:07] SCHEDULED: <2014-03-03 Mon>
rlm@401	32 *** DONE get blender working again
rlm@401	33 CLOSED: [2014-03-03 Mon 22:43] SCHEDULED: <2014-03-03 Mon>
rlm@401	34 *** DONE make sparse touch worm segment in blender
rlm@401	35 CLOSED: [2014-03-03 Mon 23:16] SCHEDULED: <2014-03-03 Mon>
rlm@401	36 CLOCK: [2014-03-03 Mon 22:44]--[2014-03-03 Mon 23:16] => 0:32
rlm@401	37 *** DONE make multi-segment touch worm with touch sensors and display
rlm@401	38 CLOSED: [2014-03-03 Mon 23:54] SCHEDULED: <2014-03-03 Mon>
rlm@401	39
rlm@401	40 *** DONE Make a worm wiggle and curl
rlm@401	41 CLOSED: [2014-03-04 Tue 23:03] SCHEDULED: <2014-03-04 Tue>
rlm@403	42
rlm@401	43
rlm@401	44 ** First draft
rlm@403	45
rlm@401	46 Subgoals:
rlm@401	47 *** Write up new worm experiments.
rlm@401	48 *** Triage implementation code and get it into chapter form.
rlm@401	49
rlm@401	50
rlm@401	51
rlm@401	52
rlm@401	53
rlm@401	54 ** for today
rlm@401	55
rlm@401	56 - guided worm :: control the worm with the keyboard. Useful for
rlm@401	57 testing the body-centered recog scripts, and for
rlm@401	58 preparing a cool demo video.
rlm@401	59
rlm@401	60 - body-centered recognition :: detect actions using hard coded
rlm@401	61 body-centered scripts.
rlm@401	62
rlm@401	63 - cool demo video of the worm being moved and recognizing things ::
rlm@401	64 will be a neat part of the thesis.
rlm@401	65
rlm@401	66 - thesis export :: refactoring and organization of code so that it
rlm@401	67 spits out a thesis in addition to the web page.
rlm@401	68
rlm@401	69 - video alignment :: analyze the frames of a video in order to align
rlm@401	70 the worm. Requires body-centered recognition. Can "cheat".
rlm@401	71
rlm@401	72 - smoother actions :: use debugging controls to directly influence the
rlm@401	73 demo actions, and to generate recognition procedures.
rlm@401	74
rlm@401	75 - degenerate video demonstration :: show the system recognizing a
rlm@401	76 curled worm from dead on. Crowning achievement of thesis.
rlm@401	77
rlm@401	78 ** Ordered from easiest to hardest
rlm@401	79
rlm@401	80 Just report the positions of everything. I don't think that this
rlm@401	81 necessarily shows anything useful.
rlm@401	82
rlm@401	83 Worm-segment vision -- you initialize a view of the worm, but instead
rlm@401	84 of pixels you use labels via ray tracing. Has the advantage of still
rlm@401	85 allowing for visual occlusion, but reliably identifies the objects,
rlm@401	86 even without rainbow coloring. You can code this as an image.
rlm@401	87
rlm@401	88 Same as above, except just with worm/non-worm labels.
rlm@401	89
rlm@401	90 Color code each worm segment and then recognize them using blob
rlm@401	91 detectors. Then you solve for the perspective and the action
rlm@401	92 simultaneously.
rlm@401	93
rlm@401	94 The entire worm can be colored the same, high contrast color against a
rlm@401	95 nearly black background.
rlm@401	96
rlm@401	97 "Rooted" vision. You give the exact coordinates of ONE piece of the
rlm@401	98 worm, but the algorithm figures out the rest.
rlm@401	99
rlm@401	100 More rooted vision -- start off the entire worm with one position.
rlm@401	101
rlm@401	102 The right way to do alignment is to use motion over multiple frames to
rlm@401	103 snap individual pieces of the model into place, sharing and
rlm@401	104 propagating the individual alignments over the whole model. We also
rlm@401	105 want to limit the alignment search to just those actions we are
rlm@401	106 prepared to identify. This might mean that I need some small "micro
rlm@401	107 actions" such as the individual movements of the worm pieces.
rlm@401	108
rlm@401	109 Get just the centers of each segment projected onto the imaging
rlm@401	110 plane. (best so far).
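
A rough sketch of projecting segment centers onto an imaging plane, assuming a simple pinhole camera at the origin looking down +z. The camera model, =focal_length= parameter and function names are assumptions for illustration; the notes do not say which projection cortex uses.

```python
def project_point(point, focal_length=1.0):
    """Pinhole projection of a 3D point (x, y, z) onto the plane z = focal_length.
    Returns None for points at or behind the camera."""
    x, y, z = point
    if z <= 0:
        return None
    return (focal_length * x / z, focal_length * y / z)

def project_centers(centers, focal_length=1.0):
    """Project every segment center; one 2D point (or None) per segment."""
    return [project_point(c, focal_length) for c in centers]

# Hypothetical segment centers in camera coordinates.
segment_centers = [(0.0, 0.0, 2.0), (1.0, 0.5, 4.0), (0.0, 0.0, -1.0)]
projected = project_centers(segment_centers)
```

The result is one 2D point per segment, which is a much smaller search target than a full rendered image.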
rlm@401	111
rlm@401	112
rlm@401	113 Repertoire of actions + video frames -->
rlm@401	114 directed multi-frame-search alg
rlm@401	115
rlm@401	116
rlm@401	117
rlm@401	118
rlm@401	119
rlm@401	120
rlm@401	121 !! Could also have a bounding box around the worm provided by
rlm@401	122 filtering the worm/non-worm render, and use bbbgs. As a bonus, I get
rlm@401	123 to include bbbgs in my thesis! Could finally do that recursive thing
rlm@401	124 where I make bounding boxes be those things that give results that
rlm@401	125 give good bounding boxes. If I did this I could use a disruptive
rlm@401	126 pattern on the worm.
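
A small sketch of getting a bounding box by filtering a worm/non-worm render, assuming the render is available as a 2D grid where truthy cells are worm pixels. The =bounding_box= helper and the sample mask are hypothetical; this does not implement bbbgs itself.

```python
def bounding_box(mask):
    """Return (min_row, min_col, max_row, max_col) covering every truthy
    cell in a 2D mask, or None if the mask contains no worm pixels."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, v in enumerate(row) if v]
    if not rows:
        return None
    return (min(rows), min(cols), max(rows), max(cols))

# 1 = worm, 0 = non-worm (made-up render for illustration).
mask = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
box = bounding_box(mask)
```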
rlm@401	127
rlm@401	128 Reimagining using default textures is very simple for this system,
rlm@401	129 but hard for others.
rlm@401	130
rlm@401	131
rlm@401	132 Want to demonstrate, at minimum, alignment of some model of the worm
rlm@401	133 to the video, and a lookup of the action by simulated perception.
rlm@401	134
rlm@401	135 note: the purple/white points is a very beautiful texture, because
rlm@401	136 when it moves slightly, the white dots look like they're
rlm@401	137 twinkling. Would look even better if it was a darker purple. Also
rlm@401	138 would look better more spread out.
rlm@401	139
rlm@401	140
rlm@401	141 embed assumption of one frame of view, search by moving around in
rlm@401	142 simulated world.
rlm@401	143
rlm@401	144 Allowed to limit search by setting limits to a hemisphere around the
rlm@401	145 imagined worm! This limits scale also.
rlm@401	146
rlm@401	147
rlm@401	148
rlm@401	149
rlm@401	150
rlm@401	151 !! Limited search with worm/non-worm rendering.
rlm@401	152 How much inverse kinematics do we have to do?
rlm@401	153 What about cached (allowed state-space) paths, derived from labeled
rlm@401	154 training. You have to lead from one to another.
rlm@401	155
rlm@401	156 What about initial state? Could start the input videos at a specific
rlm@401	157 state, then just match that explicitly.
rlm@401	158
rlm@401	159 !! The training doesn't have to be labeled -- you can just move around
rlm@401	160 for a while!!
rlm@401	161
rlm@401	162 !! Limited search with motion based alignment.
rlm@401	163
rlm@401	164
rlm@401	165
rlm@401	166
rlm@401	167 "play arounds" can establish a chain of linked sensoriums. Future
rlm@401	168 matches must fall into one of the already experienced things, and once
rlm@401	169 they do, it greatly limits the things that are possible in the future.
rlm@401	170
rlm@401	171
rlm@401	172 frame differences help to detect muscle exertion.
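
One way to sketch the frame-difference idea, assuming frames are small grayscale grids of numbers. The functions and the threshold are hypothetical; the code only flags frames that changed a lot from their predecessor, and treating that as a sign of muscle exertion is the hypothesis from the note above, not something the code establishes.

```python
def frame_difference(prev, curr):
    """Sum of absolute per-pixel differences between two equally sized frames."""
    return sum(
        abs(a - b)
        for prev_row, curr_row in zip(prev, curr)
        for a, b in zip(prev_row, curr_row)
    )

def moving_frames(frames, threshold):
    """Indexes of frames that differ from the previous frame by more than threshold."""
    return [
        i for i in range(1, len(frames))
        if frame_difference(frames[i - 1], frames[i]) > threshold
    ]

# Four tiny made-up frames: nothing moves, then two changes in a row.
frames = [
    [[0, 0], [0, 0]],
    [[0, 0], [0, 0]],
    [[0, 5], [0, 0]],
    [[0, 5], [3, 0]],
]
active = moving_frames(frames, threshold=2)
```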
rlm@401	173
rlm@401	174 Can try to match on a few "representative" frames. Can also just have
rlm@401	175 a few "bodies" in various states which we try to match.
rlm@401	176
rlm@401	177
rlm@401	178
rlm@401	179 Paths through state-space have the exact same signature as
rlm@401	180 simulation. BUT, these can be searched in parallel and don't interfere
rlm@401	181 with each other.
rlm@401	182
rlm@401	183
rlm@402	184
rlm@402	185
rlm@402	186 ** Final stretch up to First Draft
rlm@402	187
rlm@404	188 *** DONE complete debug control of worm
rlm@404	189 CLOSED: [2014-03-17 Mon 17:29] SCHEDULED: <2014-03-17 Mon>
rlm@404	190 CLOCK: [2014-03-17 Mon 14:01]--[2014-03-17 Mon 17:29] => 3:28
rlm@405	191 *** DONE add phi-space output to debug control
rlm@405	192 CLOSED: [2014-03-17 Mon 17:42] SCHEDULED: <2014-03-17 Mon>
rlm@405	193 CLOCK: [2014-03-17 Mon 17:31]--[2014-03-17 Mon 17:42] => 0:11
rlm@407	194
rlm@409	195 *** DONE complete automatic touch partitioning
rlm@409	196 CLOSED: [2014-03-18 Tue 21:43] SCHEDULED: <2014-03-18 Tue>
rlm@415	197 *** DONE complete cyclic predicate
rlm@415	198 CLOSED: [2014-03-19 Wed 16:34] SCHEDULED: <2014-03-18 Tue>
rlm@415	199 CLOCK: [2014-03-19 Wed 13:16]--[2014-03-19 Wed 16:34] => 3:18
rlm@415	200 *** DONE complete three phi-stream action predicates; test them with debug control
rlm@415	201 CLOSED: [2014-03-19 Wed 16:35] SCHEDULED: <2014-03-17 Mon>
rlm@415	202 CLOCK: [2014-03-18 Tue 18:36]--[2014-03-18 Tue 21:43] => 3:07
rlm@407	203 CLOCK: [2014-03-18 Tue 18:34]--[2014-03-18 Tue 18:36] => 0:02
rlm@407	204 CLOCK: [2014-03-17 Mon 19:19]--[2014-03-17 Mon 21:19] => 2:00
rlm@415	205 *** DONE build an automatic "do all the things" sequence.
rlm@415	206 CLOSED: [2014-03-19 Wed 16:55] SCHEDULED: <2014-03-19 Wed>
rlm@415	207 CLOCK: [2014-03-19 Wed 16:53]--[2014-03-19 Wed 16:55] => 0:02
rlm@417	208 *** DONE implement proprioception based movement lookup in phi-space
rlm@417	209 CLOSED: [2014-03-19 Wed 22:04] SCHEDULED: <2014-03-19 Wed>
rlm@417	210 CLOCK: [2014-03-19 Wed 19:32]--[2014-03-19 Wed 22:04] => 2:32
rlm@418	211 *** DONE make proprioception reference phi-space indexes
rlm@418	212 CLOSED: [2014-03-19 Wed 22:47] SCHEDULED: <2014-03-19 Wed>
rlm@417	213
rlm@415	214
rlm@420	215 *** DONE create test videos, also record positions of worm segments
rlm@420	216 CLOSED: [2014-03-20 Thu 22:02] SCHEDULED: <2014-03-19 Wed>
rlm@402	217
rlm@403	218 *** TODO Collect intro, worm-learn and cortex creation into draft thesis.
rlm@405	219
