Multiscale Sensor Fusion and Continuous Control with Neural CDEs
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)(2022)
摘要
Though robot learning is often formulated in terms of discrete-time Markov decision processes (MDPs), physical robots require near-continuous multiscale feedback control. Machines operate on multiple asynchronous sensing modalities, each with different frequencies, e.g., video frames at 30Hz, proprioceptive state at 100Hz, force-torque data at 500Hz, etc. While the classic approach is to batch observations into fixed-time windows then pass them through feed-forward encoders (e.g., with deep networks), we show that there exists a more elegant approach - one that treats policy learning as modeling latent state dynamics in continuous-time. Specifically, we present InFuser, a unified architecture that trains continuous time-policies with Neural Controlled Differential Equations (CDEs). InFuser evolves a single latent state representation over time by (In)tegrating and (Fus)ing multi-sensory observations (arriving at different frequencies), and inferring actions in continuous-time. This enables policies that can react to multi-frequency multi-sensory feedback for truly end-to-end visuomotor control, without discrete-time assumptions. Behavior cloning experiments demonstrate that InFuser learns robust policies for dynamic tasks (e.g., swinging a ball into a cup) notably outperforming several baselines in settings where observations from one sensing modality can arrive at much sparser intervals than others.
更多查看译文
关键词
continuous control,discrete-time assumptions,discrete-time Markov decision processes,fixed-time windows,force-torque data,frequency 100.0 Hz,frequency 30.0 Hz,frequency 500.0 Hz,InFuser,latent state dynamics,multifrequency multisensory feedback,multiple asynchronous sensing modalities,multiscale sensor fusion,near-continuous multiscale feedback control,Neural CDEs,neural controlled differential equations,policy learning,proprioceptive state,robot learning,robust policies,sensing modality,single latent state representation,video frames,visuomotor control
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn