Connecting NeRFs, Images, and Text
Computer Vision and Pattern Recognition(2024)
摘要
Neural Radiance Fields (NeRFs) have emerged as a standard framework forrepresenting 3D scenes and objects, introducing a novel data type forinformation exchange and storage. Concurrently, significant progress has beenmade in multimodal representation learning for text and image data. This paperexplores a novel research direction that aims to connect the NeRF modality withother modalities, similar to established methodologies for images and text. Tothis end, we propose a simple framework that exploits pre-trained models forNeRF representations alongside multimodal models for text and image processing.Our framework learns a bidirectional mapping between NeRF embeddings and thoseobtained from corresponding images and text. This mapping unlocks several noveland useful applications, including NeRF zero-shot classification and NeRFretrieval from images or text.
更多查看译文
关键词
3D Computer Vision,Neural Fields
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn