Connecting NeRFs, Images, and Text

Computer Vision and Pattern Recognition (2024)

Abstract

Neural Radiance Fields (NeRFs) have emerged as a standard framework for representing 3D scenes and objects, introducing a novel data type for information exchange and storage. Concurrently, significant progress has been made in multimodal representation learning for text and image data. This paper explores a novel research direction that aims to connect the NeRF modality with other modalities, similar to established methodologies for images and text. To this end, we propose a simple framework that exploits pre-trained models for NeRF representations alongside multimodal models for text and image processing. Our framework learns a bidirectional mapping between NeRF embeddings and those obtained from corresponding images and text. This mapping unlocks several novel and useful applications, including NeRF zero-shot classification and NeRF retrieval from images or text.
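
The zero-shot classification and retrieval applications mentioned in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the embeddings are synthetic random vectors rather than outputs of a pre-trained NeRF encoder or a multimodal text/image model, the mapping is a plain least-squares linear map standing in for the learned mapping (the abstract does not specify its form), only the NeRF-to-shared-space direction is shown, and all function names (`fit_linear_map`, `zero_shot_classify`, `retrieve_nerfs`, `run_demo`) are hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, eps)


def fit_linear_map(nerf_emb, target_emb):
    """Least-squares W with nerf_emb @ W ~= target_emb (stand-in for a learned mapping)."""
    W, _, _, _ = np.linalg.lstsq(nerf_emb, target_emb, rcond=None)
    return W


def zero_shot_classify(nerf_emb, W, class_text_emb):
    """Assign each NeRF to the class whose text embedding is most similar."""
    sims = l2_normalize(nerf_emb @ W) @ l2_normalize(class_text_emb).T
    return sims.argmax(axis=1)


def retrieve_nerfs(query_emb, gallery_nerf_emb, W, k=5):
    """Return indices of the k gallery NeRFs closest to a text or image query."""
    query = l2_normalize(query_emb[None, :])[0]
    sims = l2_normalize(gallery_nerf_emb @ W) @ query
    return np.argsort(-sims)[:k]


def run_demo(seed=0):
    """Build synthetic paired embeddings, fit the map, then classify and retrieve."""
    rng = np.random.default_rng(seed)
    d_nerf, d_shared, n_classes = 16, 8, 3
    # Assumed ground-truth relation between the two spaces (synthetic only).
    A = rng.normal(size=(d_shared, d_nerf))

    def fake_nerf_embeddings(shared):
        noise = 0.01 * rng.normal(size=(shared.shape[0], d_nerf))
        return shared @ A + noise

    # Paired training data: shared-space embeddings and matching NeRF embeddings.
    train_shared = rng.normal(size=(200, d_shared))
    W = fit_linear_map(fake_nerf_embeddings(train_shared), train_shared)

    # One text embedding per class; test NeRFs cluster around their class.
    class_text_emb = rng.normal(size=(n_classes, d_shared))
    labels = np.arange(30) % n_classes
    jitter = 0.05 * rng.normal(size=(30, d_shared))
    test_nerf = fake_nerf_embeddings(class_text_emb[labels] + jitter)

    preds = zero_shot_classify(test_nerf, W, class_text_emb)
    top = retrieve_nerfs(class_text_emb[0], test_nerf, W, k=5)
    return preds, labels, top
```

Calling `run_demo()` returns the predicted class per synthetic NeRF, the true labels, and the indices of the five gallery NeRFs ranked closest to the class-0 text query. With real data, the synthetic arrays would be replaced by embeddings from the pre-trained models, and the same cosine-similarity ranking would apply to image queries as well as text queries.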

Keywords

3D Computer Vision, Neural Fields
