ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing
CVPR 2024(2024)
摘要
This paper proposes ConsistDreamer - a novel framework that lifts 2Ddiffusion models with 3D awareness and 3D consistency, thus enablinghigh-fidelity instruction-guided scene editing. To overcome the fundamentallimitation of missing 3D consistency in 2D diffusion models, our key insight isto introduce three synergetic strategies that augment the input of the 2Ddiffusion model to become 3D-aware and to explicitly enforce 3D consistencyduring the training process. Specifically, we design surrounding views ascontext-rich input for the 2D diffusion model, and generate 3D-consistent,structured noise instead of image-independent noise. Moreover, we introduceself-supervised consistency-enforcing training within the per-scene editingprocedure. Extensive evaluation shows that our ConsistDreamer achievesstate-of-the-art performance for instruction-guided scene editing acrossvarious scenes and editing instructions, particularly in complicatedlarge-scale indoor scenes from ScanNet++, with significantly improved sharpnessand fine-grained textures. Notably, ConsistDreamer stands as the first workcapable of successfully editing complex (e.g., plaid/checkered) patterns. Ourproject page is at immortalco.github.io/ConsistDreamer.
更多查看译文
关键词
2D Diffusion,Scene Editing,Diffusion Model,Noise Structure,Project Page,Synergistic Strategy,Consistent Results,Gaussian Noise,Point Cloud,Kullback-Leibler,General Education,Latent Space,Small Imaging,3D Information,Consistency Loss,3D Scene,Single View,Outdoor Scenes,Style Transfer,Multi-view Images,Main View,Fréchet Inception Distance,Edit Operations,Van Gogh,Original View,Scene Geometry,Objects In The Scene
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn