GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs

CVPR 2024(2024)

引用 0|浏览30
摘要
Vision graph neural networks (ViG) offer a new avenue for exploration incomputer vision. A major bottleneck in ViGs is the inefficient k-nearestneighbor (KNN) operation used for graph construction. To solve this issue, wepropose a new method for designing ViGs, Dynamic Axial Graph Construction(DAGC), which is more efficient than KNN as it limits the number of consideredgraph connections made within an image. Additionally, we propose a novelCNN-GNN architecture, GreedyViG, which uses DAGC. Extensive experiments showthat GreedyViG beats existing ViG, CNN, and ViT architectures in terms ofaccuracy, GMACs, and parameters on image classification, object detection,instance segmentation, and semantic segmentation tasks. Our smallest model,GreedyViG-S, achieves 81.1Vision GNN and 2.2less GMACs and a similar number of parameters. Our largest model, GreedyViG-Bobtains 83.9decrease in parameters and a 69the same accuracy as ViHGNN with a 67.3decrease in GMACs. Our work shows that hybrid CNN-GNN architectures not onlyprovide a new avenue for designing efficient models, but that they can alsoexceed the performance of current state-of-the-art models.
更多
查看译文
关键词
Efficient Computer Vision,Deep Learning,Graph Neural Networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
0
您的评分 :

暂无评分

数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn