Discover and Mitigate Multiple Biased Subgroups in Image Classifiers

Computer Vision and Pattern Recognition(2024)

引用 0|浏览23
摘要
Machine learning models can perform well on in-distribution data but oftenfail on biased subgroups that are underrepresented in the training data,hindering the robustness of models for reliable applications. Such subgroupsare typically unknown due to the absence of subgroup labels. Discovering biasedsubgroups is the key to understanding models' failure modes and furtherimproving models' robustness. Most previous works of subgroup discovery make animplicit assumption that models only underperform on a single biased subgroup,which does not hold on in-the-wild data where multiple biased subgroups exist. In this work, we propose Decomposition, Interpretation, and Mitigation (DIM),a novel method to address a more challenging but also more practical problem ofdiscovering multiple biased subgroups in image classifiers. Our approachdecomposes the image features into multiple components that represent multiplesubgroups. This decomposition is achieved via a bilinear dimension reductionmethod, Partial Least Square (PLS), guided by useful supervision from the imageclassifier. We further interpret the semantic meaning of each subgroupcomponent by generating natural language descriptions using vision-languagefoundation models. Finally, DIM mitigates multiple biased subgroupssimultaneously via two strategies, including the data- and model-centricstrategies. Extensive experiments on CIFAR-100 and Breeds datasets demonstratethe effectiveness of DIM in discovering and mitigating multiple biasedsubgroups. Furthermore, DIM uncovers the failure modes of the classifier onHard ImageNet, showcasing its broader applicability to understanding model biasin image classifiers. The code is available athttps://github.com/ZhangAIPI/DIM.
更多
查看译文
关键词
Image Classification,Multiple Groups,Machine Learning Models,Partial Least Squares,Failure Modes,Partial Least,Foundation Model,CIFAR-100 Dataset,Specific Categories,Similarity Score,Visual Features,Latent Space,Negative Direction,Dolphins,Image Retrieval,Inferior Performance,Directions In Space,Stages Of Decomposition,Task Of Finding,Partial Least Squares Method,Multiple Biases,Unknown Bias,Soft Labels,Aquatic Mammals,Dynamic Training,Role Of Supervision
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
0
您的评分 :

暂无评分

数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn