VisAnalog: A Diagnostic Suite for Visual Concept Transfer on Natural Images
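We present VisAnalog, a natural-image analogical reasoning test set for evaluating visual concept learning. Each item takes the form A:B::C:?, and the model must identify the transformation sequence that maps A to B and apply that same sequence to C to obtain D. The suite is designed to test whether a model preserves concept-level properties under transformation and transfers them to new scenes, giving visual concept learning a stricter evaluation benchmark.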
arXiv:2605.23141v1 Announce Type: new Abstract: A useful test of visual concept learning is not just whether a model can recognize a concept in a single image, but whether it can preserve and manipulate concept-level properties under transformation and transfer them to new scenes. We introduce VisAnalog, a controlled suite for this setting on natural images. Each example instantiates $A\!:\!B::C\!:\,?$: images $B$ and a hidden target image $D$ are produced by applying the same deterministic transformation sequence to source images $A$ and $C$. Given $A$, $B$, and $C$, a model must answer a mul
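The construction described in the abstract, where $B$ and the hidden target $D$ come from applying one deterministic transformation sequence to $A$ and $C$, can be illustrated with a minimal sketch. Everything named here is an assumption for illustration: the toy nested-list image type, the three transformations (`flip_horizontal`, `rotate_90_clockwise`, `invert`), and the `make_analogy` helper are hypothetical and are not taken from VisAnalog, which operates on natural images with its own transformation set.

```python
from typing import Callable, Dict, List, Sequence

# Toy grayscale image: rows of pixel values in 0..255 (an assumption for
# illustration; the suite itself uses natural images).
Image = List[List[int]]
Transform = Callable[[Image], Image]


def flip_horizontal(img: Image) -> Image:
    """Mirror each row left to right."""
    return [list(reversed(row)) for row in img]


def rotate_90_clockwise(img: Image) -> Image:
    """Rotate the image a quarter turn clockwise."""
    return [list(row) for row in zip(*img[::-1])]


def invert(img: Image) -> Image:
    """Invert pixel intensities."""
    return [[255 - p for p in row] for row in img]


def apply_sequence(img: Image, sequence: Sequence[Transform]) -> Image:
    """Apply each transformation in order; deterministic by construction."""
    out = img
    for transform in sequence:
        out = transform(out)
    return out


def make_analogy(a: Image, c: Image,
                 sequence: Sequence[Transform]) -> Dict[str, Image]:
    """Build one A:B::C:? item. B and the hidden D share the same sequence."""
    return {
        "A": a,
        "B": apply_sequence(a, sequence),
        "C": c,
        "D_hidden": apply_sequence(c, sequence),
    }


# Hypothetical usage: two unrelated source images, one shared sequence.
example = make_analogy(
    a=[[0, 255], [10, 20]],
    c=[[1, 2], [3, 4]],
    sequence=[flip_horizontal, invert],
)
```

Because the same sequence produces both $B$ and $D$, a model that sees only $A$, $B$, and $C$ has to infer the transformation from the first pair and carry it over to a different scene, which is the transfer the suite targets.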