中文不过度生成,不过度判别:人机对齐的最佳平衡点
ENNot Too Generative, Not Too Discriminative: The Human Alignment Sweet Spot
通过联合能量模型(JEM)在固定架构中连续插值判别与生成训练,分离了学习目标对视觉表征类人对齐的混杂影响。研究发现:目标本身(而非架构或数据规模)是驱动对齐的关键。该方法为理解人类视觉表征的计算原理提供了新工具。
arXiv:2605.23819v1 Announce Type: new Abstract: A central question in computational vision is whether human-like visual representations are better explained by discriminative or generative learning. Existing comparisons, however, often confound the learning objective with architecture, scale, and training data, leaving open whether the objective itself drives alignment. We address this confound using Joint Energy-Based Models (JEMs), which interpolate continuously between discriminative and generative training within a fixed architecture. By varying a single mixing coefficient, we isolate the