中文扩散教师模型的期望方差缩减
ENVariance Reduction for Expectations with Diffusion Teachers
CARV提出一种计算感知的方差核算框架,通过分层蒙特卡洛估计器替代传统方法,在扩散模型作为冻结教师的场景中,分摊昂贵的上游工作(如渲染、仿真、编码),从而降低梯度估计的方差和计算成本。该方法显著提升了文本到3D、单步蒸馏等下游管道的效率。
arXiv:2605.21489v2 Announce Type: replace-cross Abstract: Pretrained diffusion models serve as frozen teachers feeding downstream pipelines such as text-to-3D, single-step distillation, and data attribution. The teacher gradients these pipelines consume are Monte Carlo (MC) expectations over noise levels and Gaussian noise samples; their estimator variance dominates compute cost because each draw requires expensive upstream work (rendering, simulation, encoding). We introduce CARV, a compute-aware variance-accounting framework that motivates a hierarchical MC estimator: amortize the expensive