中文记忆破碎:利用退化生成检测与缓解扩散模型中的记忆化
ENBroken Memories: Detecting and Mitigating Memorization in Diffusion Models with Degraded Generations
扩散模型生成高质量图像时易记忆训练数据,引发隐私和版权风险。研究首次发现记忆导致内部数值不稳定,产生视觉“破碎”伪影。受数值方法稳定性分析启发,基于潜在更新范数定义经验稳定区域,量化生成稳定性。提出在线稳定启动策略,有效提升隐私保护且不牺牲图像质量。
arXiv:2605.22050v2 Announce Type: replace Abstract: While diffusion models excel at generating high-quality images, their tendency to memorize training data poses significant privacy and copyright risks. In this work, we for the first time identify that memorization induces internal numerical instability, often manifesting as visually ``broken'' artifacts. Inspired by stability analysis in numerical methods, we introduce empirical stability regions based on latent update norms to quantitatively characterize stable behavior during generation. Leveraging this, we propose a principled, on-the-fly