中文Uni-Edit:智能编辑是统一模型调优的通用任务
ENUni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
现有统一多模态模型(UMM)通常采用混合多任务训练,导致任务冲突与性能折衷。本文提出Uni-Edit,将智能图像编辑作为首个通用调谐任务,取代复杂混合流程,从根源避免冲突,实现真正相互增强,简化训练并提升效果。
arXiv:2605.21487v2 Announce Type: replace Abstract: Currently, enhancing Unified Multimodal Models (UMMs) with image understanding, generation, and editing capabilities mainly relies on mixed multi-task training. Due to inherent task conflicts, such strategy requires complex multi-stage pipelines, massive data mixing, and balancing tricks, merely resulting in a performance trade-off rather than true mutual reinforcement. To break this paradigm, we propose Uni-Edit, an intelligent image editing task that serves as the first general task for UMM tuning. Unlike complex mixed pipelines, Uni-Edit i