中文PROGRESSLM: 迈向视觉语言模型中的进度推理

ENPROGRESSLM: Towards Progress Reasoning in Vision-Language Models

arXiv cs.CV2026年5月25日

arXiv:2601.15224v2 提出Progress-Bench基准，系统评估视觉语言模型从部分观测中推理任务进展的能力。研究探索了人类启发的两阶段推理范式（无训练提示及训练方法），突破仅识别静态视觉内容的局限，为手术机器人等长程动态任务的进展监控提供新思路。

arXiv:2601.15224v2 Announce Type: replace Abstract: Estimating task progress requires reasoning over long-horizon dynamics rather than recognizing static visual content. While modern Vision-Language Models (VLMs) excel at describing what is visible, it remains unclear whether they can infer how far a task has progressed from partial observations. To this end, we introduce Progress-Bench, a benchmark for systematically evaluating progress reasoning in VLMs. Beyond benchmarking, we further explore a human-inspired two-stage progress reasoning paradigm through both training-free prompting and tra