PROGRESSLM: Towards Progress Reasoning in Vision-Language Models
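arXiv:2601.15224v2 introduces Progress-Bench, a benchmark that systematically evaluates whether vision-language models can reason about task progress from partial observations. The study explores a human-inspired two-stage reasoning paradigm, through both training-free prompting and training-based methods, that goes beyond recognizing static visual content, and it offers a new approach to progress monitoring in long-horizon dynamic tasks such as surgical robotics.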
Abstract: Estimating task progress requires reasoning over long-horizon dynamics rather than recognizing static visual content. While modern Vision-Language Models (VLMs) excel at describing what is visible, it remains unclear whether they can infer how far a task has progressed from partial observations. To this end, we introduce Progress-Bench, a benchmark for systematically evaluating progress reasoning in VLMs. Beyond benchmarking, we further explore a human-inspired two-stage progress reasoning paradigm through both training-free prompting and tra