中文ComPose：何时信任手部进行物体姿态追踪

ENComPose: When to Trust Hands for Object Pose Tracking

arXiv cs.CV2026年5月25日

ComPose提出一种手部感知的6DoF物体跟踪框架，从RGB视频中估计姿态，将手视为交互而非纯遮挡，显著提升手部严重遮挡下的鲁棒性，适用于机器人操作。

arXiv:2605.23523v1 Announce Type: new Abstract: Reconstructing the motion of objects from videos is a key component for embodied AI and robot manipulation. While diverse approaches to object pose tracking have been studied, they rely heavily on strong external priors, such as depth data or 3D templates, and remain highly vulnerable to severe occlusions by hand grasps despite the use of explicit masks. In this work, we present ComPose, a 6DoF object tracking framework designed for hand-aware object pose estimation from RGB video. Rather than treating the hand purely as an occluder, our method h