cs.RO - arXiv Academic Archive

cs.LG 2026-06-17

UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

Preference-based RL provides an approach to learning reward models from pairwise comparisons of behaviors, bypassing the need for explicit reward design. However, existing methods typically rely on pa...

Mohamed Nabail, Leo Cheng, Jingmin Wang et al.

cs.LG 2026-06-17

Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models

Embodied Vision-Language-Action (VLA) models are typically obtained by fine-tuning powerful pretrained VLMs on robotics data, yet it is unclear how much commonsense and factual knowledge they retain a...

Nikita Kachaev, Andrey Moskalenko, Matvey Skripkin et al.

cs.CV 2026-06-17

OneCanvas: 3D Scene Understanding via Panoramic Reprojection

Existing approaches to 3D scene understanding in Vision-Language Models (VLMs) either rely on complex, model-specific geometry encoders or large training budgets in pursuit of spatial reasoning. Inste...

Bartłomiej Baranowski, Dave Zhenyu Chen, Matthias Nießner
