Positive Grid Spark Instruction

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Abstract: Vision-And-Language Navigation (VLN) suffers from the limited diversity and scale of training data, primarily constrained by the manual curation of existing simulators. To address this, we ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果

反馈

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

今日热点