cs.RO(2025-12-17)
📊 共 11 篇论文
🎯 兴趣领域导航
支柱一:机器人控制 (Robot Control) (5)
支柱三:空间感知与语义 (Perception & Semantics) (4)
支柱九:具身大模型 (Embodied Foundation Models) (1)
支柱六:视频提取与匹配 (Video Extraction) (1)
🔬 支柱一:机器人控制 (Robot Control) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs | 提出mimic-video以解决机器人控制中的物理理解问题 | manipulation flow matching vision-language-action | ||
| 2 | ISS Policy : Scalable Diffusion Policy with Implicit Scene Supervision | 提出隐式场景监督扩散策略,提升机器人操作任务的泛化性和训练效率 | manipulation dexterous hand imitation learning | ||
| 3 | GuangMing-Explorer: A Four-Legged Robot Platform for Autonomous Exploration in General Environments | GuangMing-Explorer:用于通用环境自主探索的四足机器人平台 | legged robot | ||
| 4 | QuantGraph: A Receding-Horizon Quantum Graph Solver | 提出QuantGraph,一种基于后退视界的量子图求解器,提升图优化效率。 | model predictive control | ||
| 5 | Load-Based Variable Transmission Mechanism for Robotic Applications | 提出基于负载的可变传动机制,提升机器人关节在动态负载下的性能 | legged robot |
🔬 支柱三:空间感知与语义 (Perception & Semantics) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | VLA-AN: An Efficient and Onboard Vision-Language-Action Framework for Aerial Navigation in Complex Environments | 提出VLA-AN,用于复杂环境中无人机高效、安全的视觉-语言-动作端到端导航。 | 3D gaussian splatting gaussian splatting splatting | ||
| 7 | OMCL: Open-vocabulary Monte Carlo Localization | 提出基于视觉-语言特征的开放词汇蒙特卡洛定位方法,提升跨模态地图环境下的机器人定位鲁棒性。 | open-vocabulary open vocabulary | ||
| 8 | NAP3D: NeRF Assisted 3D-3D Pose Alignment for Autonomous Vehicles | NAP3D:NeRF辅助的3D-3D位姿对齐,用于提升自动驾驶车辆定位精度 | NeRF neural radiance field geometric consistency | ||
| 9 | HERO: Hierarchical Traversable 3D Scene Graphs for Embodied Navigation Among Movable Obstacles | 提出HERO框架以解决动态障碍物导航问题 | traversability |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | MiVLA: Towards Generalizable Vision-Language-Action Model with Human-Robot Mutual Imitation Pre-training | MiVLA:基于人-机互模仿预训练的通用视觉-语言-动作模型 | vision-language-action VLA |
🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | BEV-Patch-PF: Particle Filtering with BEV-Aerial Feature Matching for Off-Road Geo-Localization | 提出BEV-Patch-PF,利用BEV特征匹配的粒子滤波实现越野环境无GPS定位 | feature matching |