cs.RO（2026-04-06）

📊 共 32 篇论文

🎯 兴趣领域导航

支柱一：机器人控制 (Robot Control) (19) 支柱三：空间感知与语义 (Perception & Semantics) (5) 支柱二：RL算法与架构 (RL & Architecture) (5) 支柱九：具身大模型 (Embodied Foundation Models) (2) 支柱七：动作重定向 (Motion Retargeting) (1)

🔬 支柱一：机器人控制 (Robot Control) (19 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Learning Task-Invariant Properties via Dreamer: Enabling Efficient Policy Transfer for Quadruped Robots	DreamTIP：通过学习任务不变性提升四足机器人策略迁移效率	quadruped locomotion sim-to-real
2	Open-Loop Planning, Closed-Loop Verification: Speculative Verification for VLA	提出SV-VLA框架，结合开环规划与闭环验证，提升VLA控制效率与鲁棒性。	manipulation vision-language-action VLA
3	A Rapid Instrument Exchange System for Humanoid Robots in Minimally Invasive Surgery	针对人型机器人微创手术，提出快速器械交换系统。	humanoid humanoid robot dual-arm
4	Safety-Critical Centralized Nonlinear MPC for Cooperative Payload Transportation by Two Quadrupedal Robots	提出基于CBF的集中式非线性MPC，用于双足式机器人协同负载运输，确保安全。	quadruped MPC model predictive control
5	Learning Locomotion on Complex Terrain for Quadrupedal Robots with Foot Position Maps and Stability Rewards	提出基于足部位置图和稳定性奖励的强化学习方法，提升四足机器人复杂地形运动能力	quadruped locomotion reinforcement learning
6	The Compression Gap: Why Discrete Tokenization Limits Vision-Language-Action Model Scaling	揭示离散token化限制VLA模型扩展的原因：压缩瓶颈理论	manipulation diffusion policy vision-language-action
7	Failure Mechanisms and Risk Estimation for Legged Robot Locomotion on Granular Slopes	针对腿式机器人沙坡行走，提出基于力学建模的失效风险评估方法	legged robot locomotion penetration
8	FSUNav: A Cerebrum-Cerebellum Architecture for Fast, Safe, and Universal Zero-Shot Goal-Oriented Navigation	FSUNav：基于大脑-小脑架构的快速、安全、通用零样本目标导向导航	quadruped humanoid reinforcement learning
9	ARM: Advantage Reward Modeling for Long-Horizon Manipulation	提出优势奖励建模以解决长时间机器人操控中的稀疏奖励问题	manipulation reinforcement learning offline RL
10	Tune to Learn: How Controller Gains Shape Robot Policy Learning	控制器增益影响机器人策略学习，需针对学习范式优化	manipulation sim-to-real reinforcement learning
11	ROPA: Synthetic Robot Pose Generation for RGB-D Bimanual Data Augmentation	ROPA：用于RGB-D双臂操作数据增强的合成机器人姿态生成方法	manipulation bi-manual bimanual manipulation
12	MFE: A Multimodal Hand Exoskeleton with Interactive Force, Pressure and Thermo-haptic Feedback	MFE：一种具有交互式力、压力和热触觉反馈的多模态手部外骨骼	teleoperation multimodal
13	Differentiable SpaTiaL: Symbolic Learning and Reasoning with Geometric Temporal Logic for Manipulation Tasks	提出可微SpaTiaL，用于操作任务中基于几何时序逻辑的符号学习与推理。	manipulation trajectory optimization
14	UniCon: A Unified System for Efficient Robot Learning Transfers	UniCon：一种用于高效机器人学习迁移的统一系统	sim-to-real UniCon
15	F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation	提出F2F-AP，利用预测光流解决动态操作中异步策略的时延问题	manipulation contrastive learning
16	Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model	提出MV-VDP多视角视频扩散策略，用于数据高效的3D时空感知机器人操作。	manipulation diffusion policy
17	A Dynamic Toolkit for Transmission Characteristics of Precision Reducers with Explicit Contact Geometry	提出一种动态工具包，用于分析精密减速器的传动特性，显式考虑接触几何。	quadruped humanoid humanoid robot
18	Goal-Conditioned Neural ODEs with Guaranteed Safety and Stability for Learning-Based All-Pairs Motion Planning	提出基于目标条件神经ODE的安全稳定全配对运动规划方法	motion planning
19	From Impact to Insight: Dynamics-Aware Proprioceptive Terrain Sensing on Granular Media	提出动力学感知的足端触觉方法，用于高速运动中沙土地形的稳健估计	locomotion

🔬 支柱三：空间感知与语义 (Perception & Semantics) (5 篇)

#	题目	一句话要点	标签	🔗	⭐
20	Flash-Mono: Feed-Forward Accelerated Gaussian Splatting Monocular SLAM	Flash-Mono：基于前馈加速高斯溅射的单目SLAM，显著提升效率与精度。	3D gaussian splatting gaussian splatting splatting
21	An Open-Source LiDAR and Monocular Off-Road Autonomous Navigation Stack	提出开源的激光雷达与单目视觉越野自主导航方案，无需特定任务训练。	depth estimation monocular depth metric depth
22	ALIVE-LIO: Degeneracy-Aware Learning of Inertial Velocity for Enhancing ESKF-Based LiDAR-Inertial Odometry	ALIVE-LIO：基于学习的退化感知惯性速度估计，增强ESKF的激光雷达惯性里程计	LIO motion estimation
23	Vision-Based End-to-End Learning for UAV Traversal of Irregular Gaps via Differentiable Simulation	提出基于视觉和可微仿真的无人机复杂缝隙穿越端到端学习框架	traversability differentiable simulation
24	Terra: Hierarchical Terrain-Aware 3D Scene Graph for Task-Agnostic Outdoor Mapping	Terra：面向任务无关户外建图的分层地形感知3D场景图	traversability

🔬 支柱二：RL算法与架构 (RL & Architecture) (5 篇)

#	题目	一句话要点	标签	🔗	⭐
25	OMNI-PoseX: A Fast Vision Model for 6D Object Pose Estimation in Embodied Tasks	OMNI-PoseX：用于具身任务的快速6D物体姿态估计视觉模型	flow matching open-vocabulary open vocabulary
26	Look, Zoom, Understand: The Robotic Eyeball for Embodied Perception	提出EyeVLA，解决具身智能中语言引导的主动视觉感知问题	reinforcement learning embodied AI vision-language-action
27	Behavior-Constrained Reinforcement Learning with Receding-Horizon Credit Assignment for High-Performance Control	提出基于Receding-Horizon Credit Assignment的Behavior-Constrained强化学习，用于高性能控制。	reinforcement learning imitation learning
28	A Flow Matching Framework for Soft-Robot Inverse Dynamics	提出流匹配框架以解决软机器人逆动力学问题	flow matching
29	Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping	提出基于流匹配的轻量级驱动空间学习方法，用于软体机器人全身抓取	flow matching

🔬 支柱九：具身大模型 (Embodied Foundation Models) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
30	V2X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views	提出V2X-QA数据集与基准，评估多模态大模型在自动驾驶中跨视角推理能力。	large language model multimodal
31	Learning Structured Robot Policies from Vision-Language Models via Synthetic Neuro-Symbolic Supervision	利用合成神经符号监督，从视觉语言模型学习结构化机器人策略	foundation model multimodal

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
32	Joint Prediction of Human Motions and Actions in Human-Robot Collaboration	提出MA-HERP框架以解决人机协作中的运动与动作预测问题	human motion motion prediction

⬅️ 返回 cs.RO 首页 · 🏠 返回主页