cs.AI(2026-04-10)
📊 共 18 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (13 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 14 | PilotBench: A Benchmark for General Aviation Agents with Safety Constraints | PilotBench:面向通用航空代理,带安全约束的基准测试 | MAE embodied AI large language model | ||
| 15 | SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks | 提出SPPO以解决长时间推理任务中的PPO不稳定问题 | PPO large language model chain-of-thought | ||
| 16 | Advantage-Guided Diffusion for Model-Based Reinforcement Learning | 提出Advantage引导的扩散模型(AGD-MBRL),提升基于扩散模型的模型强化学习性能。 | reinforcement learning PPO world model | ||
| 17 | On the Representational Limits of Quantum-Inspired 1024-D Document Embeddings: An Experimental Evaluation Framework | 评估量子启发式1024维文档嵌入的表征能力极限,揭示其在信息检索中的局限性 | teacher-student distillation large language model | ||
| 18 | StaRPO: Stability-Augmented Reinforcement Policy Optimization | 提出StaRPO,通过增强推理稳定性提升大型语言模型在复杂推理任务中的性能。 | reinforcement learning large language model |