cs.AI(2025-05-12)
📊 共 22 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (14)
支柱二:RL算法与架构 (RL & Architecture) (7 🔗1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (14 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models | 提出S-GRPO以解决推理模型中的过度思考问题 | reinforcement learning large language model chain-of-thought | ||
| 16 | Explainable Reinforcement Learning Agents Using World Models | 提出基于世界模型的可解释强化学习代理以解决决策透明性问题 | reinforcement learning world model model-based RL | ||
| 17 | A Survey on Collaborative Mechanisms Between Large and Small Language Models | 提出大语言模型与小语言模型协作机制以解决资源限制问题 | distillation embodied AI large language model | ||
| 18 | Online Learning-based Adaptive Beam Switching for 6G Networks: Enhancing Efficiency and Resilience | 提出在线学习的自适应波束切换以解决6G网络的稳定性问题 | reinforcement learning deep reinforcement learning DRL | ||
| 19 | Multi-source Plume Tracing via Multi-Agent Reinforcement Learning | 提出多源蒸汽追踪算法以解决工业污染源定位问题 | reinforcement learning | ||
| 20 | Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving | 提出ZeroTIR以解决数学问题求解中的工具使用挑战 | reinforcement learning large language model | ✅ | |
| 21 | Measuring General Intelligence with Generated Games | 提出gg-bench以评估语言模型的通用推理能力 | reinforcement learning large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 22 | Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity | 提出Comet以加速大语言模型的私密推理 | MPC spatiotemporal large language model |