cs.AI (2025-09-02)
📊 5 papers in total
🎯 Interest Area Navigation
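🔬 Pillar 2: RL Algorithms & Architecture (3 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | The Landscape of Agentic Reinforcement Learning for LLMs: A Survey | Proposes an Agentic RL framework that recasts LLMs from sequence generators into autonomous decision-making agents, and comprehensively surveys its capabilities, applications, and future directions. | reinforcement learning, large language model | ||
| 2 | UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning | UI-TARS-2: improves GUI agent performance through multi-turn reinforcement learning, achieving stronger generalization. | reinforcement learning | ||
| 3 | How Real Is AI Tutoring? Comparing Simulated and Human Dialogues in One-on-One Instruction | Compares AI and human tutoring dialogues, revealing the limitations of current AI in the depth of instructional interaction. | teacher-student, large language model | ||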
🔬 Pillar 9: Embodied Foundation Models (2 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | AppCopilot: Toward General, Accurate, Long-Horizon, and Efficient Mobile Agent | AppCopilot: toward a general, accurate, long-horizon, and efficient mobile agent. | large language model, foundation model, multimodal | ||
| 5 | Oyster-I: Beyond Refusal -- Constructive Safety Alignment for Responsible Language Models | Oyster-I: moving beyond refusal to build constructive safety alignment for responsible language models. | large language model | ||