cs.AI (2025-09-02)

📊 5 papers in total

🎯 Interest Area Navigation

Pillar 2: RL Algorithms & Architecture (3) · Pillar 9: Embodied Foundation Models (2)

🔬 Pillar 2: RL Algorithms & Architecture (3 papers)

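| # | Title | Key point | Tags |
|---|-------|-----------|------|
| 1 | The Landscape of Agentic Reinforcement Learning for LLMs: A Survey | Proposes an Agentic RL framework that recasts LLMs from sequence generators into autonomous decision-making agents, and comprehensively surveys its capabilities, applications, and future directions. | reinforcement learning, large language model |
| 2 | UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning | UI-TARS-2: improves GUI agent performance through multi-turn reinforcement learning, achieving stronger generalization. | reinforcement learning |
| 3 | How Real Is AI Tutoring? Comparing Simulated and Human Dialogues in One-on-One Instruction | Compares AI and human tutoring dialogues, revealing the limits of current AI in the depth of instructional interaction. | teacher-student, large language model |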

🔬 Pillar 9: Embodied Foundation Models (2 papers)

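| # | Title | Key point | Tags |
|---|-------|-----------|------|
| 4 | AppCopilot: Toward General, Accurate, Long-Horizon, and Efficient Mobile Agent | AppCopilot: toward a general, accurate, long-horizon, and efficient mobile agent. | large language model, foundation model, multimodal |
| 5 | Oyster-I: Beyond Refusal -- Constructive Safety Alignment for Responsible Language Models | Oyster-I: beyond refusal, constructive safety alignment for responsible language models. | large language model |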
