cs.AI (2026-01-07)
📊 15 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
🔬 Pillar 9: Embodied Foundation Models (9 papers)
🔬 Pillar 2: RL Algorithms and Architecture (RL & Architecture) (6 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
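| 10 | MobileDreamer: Generative Sketch World Model for GUI Agent | MobileDreamer builds a generative sketch world model for GUI agents, improving performance on long-horizon tasks. | world model dreamer | ||
| 11 | Interleaved Tool-Call Reasoning for Protein Function Understanding | Proposes PFUA, an interleaved tool-call framework for protein function understanding that significantly improves prediction performance. | reinforcement learning large language model chain-of-thought | ||
| 12 | ReEfBench: Quantifying the Reasoning Efficiency of LLMs | Proposes the ReEfBench framework to quantify the reasoning efficiency of large language models. | distillation large language model chain-of-thought | ||
| 13 | Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models | Proposes Dynamic Outlier Truncation (DOT) to address length shift in reasoning model training, improving both efficiency and performance. | reinforcement learning chain-of-thought | ||
| 14 | ROI-Reasoning: Rational Optimization for Inference via Pre-Computation Meta-Cognition | Proposes ROI-Reasoning, which uses pre-computation meta-cognition to optimize LLM reasoning performance under budget constraints. | reinforcement learning large language model | ||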
| 15 | Sandwich Reasoning: An Answer-Reasoning-Answer Approach for Low-Latency Query Correction | Proposes SandwichR, which uses an answer-reasoning-answer paradigm to achieve low-latency, high-accuracy query correction. | reinforcement learning chain-of-thought |