cs.LG (2025-06-23)

📊 6 papers in total | 🔗 2 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (4) · Pillar 2: RL & Architecture (2 🔗2)

🔬 Pillar 9: Embodied Foundation Models (4 papers)

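| # | Title | One-Sentence Summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 1 | Thought Anchors: Which LLM Reasoning Steps Matter? | Proposes the Thought Anchors method to analyze the reasoning process of large language models | chain-of-thought | |
| 2 | ReDit: Reward Dithering for Improved LLM Policy Optimization | Proposes ReDit to address the discrete-reward problem in LLM optimization | large language model | |
| 3 | No Training Wheels: Steering Vectors for Bias Correction at Inference Time | Proposes a training-free method to correct bias in classification models | large language model | |
| 4 | LLMs on a Budget? Say HOLA | Proposes the HOLA framework for efficient deployment of large language models | large language model | |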

🔬 Pillar 2: RL & Architecture (2 papers)

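| # | Title | One-Sentence Summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 5 | SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation | Proposes SlimMoE to address the compression and deployment of large MoE models | distillation, large language model | |
| 6 | Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Proposes Confucius3-Math to support Chinese K-12 mathematics learning | reinforcement learning, large language model | |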
