cs.LG (2025-09-10)

📊 13 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 2: RL & Architecture (7 🔗1) Pillar 9: Embodied Foundation Models (4) Pillar 1: Robot Control (1) Pillar 8: Physics-based Animation (1)

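🔬 Pillar 2: RL & Architecture (7 papers)

#	Title	Key Point	Tags	🔗	⭐
1	Merge-of-Thought Distillation	Proposes Merge-of-Thought Distillation to resolve multi-teacher conflicts when distilling long chain-of-thought (CoT) models.	distillation chain-of-thought
2	Tokenizing Loops of Antibodies	Proposes Igloo, a tokenizer for antibody loop regions, improving protein language model performance and facilitating antibody design.	contrastive learning foundation model multimodal
3	NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment	Proposes NurseSchedRL to solve the nurse-patient assignment problem.	reinforcement learning PPO
4	AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning	AgentGym-RL: trains LLM agents for long-horizon decision making through multi-turn reinforcement learning.	reinforcement learning
5	Replicable Reinforcement Learning with Linear Function Approximation	Proposes replicable algorithms for reinforcement learning with linear function approximation to improve experimental consistency.	reinforcement learning
6	MobileRL: Online Agentic Reinforcement Learning for Mobile GUI Agents	Proposes the MobileRL framework, which improves the task-completion ability of mobile GUI agents through online reinforcement learning.	reinforcement learning	✅
7	Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform	Proposes a decision-support AI based on deep reinforcement learning to optimize patient transfer and resource utilization during mass-casualty incidents.	reinforcement learning deep reinforcement learning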

🔬 Pillar 9: Embodied Foundation Models (4 papers)

#	Title	Key Point	Tags	🔗	⭐
8	LLM-VeriPPA: Power, Performance, and Area Optimization aware Verilog Code Generation with Large Language Models	VeriPPA: Verilog code generation with large language models, optimized for power, performance, and area.	large language model
9	Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics	Uses vision-language models to classify neutrino events in high-energy physics.	large language model multimodal
10	Deploying AI for Signal Processing education: Selected challenges and intriguing opportunities	Explores the application of AI in signal processing education: challenges and opportunities.	large language model
11	ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System	ChemBOMAS: accelerates Bayesian optimization in chemistry with an LLM-enhanced multi-agent system.	large language model

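🔬 Pillar 1: Robot Control (1 paper)

#	Title	Key Point	Tags	🔗	⭐
12	Fast attention mechanisms: a tale of parallelism	Proposes the ANNA attention mechanism, which speeds up Transformers while retaining massively parallel computation capability.	MPC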

🔬 Pillar 8: Physics-based Animation (1 paper)

#	Title	Key Point	Tags	🔗	⭐
13	Energy-convergence trade off for the training of neural networks on bio-inspired hardware	Proposes an energy-convergence trade-off approach to optimize neural network training.	PULSE
