cs.LG(2025-09-10)

📊 共 13 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (7 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (4) 支柱一:机器人控制 (Robot Control) (1) 支柱八:物理动画 (Physics-based Animation) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (7 篇)

#题目一句话要点标签🔗
1 Merge-of-Thought Distillation 提出Merge-of-Thought Distillation,解决长链CoT模型蒸馏中多教师冲突问题。 distillation chain-of-thought
2 Tokenizing Loops of Antibodies 提出Igloo抗体环区Tokenizer,提升蛋白语言模型性能并促进抗体设计 contrastive learning foundation model multimodal
3 NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment 提出NurseSchedRL以解决护士-患者分配问题 reinforcement learning PPO
4 AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning AgentGym-RL:通过多轮强化学习训练LLM智能体进行长程决策 reinforcement learning
5 Replicable Reinforcement Learning with Linear Function Approximation 针对线性函数逼近的强化学习,提出可复现算法以提升实验一致性。 reinforcement learning
6 MobileRL: Online Agentic Reinforcement Learning for Mobile GUI Agents 提出MobileRL框架,通过在线强化学习提升移动GUI智能体的任务完成能力 reinforcement learning
7 Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform 提出基于深度强化学习的决策支持AI,优化大规模伤亡事件中的患者转运和资源利用。 reinforcement learning deep reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)

#题目一句话要点标签🔗
8 LLM-VeriPPA: Power, Performance, and Area Optimization aware Verilog Code Generation with Large Language Models VeriPPA:利用大语言模型实现功耗、性能和面积优化的Verilog代码生成 large language model
9 Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics 利用视觉-语言模型进行高能物理中微子事件分类 large language model multimodal
10 Deploying AI for Signal Processing education: Selected challenges and intriguing opportunities 探索AI在信号处理教育中的应用:挑战与机遇 large language model
11 ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System ChemBOMAS:利用LLM增强的多智能体系统加速化学领域的贝叶斯优化 large language model

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
12 Fast attention mechanisms: a tale of parallelism 提出ANNA注意力机制,加速Transformer并保持大规模并行计算能力 MPC

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
13 Energy-convergence trade off for the training of neural networks on bio-inspired hardware 提出能量收敛权衡方法以优化神经网络训练 PULSE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页