cs.LG(2025-06-01)

📊 共 7 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (4) 支柱九:具身大模型 (Embodied Foundation Models) (2 🔗1) 支柱七:动作重定向 (Motion Retargeting) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)

#题目一句话要点标签🔗
1 Action Dependency Graphs for Globally Optimal Coordinated Reinforcement Learning 提出动作依赖图以解决多智能体强化学习的全局最优问题 reinforcement learning
2 Closing the Gap between TD Learning and Supervised Learning with $Q$-Conditioned Maximization 提出Q条件最大化监督学习以解决SL与TD学习间的性能差距 reinforcement learning offline RL offline reinforcement learning
3 Generalized Linear Markov Decision Process 提出广义线性马尔可夫决策过程以解决奖励信号非线性问题 reinforcement learning offline RL
4 Accelerated Learning with Linear Temporal Logic using Differentiable Simulation 提出结合可微仿真与线性时序逻辑的学习方法以解决稀疏奖励问题 reinforcement learning differentiable simulation

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
5 Uni-LoRA: One Vector is All You Need 提出Uni-LoRA框架以实现高效的参数共享与微调 large language model
6 SafeSteer: Interpretable Safety Steering with Refusal-Evasion in LLMs 提出SafeSteer以解决大语言模型安全调整问题 large language model

🔬 支柱七:动作重定向 (Motion Retargeting) (1 篇)

#题目一句话要点标签🔗
7 Beyond Attention: Learning Spatio-Temporal Dynamics with Emergent Interpretable Topologies 提出InterGAT以解决图注意力网络的局限性 spatial relationship

⬅️ 返回 cs.LG 首页 · 🏠 返回主页