cs.LG(2025-05-22)
📊 共 4 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Interactive Post-Training for Vision-Language-Action Models | 提出RIPT-VLA以解决VLA模型适应性不足问题 | reinforcement learning vision-language-action VLA | ||
| 2 | Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | 提出OGSRL以解决医疗强化学习中的OOD问题 | reinforcement learning offline RL offline reinforcement learning | ||
| 3 | Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only | 提出PORL方法以解决在线强化学习微调中对Q函数的依赖问题 | reinforcement learning offline RL imitation learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | 提出滑模控制方法以解决联邦学习中的数据投毒攻击问题 | manipulation |