cs.LG（2023-12-16）

📊 共 8 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (7 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Advancing RAN Slicing with Offline Reinforcement Learning	提出离线强化学习方法，提升无线接入网切片中的资源管理效率。	reinforcement learning offline RL offline reinforcement learning
2	Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing	提出分数阶深度强化学习，解决移动边缘计算中时延敏感任务的AoI最小化问题。	reinforcement learning deep reinforcement learning DRL
3	RedCore: Relative Advantage Aware Cross-modal Representation Learning for Missing Modalities with Imbalanced Missing Rates	提出RedCore，解决多模态学习中模态缺失和不平衡问题。	representation learning multimodal
4	Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System	针对双积分器系统，分析策略网络在扩展状态空间下的泛化性能退化问题	reinforcement learning deep reinforcement learning DRL
5	Rethinking Dimensional Rationale in Graph Contrastive Learning from Causal Perspective	提出维度理性图对比学习方法，从因果视角提升图表示的判别性和迁移性	contrastive learning	✅
6	Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning	提出一种基于模仿学习的增量式安全强化学习方法，避免轨迹成本约束的过度估计或低估。	reinforcement learning
7	Active Reinforcement Learning for Robust Building Control	提出ActivePLR算法，用于鲁棒建筑控制的主动强化学习。	reinforcement learning

🔬 支柱九：具身大模型 (Embodied Foundation Models) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
8	PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU	PowerInfer：利用消费级GPU实现快速大语言模型推理服务	large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页