cs.LG(2023-12-16)

📊 8 papers in total | 🔗 1 with code

🎯 Navigation by Interest Area

Pillar 2: RL Algorithms and Architecture (7 🔗1) · Pillar 9: Embodied Foundation Models (1)

🔬 Pillar 2: RL Algorithms and Architecture (7 papers)

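| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 1 | Advancing RAN Slicing with Offline Reinforcement Learning | Proposes an offline reinforcement learning method to improve resource management efficiency in radio access network (RAN) slicing. | reinforcement learning, offline RL, offline reinforcement learning | |
| 2 | Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing | Proposes fractional deep reinforcement learning to minimize the age of information (AoI) of latency-sensitive tasks in mobile edge computing. | reinforcement learning, deep reinforcement learning, DRL | |
| 3 | RedCore: Relative Advantage Aware Cross-modal Representation Learning for Missing Modalities with Imbalanced Missing Rates | Proposes RedCore to address missing modalities and imbalanced missing rates in multimodal learning. | representation learning, multimodal | |
| 4 | Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System | Analyzes, for the double-integrator system, how the generalization performance of policy networks degrades over an expanded state space. | reinforcement learning, deep reinforcement learning, DRL | |
| 5 | Rethinking Dimensional Rationale in Graph Contrastive Learning from Causal Perspective | Proposes a dimensional-rationale graph contrastive learning method that improves the discriminability and transferability of graph representations from a causal perspective. | contrastive learning | |
| 6 | Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning | Proposes an incremental, imitation-learning-based approach to safe reinforcement learning that avoids over- or underestimating the trajectory cost constraint. | reinforcement learning | |
| 7 | Active Reinforcement Learning for Robust Building Control | Proposes the ActivePLR algorithm, an active reinforcement learning approach for robust building control. | reinforcement learning | |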

🔬 Pillar 9: Embodied Foundation Models (1 paper)

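| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 8 | PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | PowerInfer: fast large language model inference serving on a consumer-grade GPU. | large language model | |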
