cs.AI(2023-12-30)
📊 共 4 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Contrastive learning-based agent modeling for deep reinforcement learning | 提出基于对比学习的智能体建模方法CLAM,提升多智能体强化学习的适应性策略。 | reinforcement learning deep reinforcement learning contrastive learning | ||
| 2 | Principal-Agent Reward Shaping in MDPs | 提出MDP框架下的委托代理奖励塑造方法,提升委托人效用 | reward shaping |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Is Knowledge All Large Language Models Needed for Causal Reasoning? | 提出因果归因模型,评估大语言模型因果推理对知识和数据的依赖性 | large language model | ||
| 4 | ConfusionPrompt: Practical Private Inference for Online Large Language Models | ConfusionPrompt:一种实用的在线大语言模型隐私推理框架 | large language model |