cs.AI(2023-12-30)

📊 共 4 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (2) 支柱九:具身大模型 (Embodied Foundation Models) (2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)

#题目一句话要点标签🔗
1 Contrastive learning-based agent modeling for deep reinforcement learning 提出基于对比学习的智能体建模方法CLAM,提升多智能体强化学习的适应性策略。 reinforcement learning deep reinforcement learning contrastive learning
2 Principal-Agent Reward Shaping in MDPs 提出MDP框架下的委托代理奖励塑造方法,提升委托人效用 reward shaping

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
3 Is Knowledge All Large Language Models Needed for Causal Reasoning? 提出因果归因模型,评估大语言模型因果推理对知识和数据的依赖性 large language model
4 ConfusionPrompt: Practical Private Inference for Online Large Language Models ConfusionPrompt:一种实用的在线大语言模型隐私推理框架 large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页