cs.LG(2023-12-02)

📊 共 4 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (3) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
1 A Survey of Temporal Credit Assignment in Deep Reinforcement Learning 深度强化学习中时间信用分配问题综述:形式化、挑战与评估 reinforcement learning deep reinforcement learning
2 Harnessing Discrete Representations For Continual Reinforcement Learning 利用离散表示提升持续强化学习性能 reinforcement learning world model
3 RLHF and IIA: Perverse Incentives 揭示RLHF中IIA假设导致的偏好错位激励问题 reinforcement learning RLHF

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
4 Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis 提出基于大语言模型驱动的Verilog开发框架,优化代码功耗、性能和面积 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页