cs.LG(2023-12-02)
📊 4 papers in total
🎯 Interest Area Navigation
🔬 Pillar 2: RL Algorithms and Architecture (RL & Architecture) (3 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
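| 1 | A Survey of Temporal Credit Assignment in Deep Reinforcement Learning | Surveys the temporal credit assignment problem in deep reinforcement learning: formalization, challenges, and evaluation | reinforcement learning, deep reinforcement learning | | |
| 2 | Harnessing Discrete Representations For Continual Reinforcement Learning | Uses discrete representations to improve continual reinforcement learning performance | reinforcement learning, world model | | |
| 3 | RLHF and IIA: Perverse Incentives | Shows that the IIA assumption in RLHF gives rise to perverse incentives that are misaligned with preferences | reinforcement learning, RLHF | | |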
🔬 Pillar 9: Embodied Foundation Models (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis | Proposes an LLM-driven Verilog development framework that optimizes the power, performance, and area of the generated code | large language model | | |