cs.AI(2025-08-17)
📊 共 3 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Where to Start Alignment? Diffusion Large Language Model May Demand a Distinct Position | 提出中间令牌安全对齐方法以提升扩散大语言模型安全性 | reinforcement learning large language model | ||
| 2 | TaoSR1: The Thinking Model for E-commerce Relevance Search | 提出TaoSR1以解决电商相关性搜索中的推理不足问题 | DPO direct preference optimization large language model |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Non-Interactive Symbolic-Aided Chain-of-Thought for Logical Reasoning | 提出符号辅助链式思维以提升逻辑推理能力 | large language model chain-of-thought |