| 1 |
CrafterDojo: A Suite of Foundation Models for Building Open-Ended Embodied Agents in Crafter |
提出CrafterDojo以解决通用体智能体研究的快速原型问题 |
foundation model instruction following |
|
|
| 2 |
Neuro-Symbolic Artificial Intelligence: Towards Improving the Reasoning Abilities of Large Language Models |
提出神经符号人工智能以提升大型语言模型的推理能力 |
large language model |
✅ |
|
| 3 |
Large Language Models are Highly Aligned with Human Ratings of Emotional Stimuli |
探讨大型语言模型与人类情感评估的高度一致性 |
large language model |
|
|
| 4 |
UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion |
提出UniECS以解决电商多模态检索系统的局限性 |
multimodal |
✅ |
|
| 5 |
COMPASS: A Multi-Dimensional Benchmark for Evaluating Code Generation in Large Language Models |
提出COMPASS以解决代码生成评估的多维度问题 |
large language model |
|
|
| 6 |
Equinox: Holistic Fair Scheduling in Serving Large Language Models |
提出Equinox以解决大语言模型服务中的公平调度问题 |
large language model |
|
|
| 7 |
InPars+: Supercharging Synthetic Data Generation for Information Retrieval Systems |
提出InPars+以提升神经信息检索系统的合成数据生成 |
large language model chain-of-thought |
✅ |
|
| 8 |
Explaining Hitori Puzzles: Neurosymbolic Proof Staging for Sequential Decisions |
提出神经符号方法以解释Hitori谜题的决策过程 |
large language model |
|
|
| 9 |
Incident Analysis for AI Agents |
提出AI代理事件分析框架以解决安全隐患问题 |
chain-of-thought |
|
|
| 10 |
ChronoLLM: Customizing Language Models for Physics-Based Simulation Code Generation |
提出ChronoLLM以定制语言模型生成物理仿真代码 |
large language model |
|
|
| 11 |
Prompt Orchestration Markup Language |
提出POML以解决大型语言模型提示结构与集成问题 |
large language model |
|
|
| 12 |
The Collaboration Paradox: Why Generative AI Requires Both Strategic Intelligence and Operational Stability in Supply Chain Management |
提出协作悖论以解决供应链管理中的AI行为问题 |
large language model |
|
|
| 13 |
Agentic DraCor and the Art of Docstring Engineering: Evaluating MCP-empowered LLM Usage of the DraCor API |
提出MCP服务器以优化LLM与DraCor API的交互 |
large language model |
|
|
| 14 |
The Hidden Cost of Readability: How Code Formatting Silently Consumes Your LLM Budget |
提出代码格式优化策略以降低LLM计算成本 |
large language model |
|
|
| 15 |
CCFC: Core & Core-Full-Core Dual-Track Defense for LLM Jailbreak Protection |
提出CCFC框架以解决大型语言模型的越狱攻击问题 |
large language model |
|
|