| 1 |
Can Large Language Models Develop Gambling Addiction? |
研究发现大语言模型可能表现出类似人类赌博成瘾的行为模式 |
large language model |
|
|
| 2 |
UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration |
UniMIC:面向人机协作的Token化多模态交互编码框架 |
multimodal |
|
|
| 3 |
The Emergence of Altruism in Large-Language-Model Agents Society |
提出基于Schelling模型的LLM智能体社会模拟框架,揭示利他主义涌现机制与模型异质性。 |
large language model |
|
|
| 4 |
Large Language Models as Nondeterministic Causal Models |
提出基于非确定性因果模型的大语言模型反事实生成方法 |
large language model |
|
|
| 5 |
Patient-specific Biomolecular Instruction Tuning |
提出KRONOS图-LLM框架,结合CPTAC-PROTSTRUCT数据集,提升肿瘤精准医疗中患者个体化蛋白质组学理解。 |
large language model multimodal |
|
|
| 6 |
Guiding Evolution of Artificial Life Using Vision-Language Models |
ASAL++:利用视觉-语言模型引导人工生命演化,实现开放式探索 |
foundation model multimodal |
|
|
| 7 |
You Can't Steal Nothing: Mitigating Prompt Leakages in LLMs via System Vectors |
提出SysVec,通过系统向量编码缓解大语言模型中的提示词泄露问题 |
large language model instruction following |
|
|
| 8 |
Efficient Fine-Grained GPU Performance Modeling for Distributed Deep Learning of LLM |
提出一种高效细粒度的GPU性能建模方法,用于预测LLM分布式训练性能。 |
large language model |
|
|
| 9 |
Hilbert: Recursively Building Formal Proofs with Informal Reasoning |
Hilbert:结合非形式推理与形式验证,递归构建数学证明 |
large language model |
|
|
| 10 |
Dynamic Experts Search: Enhancing Reasoning in Mixture-of-Experts LLMs at Test Time |
提出动态专家搜索(DES),提升MoE LLM在推理时的性能和稳定性。 |
large language model |
|
|
| 11 |
TrueGradeAI: Retrieval-Augmented and Bias-Resistant AI for Transparent and Explainable Digital Assessments |
TrueGradeAI:一种检索增强且抗偏置的透明可解释AI数字评估框架 |
large language model |
|
|
| 12 |
Estimating the Empowerment of Language Model Agents |
提出EELMA算法,通过信息论中的Empowerment评估语言模型Agent的能力。 |
chain-of-thought |
|
|
| 13 |
AutoPK: Leveraging LLMs and a Hybrid Similarity Metric for Advanced Retrieval of Pharmacokinetic Data from Complex Tables and Documents |
AutoPK:利用LLM和混合相似度量从复杂表格和文档中高效检索药代动力学数据 |
large language model |
✅ |
|
| 14 |
Bridging Language Models and Formal Methods for Intent-Driven Optical Network Design |
提出结合LLM与形式化方法的意图驱动光网络设计框架 |
large language model |
|
|
| 15 |
Toward a Theory of Generalizability in LLM Mechanistic Interpretability Research |
提出LLM可解释性研究中的泛化性理论框架,并验证1-back注意力头的泛化能力 |
large language model |
|
|
| 16 |
InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios |
InfiAgent:面向无限场景的自进化金字塔型智能体框架 |
large language model |
|
|
| 17 |
SecureAgentBench: Benchmarking Secure Code Generation under Realistic Vulnerability Scenarios |
SecureAgentBench:在真实漏洞场景下评估代码Agent的安全代码生成能力 |
large language model |
|
|
| 18 |
The Thinking Spectrum: An Empirical Study of Tunable Reasoning in LLMs through Model Merging |
通过模型融合实现LLM可调推理能力:大规模实证研究 |
large language model |
|
|