| 1 |
Do Large Language Models Know What They Are Capable Of? |
探讨大型语言模型的自我能力认知与决策改进 |
large language model |
|
|
| 2 |
Large language models and the entropy of English |
利用大语言模型揭示英语文本的长程结构 |
large language model |
|
|
| 3 |
Compute-Accuracy Pareto Frontiers for Open-Source Reasoning Large Language Models |
针对开源推理大语言模型,构建计算-精度帕累托前沿,优化模型选择。 |
large language model |
|
|
| 4 |
Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time |
提出CREST,通过干预注意力头引导LLM推理,提升效率和准确率。 |
large language model chain-of-thought |
|
|
| 5 |
Adaptive Dependency-aware Prompt Optimization Framework for Multi-Step LLM Pipeline |
提出ADOPT框架,自适应优化多步LLM流水线中的提示,解决依赖建模难题。 |
large language model |
|
|
| 6 |
Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements |
提出Encyclo-K,通过动态组合知识语句评估LLM的综合理解能力 |
large language model |
|
|
| 7 |
Vibe Coding, Interface Flattening |
分析“Vibe Coding”范式,揭示大模型驱动软件开发中界面扁平化与控制权转移的矛盾。 |
large language model |
|
|
| 8 |
MUSIC: MUlti-Step Instruction Contrast for Multi-Turn Reward Models |
提出MUSIC:多步指令对比方法,提升多轮对话奖励模型性能 |
large language model |
|
|
| 9 |
Quantum Visual Word Sense Disambiguation: Unraveling Ambiguities Through Quantum Inference Model |
提出量子推理模型以解决视觉词义消歧问题 |
large language model |
|
|
| 10 |
Safe in the Future, Dangerous in the Past: Dissecting Temporal and Linguistic Vulnerabilities in LLMs |
揭示大语言模型在语言和时间维度上的安全漏洞,提出不变对齐方法。 |
large language model |
|
|