| 1 |
PENDULUM: A Benchmark for Assessing Sycophancy in Multimodal Large Language Models |
提出PENDULUM基准,评估多模态大语言模型中的谄媚现象 |
large language model multimodal |
✅ |
|
| 2 |
Understanding Chain-of-Thought in Large Language Models via Topological Data Analysis |
利用拓扑数据分析理解大语言模型中的思维链 |
large language model chain-of-thought |
|
|
| 3 |
FC-MIR: A Mobile Screen Awareness Framework for Intent-Aware Recommendation based on Frame-Compressed Multimodal Trajectory Reasoning |
提出FC-MIR框架,通过帧压缩多模态轨迹推理实现意图感知的移动屏幕推荐。 |
large language model multimodal |
|
|
| 4 |
The Epistemological Consequences of Large Language Models: Rethinking collective intelligence and institutional knowledge |
大型语言模型对认知的影响:重新思考集体智能与机构知识 |
large language model |
|
|
| 5 |
Clustering-based Transfer Learning for Dynamic Multimodal MultiObjective Evolutionary Algorithm |
提出基于聚类迁移学习的动态多模态多目标进化算法,解决动态环境下的多模态优化问题。 |
multimodal |
|
|
| 6 |
VIGOR+: Iterative Confounder Generation and Validation via LLM-CEVAE Feedback Loop |
VIGOR+:提出基于LLM-CEVAE反馈环路的迭代混淆因子生成与验证框架,解决因果推断中的隐藏混淆问题。 |
large language model |
|
|
| 7 |
The Erasure Illusion: Stress-Testing the Generalization of LLM Forgetting Evaluation |
提出Erasure Illusion框架,用于压力测试LLM遗忘评估的泛化能力。 |
large language model |
|
|
| 8 |
Efficient Jailbreak Mitigation Using Semantic Linear Classification in a Multi-Staged Pipeline |
提出一种基于语义线性分类的多阶段流水线,高效缓解大语言模型的越狱攻击。 |
large language model |
|
|
| 9 |
A Dataset and Preliminary Study of Using GPT-5 for Code-change Impact Analysis |
构建代码变更影响分析数据集,初步评估GPT-5在代码影响预测中的能力。 |
large language model |
|
|
| 10 |
An Agentic Framework for Autonomous Materials Computation |
提出基于Agent的材料计算框架,实现第一性原理计算的可靠自动化。 |
large language model |
|
|
| 11 |
Causal-Guided Detoxify Backdoor Attack of Open-Weight LoRA Models |
提出CBA:一种因果引导的LoRA模型解毒后门攻击方法 |
large language model |
|
|
| 12 |
Observer, Not Player: Simulating Theory of Mind in LLMs through Game Observation |
提出基于观察者模式的框架,通过石头剪刀布游戏评估LLM的心理理论能力 |
large language model |
|
|
| 13 |
Population-Evolve: a Parallel Sampling and Evolutionary Method for LLM Math Reasoning |
提出Population-Evolve,一种基于遗传算法的LLM数学推理优化方法 |
large language model |
|
|