| 1 |
Multimodal Large Language Models Meet Multimodal Emotion Recognition and Reasoning: A Survey |
综述多模态大语言模型在情感识别与推理中的应用与挑战 |
large language model multimodal |
✅ |
|
| 2 |
Metaphor identification using large language models: A comparison of RAG, prompt engineering, and fine-tuning |
利用大型语言模型进行隐喻识别:比较RAG、提示工程和微调方法 |
large language model chain-of-thought |
|
|
| 3 |
Towards Structured Knowledge: Advancing Triple Extraction from Regional Trade Agreements using Large Language Models |
利用大型语言模型从区域贸易协定中提取结构化知识三元组 |
large language model |
|
|
| 4 |
Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding |
提出Learn2PD以解决大语言模型推理速度瓶颈问题 |
large language model |
|
|
| 5 |
Pretraining Large Language Models with NVFP4 |
提出NVFP4训练方法,实现4-bit精度下大规模语言模型的稳定高效预训练。 |
large language model |
|
|
| 6 |
GateMABSA: Aspect-Image Gated Fusion for Multimodal Aspect-based Sentiment Analysis |
提出GateMABSA模型,通过门控多模态融合解决多模态情感分析中噪声过滤和跨模态对齐问题。 |
multimodal |
|
|
| 7 |
Understanding the Dilemma of Unlearning for Large Language Models |
提出unPact框架,揭示大语言模型不可靠的知识遗忘现象与机理。 |
large language model |
|
|
| 8 |
Sanitize Your Responses: Mitigating Privacy Leakage in Large Language Models |
提出Self-Sanitize框架,缓解大语言模型中的隐私泄露问题。 |
large language model |
✅ |
|
| 9 |
CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task |
提出CDT框架,从认知、领域和任务三维度全面评估大语言模型能力。 |
large language model |
✅ |
|
| 10 |
AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment |
AlignX:通过多语言表示对齐提升多语言大语言模型性能 |
large language model |
|
|
| 11 |
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models |
DiffuGuard:揭示并修复扩散大语言模型中固有的安全漏洞 |
large language model |
✅ |
|
| 12 |
MobileLLM-R1: Exploring the Limits of Sub-Billion Language Model Reasoners with Open Training Recipes |
MobileLLM-R1:通过开放训练方案探索十亿参数以下语言模型推理能力的极限 |
large language model chain-of-thought |
|
|
| 13 |
InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation |
提出InfLLM-V2:一种稠密-稀疏可切换注意力机制,实现模型从短序列到长序列的无缝适应。 |
large language model chain-of-thought |
✅ |
|
| 14 |
AdaThink-Med: Medical Adaptive Thinking with Uncertainty-Guided Length Calibration |
AdaThink-Med:提出一种不确定性引导长度校准的医学自适应思考框架 |
large language model chain-of-thought |
|
|
| 15 |
Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in LLMs |
揭示LLM中内在与提示价值观表达的双重机制,并分析其差异性。 |
large language model instruction following |
|
|
| 16 |
Calibrating Verbalized Confidence with Self-Generated Distractors |
提出DINCO,通过自生成干扰项校准LLM的置信度,提升可靠性。 |
large language model |
|
|
| 17 |
Not Wrong, But Untrue: LLM Overconfidence in Document-Based Queries |
LLM在文档问答中过度自信:揭示新闻场景下的幻觉问题与溯源挑战 |
large language model |
|
|
| 18 |
The Rise of AfricaNLP: Contributions, Contributors, and Community Impact (2005-2025) |
AfricaNLP贡献分析:追踪非洲自然语言处理研究进展与社区影响 |
large language model |
|
|
| 19 |
Fingerprinting LLMs via Prompt Injection |
LLMPrint:利用Prompt注入为LLM构建鲁棒指纹,实现模型溯源 |
large language model |
|
|
| 20 |
Generative Value Conflicts Reveal LLM Priorities |
ConflictScope:揭示LLM在价值冲突下的优先级偏好 |
large language model |
|
|
| 21 |
From Internal Representations to Text Quality: A Geometric Approach to LLM Evaluation |
利用内部表征几何特性评估LLM文本质量,实现无参考文本质量评估。 |
large language model |
|
|
| 22 |
Investigating Language and Retrieval Bias in Multilingual Previously Fact-Checked Claim Detection |
研究多语言预训练模型在跨语言事实核查中的语言和检索偏差 |
large language model |
|
|
| 23 |
Learning from Convenience Samples: A Case Study on Fine-Tuning LLMs for Survey Non-response in the German Longitudinal Election Study |
微调LLM解决调查非回应问题,利用便利样本提升选举研究准确性 |
large language model |
|
|
| 24 |
Hyperdimensional Probe: Decoding LLM Representations via Vector Symbolic Architectures |
提出超维探针,通过向量符号架构解码大型语言模型表征 |
large language model |
|
|
| 25 |
How Well Do LLMs Imitate Human Writing Style? |
提出一种快速免训练框架,用于评估大型语言模型模仿人类写作风格的能力 |
large language model |
|
|
| 26 |
BOE-XSUM: Extreme Summarization in Clear Language of Spanish Legal Decrees and Notifications |
BOE-XSUM:发布西班牙法律公文的明晰语言极端摘要数据集,并验证LLM微调有效性 |
large language model |
|
|
| 27 |
Expanding Computation Spaces of LLMs at Inference Time |
提出一种推理时扩展LLM计算空间的方法,提升问题解决能力 |
chain-of-thought |
|
|
| 28 |
SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching |
SemShareKV:通过Token级LSH匹配为语义相似Prompt高效共享KVCache |
large language model |
|
|
| 29 |
Hallucination is Inevitable for LLMs with the Open World Assumption |
重新审视大语言模型幻觉现象:开放世界假设下的必然产物 |
large language model |
|
|
| 30 |
ProxyAttn: Guided Sparse Attention via Representative Heads |
ProxyAttn:通过代表性注意力头引导的稀疏注意力机制,加速长文本处理。 |
large language model |
✅ |
|
| 31 |
Think Twice, Generate Once: Safeguarding by Progressive Self-Reflection |
提出渐进式自反思(PSR)方法,提升大语言模型生成内容的安全性。 |
large language model |
|
|
| 32 |
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents |
MemGen:为自进化Agent构建生成式潜在记忆,提升推理能力 |
large language model |
|
|
| 33 |
Bias Mitigation or Cultural Commonsense? Evaluating LLMs with a Japanese Dataset |
提出SOBACO:评估日语LLM社会偏见与文化常识的统一基准 |
large language model |
|
|
| 34 |
HarmMetric Eval: Benchmarking Metrics and Judges for LLM Harmfulness Assessment |
提出HarmMetric Eval,用于全面评估LLM有害性评估指标与判别器的质量。 |
large language model |
✅ |
|
| 35 |
MAS$^2$: Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems |
提出MAS$^2$,一种自生成、自配置、自校正的多智能体系统,提升复杂任务性能。 |
large language model |
✅ |
|
| 36 |
Training Dynamics of Parametric and In-Context Knowledge Utilization in Language Models |
研究训练条件对语言模型参数化知识和上下文知识利用的影响 |
large language model |
|
|
| 37 |
Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents |
提出实例级上下文学习方法,提升LLM Agent在复杂任务中的表现 |
large language model |
|
|
| 38 |
SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents |
SimuHome:面向智能家居LLM代理的时间与环境感知基准测试 |
large language model |
|
|