| 1 |
A Survey on Retrieval And Structuring Augmented Generation with Large Language Models |
综述检索与结构增强的大语言模型生成方法,解决幻觉、知识过时和领域受限问题。 |
large language model multimodal |
|
|
| 2 |
Readme_AI: Dynamic Context Construction for Large Language Models |
Readme_AI:提出动态上下文构建方法,提升大语言模型在特定查询下的准确性和可靠性。 |
large language model |
✅ |
|
| 3 |
Established Psychometric vs. Ecologically Valid Questionnaires: Rethinking Psychological Assessments in Large Language Models |
对比心理测量问卷与生态效度问卷,重新评估大语言模型中的心理评估方法 |
large language model |
|
|
| 4 |
Large Language Models Meet Legal Artificial Intelligence: A Survey |
综述:大型语言模型赋能法律人工智能 |
large language model |
✅ |
|
| 5 |
Arabic Large Language Models for Medical Text Generation |
提出并微调阿拉伯语大型语言模型,用于生成医疗文本,辅助医院管理系统。 |
large language model |
|
|
| 6 |
SearchInstruct: Enhancing Domain Adaptation via Retrieval-Based Instruction Dataset Creation |
SearchInstruct:通过检索增强的指令数据集创建提升领域自适应 |
large language model instruction following |
✅ |
|
| 7 |
Pluralistic Alignment for Healthcare: A Role-Driven Framework |
提出EthosAgents框架,增强医疗领域大语言模型对多元价值观的对齐。 |
large language model |
|
|
| 8 |
Is In-Context Learning Learning? |
研究表明上下文学习是一种有效的学习范式,但其泛化能力有限 |
chain-of-thought |
|
|
| 9 |
Population-Aligned Persona Generation for LLM-based Social Simulation |
提出人口对齐的Persona生成框架,提升LLM社会模拟的真实性和准确性 |
large language model |
|
|
| 10 |
No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes |
仅凭问题预测LLM答案准确性:线性探针揭示模型内部置信度 |
large language model |
|
|
| 11 |
Benchmark of stylistic variation in LLM-generated texts |
构建LLM文本风格基准:分析人类与AI生成文本的文体差异 |
large language model |
|
|
| 12 |
On LLM-Based Scientific Inductive Reasoning Beyond Equations |
提出SIRBench-V1基准,评估LLM在科学场景下超越方程的归纳推理能力 |
large language model |
|
|
| 13 |
Opening the Black Box: Interpretable LLMs via Semantic Resonance Architecture |
提出语义共振架构SRA,提升LLM可解释性与专家利用率。 |
large language model |
|
|
| 14 |
Unsupervised Hallucination Detection by Inspecting Reasoning Processes |
提出IRIS框架,通过检查LLM推理过程实现无监督幻觉检测 |
large language model |
|
|
| 15 |
Reasoning Under Uncertainty: Exploring Probabilistic Reasoning Capabilities of LLMs |
首个LLM概率推理能力综合研究:揭示优势与局限,探索未来改进方向 |
large language model |
|
|
| 16 |
Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts |
利用熵神经元抑制上下文复制,解决LLM参数知识与上下文冲突问题 |
large language model |
|
|
| 17 |
RefactorCoderQA: Benchmarking LLMs for Multi-Domain Coding Question Solutions in Cloud and Edge Deployment |
提出RefactorCoderQA基准和云边协同架构,提升LLM在多领域代码问题解决能力 |
large language model |
|
|
| 18 |
Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs |
提出DERN:一种免训练的专家剪枝与神经元重组框架,提升稀疏MoE LLM性能。 |
large language model |
|
|
| 19 |
Beyond Token Limits: Assessing Language Model Performance on Long Text Classification |
评估语言模型在长文本分类任务上的性能,发现长文本专用模型并无明显优势。 |
large language model |
|
|
| 20 |
Incongruent Positivity: When Miscalibrated Positivity Undermines Online Supportive Conversations |
研究表明,LLM在情感支持对话中易产生不恰当的积极回应,并提出检测方法。 |
large language model |
|
|
| 21 |
HetaRAG: Hybrid Deep Retrieval-Augmented Generation across Heterogeneous Data Stores |
HetaRAG:跨异构数据存储的混合深度检索增强生成框架 |
large language model |
✅ |
|
| 22 |
Whisper Has an Internal Word Aligner |
Whisper内部蕴含词对齐能力,无需额外训练即可高精度提取词级时间戳。 |
TAMP |
|
|