| 1 |
English Pronunciation Evaluation without Complex Joint Training: LoRA Fine-tuned Speech Multimodal LLM |
Uses a LoRA fine-tuned multimodal LLM for efficient English pronunciation assessment and diagnosis |
large language model, multimodal |
|
|
| 2 |
ProMQA-Assembly: Multimodal Procedural QA Dataset on Assembly |
Proposes ProMQA-Assembly, a multimodal procedural QA dataset for evaluating assembly-task assistants. |
multimodal |
|
|
| 3 |
Structure-Learnable Adapter Fine-Tuning for Parameter-Efficient Large Language Models |
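Proposes a structure-learnable adapter fine-tuning method that improves the parameter efficiency and task adaptability of large language models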
large language model |
|
|
| 4 |
Beyond ROUGE: N-Gram Subspace Features for LLM Hallucination Detection |
Proposes an LLM hallucination detection method based on N-gram subspace features that significantly improves detection performance. |
large language model |
|
|
| 5 |
NoteBar: An AI-Assisted Note-Taking System for Personal Knowledge Management |
NoteBar: an AI-assisted note-taking system for personal knowledge management
large language model |
|
|
| 6 |
Curse of Knowledge: When Complex Evaluation Context Benefits yet Biases LLM Judges |
Builds the ComplexEval benchmark to expose and quantify the bias that auxiliary information induces in LLM judges during complex evaluations. |
large language model |
|
|
| 7 |
Using LLMs to create analytical datasets: A case study of reconstructing the historical memory of Colombia |
Uses large language models to reconstruct the historical memory of Colombia and create analytical datasets. |
large language model |
|
|
| 8 |
SESGO: Spanish Evaluation of Stereotypical Generative Outputs |
SESGO: proposes a framework for evaluating stereotypical generative outputs in Spanish, filling a gap in multilingual LLM bias evaluation. |
large language model |
|
|
| 9 |
Domain Adaptation of LLMs for Process Data |
Proposes a process-data prediction method based on LLM domain adaptation that improves predictive process monitoring performance |
large language model |
|
|
| 10 |
Measuring Scalar Constructs in Social Science with LLMs |
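Uses LLMs to measure scalar constructs in social science, proposing a token-probability-weighted scoring method and validating its effectiveness. |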
large language model |
|
|
| 11 |
DiaCBT: A Long-Periodic Dialogue Corpus Guided by Cognitive Conceptualization Diagram for CBT-based Psychological Counseling |
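DiaCBT: builds a long-periodic dialogue corpus for CBT-based psychological counseling, guided by cognitive conceptualization diagrams |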
large language model |
|
|
| 12 |
Artificially Fluent: Swahili AI Performance Benchmarks Between English-Trained and Natively-Trained Datasets |
Compares English-trained and natively Swahili-trained setups, revealing cross-lingual performance gaps in LLMs |
large language model |
|
|
| 13 |
Mitigation of Gender and Ethnicity Bias in AI-Generated Stories through Model Explanations |
Proposes the BAME method, which uses model explanations to mitigate gender and ethnicity bias in AI-generated stories. |
large language model |
|
|