cs.CL（2025-09-02）

📊 共 34 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (24 🔗3) 支柱二：RL算法与架构 (RL & Architecture) (7 🔗1) 支柱七：动作重定向 (Motion Retargeting) (1 🔗1) 支柱四：生成式动作 (Generative Motion) (1) 支柱三：空间感知与语义 (Perception & Semantics) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (24 篇)

#	题目	一句话要点	标签	🔗
1	VaccineRAG: Boosting Multimodal Large Language Models' Immunity to Harmful RAG Samples	提出VaccineRAG以解决RAG样本对LLMs的影响问题	large language model multimodal chain-of-thought
2	IDEAlign: Comparing Large Language Models to Human Experts in Open-ended Interpretive Annotations	IDEAlign：通过“奇数挑一”范式，评估LLM在开放式解释性标注任务中与人类专家的对齐程度	large language model
3	Clustering Discourses: Racial Biases in Short Stories about Women Generated by Large Language Models	揭示LLaMA 3.2-3B生成短篇小说中关于黑人和白人女性的种族偏见	large language model
4	Scaling behavior of large language models in emotional safety classification across sizes and tasks	研究LLM在情感安全分类中的规模效应，探索轻量级模型在心理健康领域的应用潜力	large language model
5	Comparative Study of Pre-Trained BERT and Large Language Models for Code-Mixed Named Entity Recognition	对比研究预训练BERT与大语言模型在Code-Mixed命名实体识别中的性能	large language model
6	An Ensemble Classification Approach in A Multi-Layered Large Language Model Framework for Disease Prediction	提出一种多层LLM框架下的集成方法，用于提升阿拉伯语社交媒体疾病预测精度。	large language model
7	E-THER: A Multimodal Dataset for Empathic AI -- Towards Emotional Mismatch Awareness	提出E-THER多模态数据集，用于提升AI在识别言语-视觉情感不一致方面的能力。	multimodal
8	DeepSeek performs better than other Large Language Models in Dental Cases	DeepSeek在大语言模型牙科病例分析中表现优于其他模型	large language model
9	Behavioral Fingerprinting of Large Language Models	提出大语言模型行为指纹框架，揭示模型对齐策略差异	large language model	✅
10	DRAssist: Dispute Resolution Assistance using Large Language Models	提出DRAssist以利用大型语言模型解决争议问题	large language model
11	Extracting OPQRST in Electronic Health Records using Large Language Models with Reasoning	利用大型语言模型与推理能力，从电子病历中提取OPQRST信息。	large language model
12	FActBench: A Benchmark for Fine-grained Automatic Evaluation of LLM-Generated Text in the Medical Domain	构建医学领域LLM生成文本自动评估基准FActBench，提升事实性评估准确度	large language model chain-of-thought
13	How Instruction-Tuning Imparts Length Control: A Cross-Lingual Mechanistic Analysis	研究指令调优如何赋予大语言模型长度控制能力：一种跨语言的机制分析	large language model foundation model
14	PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture	PalmX 2025：首个面向阿拉伯和伊斯兰文化的大语言模型评测共享任务	large language model
15	MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds	MoSEs：基于风格专家混合与条件阈值的不确定性感知AI生成文本检测	large language model	✅
16	SpecEval: Evaluating Model Adherence to Behavior Specifications	SpecEval：评估大模型行为规范一致性，发现高达20%的合规性差距。	foundation model
17	LLMs and their Limited Theory of Mind: Evaluating Mental State Annotations in Situated Dialogue	提出基于LLM的两步框架，评估团队对话中共享心智模型的偏差。	large language model
18	Towards Fundamental Language Models: Does Linguistic Competence Scale with Model Size?	提出基础语言模型范式，探索语言能力与模型规模的解耦策略	large language model
19	Avoidance Decoding for Diverse Multi-Branch Story Generation	提出Avoidance Decoding，解决LLM故事生成中多样性不足和重复性问题。	large language model
20	AMBEDKAR-A Multi-level Bias Elimination through a Decoding Approach with Knowledge Augmentation for Robust Constitutional Alignment of Language Models	提出AMBEDKAR框架，通过知识增强解码消除LLM中印度社会偏见，提升宪法一致性。	large language model
21	JudgeAgent: Knowledge-wise and Dynamic LLM Evaluation with Agent-as-Interviewer	提出JudgeAgent，利用Agent-as-Interviewer进行知识驱动的LLM动态评估	large language model	✅
22	Better by Comparison: Retrieval-Augmented Contrastive Reasoning for Automatic Prompt Optimization	提出对比推理提示优化（CRPO），通过检索增强对比学习提升LLM提示质量。	large language model
23	Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation	提出Genetic Prompt，利用LLM作为遗传算法模拟器，实现条件性合成数据生成。	large language model
24	Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts	提出RW-Steering，通过上下文工程提升LLM在混合和不当上下文中的可信度	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

#	题目	一句话要点	标签	🔗
25	Understanding Reinforcement Learning for Model Training, and future directions with GRAPE	深入剖析指令调优强化学习算法，并提出GRAPE新方向	reinforcement learning PPO DPO
26	Jointly Reinforcing Diversity and Quality in Language Model Generations	提出DARLING框架，联合强化语言模型生成的多样性和质量，提升创造性任务表现。	reinforcement learning large language model instruction following
27	Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR	提出PACS框架，通过监督学习隐式耦合Actor-Critic，提升RLVR中LLM的推理能力。	reinforcement learning PPO large language model	✅
28	ProST: Progressive Sub-task Training for Pareto-Optimal Multi-agent Systems Using Small Language Models	提出ProST渐进式子任务训练方法，提升小型语言模型多智能体系统在复杂任务中的效率和效果。	curriculum learning large language model
29	GRAM-R$^2$: Self-Training Generative Foundation Reward Models for Reward Reasoning	提出GRAM-R$^2$，通过自训练生成式奖励模型实现奖励推理，提升任务泛化性。	reinforcement learning foundation model
30	DCPO: Dynamic Clipping Policy Optimization	DCPO：动态裁剪策略优化，提升LLM在可验证奖励下的推理能力	reinforcement learning large language model
31	StructCoh: Structured Contrastive Learning for Context-Aware Text Semantic Matching	提出StructCoh以解决文本语义匹配中的结构性与语义细微差异问题	contrastive learning

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
32	Implicit Reasoning in Large Language Models: A Comprehensive Survey	综述：大型语言模型中的隐式推理研究进展与执行范式分析	latent optimization large language model chain-of-thought	✅

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
33	Spectrogram Patch Codec: A 2D Block-Quantized VQ-VAE and HiFi-GAN for Neural Speech Coding	提出基于2D块量化VQ-VAE和HiFi-GAN的语音编码方法，简化神经语音编解码器设计。	VQ-VAE

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
34	SSVD: Structured SVD for Parameter-Efficient Fine-Tuning and Benchmarking under Domain Shift in ASR	提出结构化SVD引导的微调方法SSVD，提升ASR领域迁移性能。	semantic mapping semantic map foundation model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2025-09-02）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (24 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (7 篇)

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册