cs.AI（2025-05-30）

📊 共 39 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (27 🔗4) 支柱二：RL算法与架构 (RL & Architecture) (8) 支柱一：机器人控制 (Robot Control) (3 🔗1) 支柱三：空间感知与语义 (Perception & Semantics) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (27 篇)

#	题目	一句话要点	标签	🔗
1	Hidden in Plain Sight: Reasoning in Underspecified and Misspecified Scenarios for Multimodal LLMs	提出隐性推理方法以解决多模态大语言模型的不足问题	large language model multimodal
2	MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge	提出MELT以解决情感数据标注的自动化问题	large language model multimodal
3	Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents	提出Open CaptchaWorld以解决多模态LLM代理在CAPTCHA挑战中的不足	multimodal
4	Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models	提出AdaCVD以解决心血管疾病风险预测中的数据异质性问题	large language model
5	The World As Large Language Models See It: Exploring the reliability of LLMs in representing geographical features	评估大型语言模型在地理特征表示中的可靠性	large language model
6	Gated Multimodal Graph Learning for Personalized Recommendation	提出RLMultimodalRec以解决多模态推荐中的融合挑战	multimodal
7	Towards Scalable Schema Mapping using Large Language Models	提出基于大语言模型的可扩展模式映射方法以解决数据集成挑战	large language model
8	Generative AI for Urban Design: A Stepwise Approach Integrating Human Expertise with Multimodal Diffusion Models	提出分步生成框架以提升城市设计中的人机协作	multimodal
9	FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation	提出FABLE基准以评估大语言模型的数据流推理能力	large language model
10	Evaluation of LLMs for mathematical problem solving	评估大型语言模型在数学问题求解中的表现	large language model chain-of-thought
11	Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success	提出随机规则森林以解决创业成功预测问题	large language model
12	Chances and Challenges of the Model Context Protocol in Digital Forensics and Incident Response	提出模型上下文协议以解决数字取证中的透明性和可解释性问题	large language model
13	MIR: Methodology Inspiration Retrieval for Scientific Research Problems	提出方法论灵感检索以解决科学研究问题	large language model
14	Whispers of Many Shores: Cultural Alignment through Collaborative Cultural Expertise	提出软提示微调框架以解决文化对齐问题	large language model
15	Tournament of Prompts: Evolving LLM Instructions Through Structured Debates and Elo Ratings	提出DEEVO框架以优化大语言模型的提示工程	large language model
16	A survey of using EHR as real-world evidence for discovering and validating new drug indications	综述电子健康记录在新药适应症发现中的应用与挑战	large language model
17	Memory OS of AI Agent	提出MemoryOS以解决大语言模型的长期记忆管理问题	large language model	✅
18	Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction	提出NextLocMoE以解决个性化和语义感知的下一个位置预测问题	large language model
19	Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation	提出基于知识图谱和大型语言模型的结构化虚假信息生成方法	large language model
20	Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning	优化知识图谱与大语言模型接口以提升复杂推理能力	large language model
21	LPASS: Linear Probes as Stepping Stones for vulnerability detection using compressed LLMs	提出LPASS以提高压缩LLM在漏洞检测中的效率	large language model
22	RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation	提出RMoA以优化多智能体系统的效率与可靠性	large language model	✅
23	GridRoute: A Benchmark for LLM-Based Route Planning with Cardinal Movement in Grid Environments	提出GridRoute基准以提升LLM在网格环境中的路径规划能力	large language model	✅
24	TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents	提出TRAPDOC以解决用户对大型语言模型的过度依赖问题	large language model	✅
25	Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules	提出QuAda以解决大语言模型中的引用意识对话问题	large language model
26	E^2GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness	提出E^2GraphRAG以解决图基RAG效率低下问题	large language model
27	Learning API Functionality from In-Context Demonstrations for Tool-based Agents	提出从上下文示例中学习API功能以解决文档缺失问题	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

#	题目	一句话要点	标签
28	MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning	提出MiCRo框架以解决个性化偏好学习中的多样性问题	reinforcement learning preference learning RLHF
29	SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought	提出SCOUT框架以提升语言模型推理能力	distillation large language model chain-of-thought
30	How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning	探讨SFT与RL的协同作用以提升LLM推理能力	reinforcement learning large language model chain-of-thought
31	AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models	提出AXIOM以解决深度强化学习的数据效率问题	reinforcement learning deep reinforcement learning DRL
32	A Reward-driven Automated Webshell Malicious-code Generator for Red-teaming	提出RAWG以解决WebShell恶意代码生成多样性不足问题	reinforcement learning PPO large language model
33	Control-R: Towards controllable test-time scaling	提出Reasoning Control Fields以解决长链推理中的控制问题	distillation chain-of-thought
34	ProofNet++: A Neuro-Symbolic System for Formal Proof Verification with Self-Correction	提出ProofNet++以解决自动定理证明中的逻辑推理问题	reinforcement learning large language model
35	Intrinsic Goals for Autonomous Agents: Model-Based Exploration in Virtual Zebrafish Predicts Ethological Behavior and Whole-Brain Dynamics	提出3M-Progress以解决自主智能体探索不足问题	reinforcement learning world model

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

#	题目	一句话要点	标签	🔗
36	SEAR: A Multimodal Dataset for Analyzing AR-LLM-Driven Social Engineering Behaviors	提出SEAR数据集以分析增强现实驱动的社会工程攻击行为	manipulation large language model multimodal	✅
37	Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems	提出风险控制措施以应对检索增强生成系统的对抗威胁	manipulation large language model
38	SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems	提出SentinelAgent以解决多智能体系统中的异常检测问题	manipulation large language model

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
39	A Red Teaming Roadmap Towards System-Level Safety	提出系统级安全红队策略以应对LLM的安全挑战	affordance large language model

⬅️ 返回 cs.AI 首页 · 🏠 返回主页

cs.AI（2025-05-30）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (27 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册