cs.AI (2025-05-30)

📊 39 papers in total | 🔗 5 with code

🎯 Browse by interest area

Pillar 9: Embodied Foundation Models (27 🔗4) · Pillar 2: RL & Architecture (8) · Pillar 1: Robot Control (3 🔗1) · Pillar 3: Perception & Semantics (1)

🔬 Pillar 9: Embodied Foundation Models (27 papers)

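| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 1 | Hidden in Plain Sight: Reasoning in Underspecified and Misspecified Scenarios for Multimodal LLMs | Proposes an implicit-reasoning approach to address shortcomings of multimodal LLMs | large language model, multimodal | |
| 2 | MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge | Proposes MELT to automate emotion data annotation | large language model, multimodal | |
| 3 | Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents | Proposes Open CaptchaWorld to address the shortcomings of multimodal LLM agents on CAPTCHA challenges | multimodal | |
| 4 | Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models | Proposes AdaCVD to address data heterogeneity in cardiovascular disease risk prediction | large language model | |
| 5 | The World As Large Language Models See It: Exploring the reliability of LLMs in representing geographical features | Evaluates the reliability of large language models in representing geographical features | large language model | |
| 6 | Gated Multimodal Graph Learning for Personalized Recommendation | Proposes RLMultimodalRec to address fusion challenges in multimodal recommendation | multimodal | |
| 7 | Towards Scalable Schema Mapping using Large Language Models | Proposes a scalable, LLM-based schema mapping method to address data integration challenges | large language model | |
| 8 | Generative AI for Urban Design: A Stepwise Approach Integrating Human Expertise with Multimodal Diffusion Models | Proposes a stepwise generative framework to improve human-AI collaboration in urban design | multimodal | |
| 9 | FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation | Proposes the FABLE benchmark to evaluate the data-flow reasoning ability of large language models | large language model | |
| 10 | Evaluation of LLMs for mathematical problem solving | Evaluates the performance of large language models on mathematical problem solving | large language model, chain-of-thought | |
| 11 | Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success | Proposes Random Rule Forest to predict startup success | large language model | |
| 12 | Chances and Challenges of the Model Context Protocol in Digital Forensics and Incident Response | Proposes using the Model Context Protocol to address transparency and explainability in digital forensics | large language model | |
| 13 | MIR: Methodology Inspiration Retrieval for Scientific Research Problems | Proposes Methodology Inspiration Retrieval to address scientific research problems | large language model | |
| 14 | Whispers of Many Shores: Cultural Alignment through Collaborative Cultural Expertise | Proposes a soft-prompt fine-tuning framework to address cultural alignment | large language model | |
| 15 | Tournament of Prompts: Evolving LLM Instructions Through Structured Debates and Elo Ratings | Proposes the DEEVO framework to optimize prompt engineering for large language models | large language model | |
| 16 | A survey of using EHR as real-world evidence for discovering and validating new drug indications | Surveys the applications and challenges of electronic health records in discovering new drug indications | large language model | |
| 17 | Memory OS of AI Agent | Proposes MemoryOS to address long-term memory management for large language models | large language model | |
| 18 | Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction | Proposes NextLocMoE for personalized, semantic-aware next location prediction | large language model | |
| 19 | Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation | Proposes a structured misinformation generation method based on knowledge graphs and large language models | large language model | |
| 20 | Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning | Optimizes the interface between knowledge graphs and large language models to improve complex reasoning | large language model | |
| 21 | LPASS: Linear Probes as Stepping Stones for vulnerability detection using compressed LLMs | Proposes LPASS to improve the efficiency of compressed LLMs in vulnerability detection | large language model | |
| 22 | RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation | Proposes RMoA to optimize the efficiency and reliability of multi-agent systems | large language model | |
| 23 | GridRoute: A Benchmark for LLM-Based Route Planning with Cardinal Movement in Grid Environments | Proposes the GridRoute benchmark to improve LLM route planning in grid environments | large language model | |
| 24 | TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents | Proposes TRAPDOC to address users' over-reliance on large language models | large language model | |
| 25 | Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules | Proposes QuAda to enable quotation-aware dialogue in large language models | large language model | |
| 26 | E^2GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness | Proposes E^2GraphRAG to address the inefficiency of graph-based RAG | large language model | |
| 27 | Learning API Functionality from In-Context Demonstrations for Tool-based Agents | Proposes learning API functionality from in-context demonstrations to address missing documentation | large language model | |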

🔬 Pillar 2: RL & Architecture (8 papers)

| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 28 | MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning | Proposes the MiCRo framework to address diversity in personalized preference learning | reinforcement learning, preference learning, RLHF | |
| 29 | SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought | Proposes the SCOUT framework to improve language model reasoning | distillation, large language model, chain-of-thought | |
| 30 | How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning | Explores the interplay of SFT and RL to improve LLM reasoning | reinforcement learning, large language model, chain-of-thought | |
| 31 | AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models | Proposes AXIOM to address the data inefficiency of deep reinforcement learning | reinforcement learning, deep reinforcement learning, DRL | |
| 32 | A Reward-driven Automated Webshell Malicious-code Generator for Red-teaming | Proposes RAWG to address insufficient diversity in generated webshell malicious code | reinforcement learning, PPO, large language model | |
| 33 | Control-R: Towards controllable test-time scaling | Proposes Reasoning Control Fields to address control in long chain-of-thought reasoning | distillation, chain-of-thought | |
| 34 | ProofNet++: A Neuro-Symbolic System for Formal Proof Verification with Self-Correction | Proposes ProofNet++ to address logical reasoning in automated theorem proving | reinforcement learning, large language model | |
| 35 | Intrinsic Goals for Autonomous Agents: Model-Based Exploration in Virtual Zebrafish Predicts Ethological Behavior and Whole-Brain Dynamics | Proposes 3M-Progress to address insufficient exploration in autonomous agents | reinforcement learning, world model | |

🔬 Pillar 1: Robot Control (3 papers)

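| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 36 | SEAR: A Multimodal Dataset for Analyzing AR-LLM-Driven Social Engineering Behaviors | Proposes the SEAR dataset to analyze augmented-reality-driven social engineering attack behaviors | manipulation, large language model, multimodal | |
| 37 | Adversarial Threat Vectors and Risk Mitigation for Retrieval-Augmented Generation Systems | Proposes risk mitigation measures to counter adversarial threats to retrieval-augmented generation systems | manipulation, large language model | |
| 38 | SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems | Proposes SentinelAgent to address anomaly detection in multi-agent systems | manipulation, large language model | |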

🔬 Pillar 3: Perception & Semantics (1 paper)

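| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 39 | A Red Teaming Roadmap Towards System-Level Safety | Proposes a system-level red-teaming strategy to address LLM safety challenges | affordance, large language model | |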
