cs.AI (2026-04-09)

📊 48 papers in total | 🔗 5 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (30 🔗2) | Pillar 2: RL Algorithms & Architecture (15 🔗3) | Pillar 6: Video Extraction & Matching (1) | Pillar 1: Robot Control (1) | Pillar 3: Spatial Perception & Semantics (1)

🔬 Pillar 9: Embodied Foundation Models (30 papers)

#	Title	One-sentence summary	Tags	🔗
1	How Far Are Large Multimodal Models from Human-Level Spatial Action? A Benchmark for Goal-Oriented Embodied Navigation in Urban Airspace	Builds an urban-airspace navigation benchmark to evaluate the embodied spatial action abilities of large multimodal models.	vision-language-action multimodal	✅
2	MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems	MONETA: multimodal industry classification using geographic information and multi-agent systems.	large language model multimodal
3	CIAO - Code In Architecture Out - Automated Software Architecture Documentation with Large Language Models	CIAO: uses large language models to generate software architecture documentation automatically, improving system comprehensibility.	large language model
4	ImplicitMemBench: Measuring Unconscious Behavioral Adaptation in Large Language Models	Proposes ImplicitMemBench, a benchmark for evaluating unconscious behavioral adaptation in large language models.	large language model
5	MIMIC-Py: An Extensible Tool for Personality-Driven Automated Game Testing with Large Language Models	MIMIC-Py: an extensible LLM-based tool for personality-driven automated game testing.	large language model	✅
6	Emotion Concepts and their Function in a Large Language Model	Identifies functional emotions in a large language model: emotion concepts influence model behavior and alignment.	large language model
7	Learning Who Disagrees: Demographic Importance Weighting for Modeling Annotator Distributions with DiADEM	Proposes DiADEM, which models annotator distributions through demographic importance weighting to improve understanding of subjective content.	large language model chain-of-thought
8	Wiring the 'Why': A Unified Taxonomy and Survey of Abductive Reasoning in LLMs	Builds a unified taxonomy of abductive reasoning and comprehensively surveys abductive reasoning in LLMs.	large language model
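9	Are we still able to recognize pearls? Machine-driven peer review and the risk to creativity: An explainable RAG-XAI detection framework with markers extraction	Proposes a RAG-XAI framework for detecting machine-driven peer review, safeguarding scientific creativity.	large language model
10	Visual Perceptual to Conceptual First-Order Rule Learning Networks	Proposes the γILP framework for learning first-order rules from image data while generating predicates automatically.	large language model
11	From Safety Risk to Design Principle: Peer-Preservation in Multi-Agent LLM Systems and Its Implications for Orchestrated Democratic Discourse Analysis	Addresses peer-preservation in multi-agent LLM systems by proposing an architecture for democratic discourse analysis based on identity anonymization.	large language model
12	Verify Before You Commit: Towards Faithful Reasoning in LLM Agents via Self-Auditing	Proposes the SAVeR framework, which uses self-auditing to keep LLM agent reasoning faithful.	large language model
13	SkillClaw: Let Skills Evolve Collectively with Agentic Evolver	SkillClaw: evolves skills collectively through an Agentic Evolver, improving the performance of multi-user agent ecosystems.	large language model
14	Don't Overthink It: Inter-Rollout Action Agreement as a Free Adaptive-Compute Signal for LLM Agents	TrACE: an adaptive-compute controller for LLM agents based on action agreement.	large language model
15	Neural-Symbolic Knowledge Tracing: Injecting Educational Knowledge into Deep Learning for Responsible Learner Modelling	Proposes Responsible-DKT, a neural-symbolic knowledge tracing method that incorporates educational knowledge to make learner modelling more responsible.	large language model
16	IoT-Brain: Grounding LLMs for Semantic-Spatial Sensor Scheduling	IoT-Brain: connects LLMs to the physical world through a spatial trajectory graph (STG) for semantic-spatial sensor scheduling.	large language model
17	DialBGM: A Benchmark for Background Music Recommendation from Everyday Multi-Turn Dialogues	DialBGM: a benchmark dataset for background music recommendation from everyday multi-turn dialogues.	multimodal
18	An Agentic Evaluation Architecture for Historical Bias Detection in Educational Textbooks	Proposes an agentic evaluation architecture for detecting historical bias in educational textbooks.	multimodal
19	PyVRP$^+$: LLM-Driven Metacognitive Heuristic Evolution for Hybrid Genetic Search in Vehicle Routing Problems	PyVRP$^+$: LLM-driven metacognitive heuristic evolution for hybrid genetic search in vehicle routing problems.	large language model
20	SPARD: Self-Paced Curriculum for RL Alignment via Integrating Reward Dynamics and Data Utility	SPARD: a self-paced curriculum for RL alignment that integrates reward dynamics and data utility.	large language model
21	Filling the Gaps: Selective Knowledge Augmentation for LLM Recommenders	Proposes KnowSA_CKP, which improves the performance and efficiency of LLM recommenders through selective knowledge augmentation.	large language model
22	More Capable, Less Cooperative? When LLMs Fail At Zero-Cost Collaboration	Reveals why LLMs fail at zero-cost collaboration and stresses that scaling intelligence is not the only route to multi-agent collaboration.	large language model
23	Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution	Squeeze Evolve: a unified multi-model orchestration framework for verifier-free evolution.	multimodal
24	Towards Knowledgeable Deep Research: Framework and Benchmark	Proposes the hybrid knowledge analysis framework HKA to fuse structured and unstructured knowledge in deep research.	multimodal
25	Multi-Agent Orchestration for High-Throughput Materials Screening on a Leadership-Class System	Proposes a multi-agent orchestration framework for high-throughput materials screening that improves HPC system utilization.	large language model
26	MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems	MONETA: multimodal industry classification using geographic information and multi-agent systems.	large language model multimodal
27	Demystifying the Silence of Correctness Bugs in PyTorch Compiler	Targets correctness bugs in the PyTorch compiler and proposes AlignGuard, a detection method based on LLM-driven mutation.	large language model
28	Model Space Reasoning as Search in Feedback Space for Planning Domain Generation	Proposes model space reasoning as search in feedback space for automatic planning domain generation.	large language model
29	Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution	Squeeze Evolve: a unified multi-model orchestration framework for verifier-free evolution.	multimodal
30	Towards Knowledgeable Deep Research: Framework and Benchmark	Proposes the hybrid knowledge analysis framework HKA to fuse structured and unstructured knowledge in deep research.	multimodal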

🔬 Pillar 2: RL Algorithms & Architecture (15 papers)

#	Title	One-sentence summary	Tags	🔗
31	ASPECT: Analogical Semantic Policy Execution via Language Conditioned Transfer	ASPECT: analogical semantic policy execution via language-conditioned transfer.	reinforcement learning large language model language conditioned
32	WorldMAP: Bootstrapping Vision-Language Navigation Trajectory Prediction with Generative World Models	WorldMAP: bootstraps vision-language navigation trajectory prediction with generative world models.	world model world models egocentric
33	Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest	Analyzes how large language models behave under conflicts of interest and reveals the effect of ad placement on user welfare.	reinforcement learning large language model
34	Multimodal Reasoning with LLM for Encrypted Traffic Interpretation: A Benchmark	Proposes the BGTD benchmark and the mmTraffic framework for interpretable multimodal reasoning over encrypted traffic.	Mamba multimodal	✅
35	Investigation of Automated Design of Quantum Circuits for Imaginary Time Evolution Methods Using Deep Reinforcement Learning	Uses deep reinforcement learning to design quantum circuits automatically, accelerating imaginary time evolution algorithms.	reinforcement learning deep reinforcement learning
36	Grounding Clinical AI Competency in Human Cognition Through the Clinical World Model and Skill-Mix Framework	Proposes the Clinical World Model and the Skill-Mix framework to bridge the gap between clinical AI competency and human cognition.	world model world models
37	SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions	SUPERNOVA: improves the general reasoning ability of LLMs with reinforcement learning on natural instructions.	reinforcement learning large language model	✅
38	Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling	Proposes Plan-RewardBench for evaluating the alignment ability of trajectory-level reward models in tool-integrated environments.	reinforcement learning RLHF large language model
39	HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation	Proposes HiRO-Nav to address reasoning efficiency in long-horizon navigation tasks.	reinforcement learning multimodal
40	Beyond Stochastic Exploration: What Makes Training Data Valuable for Agentic Search	Proposes the HiExp framework to improve LLM reasoning efficiency and training stability in agentic search.	reinforcement learning large language model
41	ReRec: Reasoning-Augmented LLM-based Recommendation Assistant via Reinforcement Fine-tuning	Proposes ReRec, which strengthens LLM reasoning in recommendation tasks through reinforcement fine-tuning.	reward shaping instruction following	✅
42	Mitigating Distribution Sharpening in Math RLVR via Distribution-Aligned Hint Synthesis and Backward Hint Annealing	Proposes DAHS and BHA to mitigate distribution sharpening in math RLVR and improve problem-solving coverage.	reinforcement learning teacher-student
43	ASPECT: Analogical Semantic Policy Execution via Language Conditioned Transfer	ASPECT: analogical semantic policy execution via language-conditioned transfer.	reinforcement learning large language model language conditioned
44	RAMP: Hybrid DRL for Online Learning of Numeric Action Models	Proposes RAMP, a hybrid DRL algorithm for online learning of numeric action models.	reinforcement learning deep reinforcement learning DRL
45	eBandit: Kernel-Driven Reinforcement Learning for Adaptive Video Streaming	Proposes eBandit to address insufficient network monitoring in adaptive video streaming.	reinforcement learning

🔬 Pillar 6: Video Extraction & Matching (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
46	From Gaze to Guidance: Interpreting and Adapting to Users' Cognitive Needs with Multimodal Gaze-Aware AI Assistants	Proposes a multimodal AI assistant based on eye tracking to enhance users' cognitive abilities.	egocentric multimodal

🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
47	Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions	Proposes a taxonomy of RAG security threats and analyzes attacks, defenses, and future directions.	manipulation large language model

🔬 Pillar 3: Spatial Perception & Semantics (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
48	PSI: Shared State as the Missing Layer for Coherent AI-Generated Instruments in Personal AI Agents	PSI: shared state as the key layer for coherent AI-generated instruments in personal AI agents.	affordance
