cs.AI(2025-08-31)
📊 共 23 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (16 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (6)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (16 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Self-Exploring Language Models for Explainable Link Forecasting on Temporal Graphs via Reinforcement Learning | 提出ReaL-TG框架以实现可解释的时间图链接预测 | reinforcement learning large language model | ||
| 18 | Clone What You Can't Steal: Black-Box LLM Replication via Logit Leakage and Distillation | 提出黑箱LLM复制方法以应对API安全漏洞 | distillation large language model | ||
| 19 | Adaptive Vehicle Speed Classification via BMCNN with Reinforcement Learning-Enhanced Acoustic Processing | 提出基于BMCNN和强化学习的自适应车辆速度分类方法以应对交通拥堵问题 | reinforcement learning PPO TD3 | ||
| 20 | CoreThink: A Symbolic Reasoning Layer to reason over Long Horizon Tasks with LLMs | 提出CoreThink以解决长时间任务推理问题 | reinforcement learning instruction following | ||
| 21 | TinyMusician: On-Device Music Generation with Knowledge Distillation and Mixed Precision Quantization | 提出TinyMusician以解决边缘设备音乐生成问题 | distillation | ||
| 22 | It's-A-Me, Quantum Mario: Scalable Quantum Reinforcement Learning with Multi-Chip Ensembles | 提出多芯片集成框架以解决量子强化学习的可扩展性问题 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 23 | Integrated Simulation Framework for Adversarial Attacks on Autonomous Vehicles | 提出集成仿真框架以应对自动驾驶车辆的对抗攻击问题 | manipulation |