cs.LG(2025-05-28)
📊 共 9 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (5 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | SlimLLM: Accurate Structured Pruning for Large Language Models | 提出SlimLLM以解决大语言模型的结构化剪枝问题 | large language model | ||
| 2 | Revisiting Bayesian Model Averaging in the Era of Foundation Models | 提出基于贝叶斯模型平均的线性分类器以提升分类性能 | foundation model | ||
| 3 | Investigating the effectiveness of multimodal data in forecasting SARS-COV-2 case surges | 提出多模态数据融合方法以提升SARS-COV-2病例激增预测能力 | multimodal | ||
| 4 | SimuGen: Multi-modal Agentic Framework for Constructing Block Diagram-Based Simulation Models | 提出SimuGen以解决Simulink模型生成问题 | large language model multimodal | ✅ | |
| 5 | FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design | 提出FALCON框架以实现全自动化的模拟电路设计 | foundation model |
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | Reinforcement Learning for Out-of-Distribution Reasoning in LLMs: An Empirical Study on Diagnosis-Related Group Coding | 提出DRG-Sapphire以解决临床笔记中的DRG编码问题 | reinforcement learning large language model | ||
| 7 | SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training | 提出SDPO以解决扩散模型训练中的偏差和不稳定问题 | preference learning DPO direct preference optimization | ||
| 8 | A Provable Approach for End-to-End Safe Reinforcement Learning | 提出可证明的终身安全强化学习方法以解决安全性问题 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | Practical Adversarial Attacks on Stochastic Bandits via Fake Data Injection | 提出假数据注入模型以解决随机带宽的对抗攻击问题 | manipulation |