cs.LG (2025-09-06)
📊 11 papers in total
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (6)
Pillar 2: RL & Architecture (3)
Pillar 1: Robot Control (2)
🔬 Pillar 9: Embodied Foundation Models (6 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Fisher Random Walk: Automatic Debiasing Contextual Preference Inference for Large Language Model Evaluation | Proposes the Fisher random walk method for automatically debiased contextual preference inference in large language model evaluation. | large language model | ||
| 2 | time2time: Causal Intervention in Hidden States to Simulate Rare Events in Time Series Foundation Models | Proposes a causal intervention method for time series Transformer models to simulate rare events and perform stress testing. | foundation model | ||
| 3 | Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction | Proposes an adaptive routing framework to address data heterogeneity in multimodal multitask prediction. | multimodal | ||
| 4 | GraMFedDHAR: Graph Based Multimodal Differentially Private Federated HAR | GraMFedDHAR: graph neural networks and differentially private federated learning for multimodal human activity recognition. | multimodal | ||
| 5 | ProfilingAgent: Profiling-Guided Agentic Reasoning for Adaptive Model Optimization | ProfilingAgent: uses profiling-guided agentic reasoning to achieve adaptive model optimization. | large language model, foundation model | ||
| 6 | Finetuning LLMs for Human Behavior Prediction in Social Science Experiments | Fine-tunes LLMs to improve the accuracy of human behavior prediction in social science experiments. | large language model | ||
🔬 Pillar 2: RL & Architecture (3 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
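| 7 | Causal Debiasing Medical Multimodal Representation Learning with Missing Modalities | Proposes a causally debiased multimodal medical representation learning method that addresses the bias introduced by missing modalities. | predictive model, representation learning, multimodal | ||
| 8 | Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies | A comparative study of offline and online learning in model-based reinforcement learning, showing how data collection strategies affect performance. | reinforcement learning, world model, model-based RL | ||
| 9 | Reinforcement Learning with Anticipation: A Hierarchical Approach for Long-Horizon Tasks | Proposes an anticipation-based reinforcement learning framework to address hierarchical policy learning in long-horizon tasks. | reinforcement learning, geometric consistency | ||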
🔬 Pillar 1: Robot Control (2 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
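| 10 | A Physics-Informed Neural Networks-Based Model Predictive Control Framework for $SIR$ Epidemics | Proposes a model predictive control framework based on physics-informed neural networks to address SIR epidemic modeling. | MPC, model predictive control | ||
| 11 | Simulation Priors for Data-Efficient Deep Learning | SimPEL: uses simulation priors to improve deep learning performance in data-scarce settings. | sim-to-real, reinforcement learning | ||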