cs.LG（2025-08-27）

📊 共 29 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (15 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (11 🔗1) 支柱一：机器人控制 (Robot Control) (2) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (15 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Pre-trained knowledge elevates large language models beyond traditional chemical reaction optimizers	利用预训练知识提升大语言模型在化学反应优化中的表现	large language model
2	ECG-Soup: Harnessing Multi-Layer Synergy for ECG Foundation Models	提出ECG-Soup以提升心电图基础模型的性能	foundation model
3	Cross-Platform E-Commerce Product Categorization and Recategorization: A Multimodal Hierarchical Classification Approach	提出多模态层次分类框架以解决电商产品分类问题	multimodal
4	FinCast: A Foundation Model for Financial Time-Series Forecasting	提出FinCast以解决金融时间序列预测中的复杂性问题	foundation model
5	A Systematic Review on the Generative AI Applications in Human Medical Genomics	系统评估生成性AI在医学基因组学中的应用	large language model multimodal
6	SCAR: A Characterization Scheme for Multi-Modal Dataset	提出SCAR方案以表征多模态数据集特性	foundation model multimodal	✅
7	Robustness is Important: Limitations of LLMs for Data Fitting	揭示LLMs在数据拟合中的脆弱性及其局限性	large language model foundation model
8	The LLM as a Network Operator: A Vision for Generative AI in the 6G Radio Access Network	提出LLM-RAN操作员以解决未来6G无线网络管理复杂性问题	large language model
9	LLM-QUBO: An End-to-End Framework for Automated QUBO Transformation from Natural Language Problem Descriptions	提出LLM-QUBO框架以自动化QUBO转换解决优化问题	large language model
10	Symphony: A Decentralized Multi-Agent Framework for Scalable Collective Intelligence	提出Symphony以解决集中式多代理系统的局限性	large language model
11	Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation	提出线性时间示例选择算法以优化上下文学习	chain-of-thought
12	CrystalICL: Enabling In-Context Learning for Crystal Generation	提出CrystalICL以解决晶体生成中的少样本学习问题	large language model
13	Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs	提出生成自我精炼方法以提升大语言模型的推理能力	large language model
14	Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era	提出生成模型以解决数据稀缺和隐私问题	large language model	✅
15	MobText-SISA: Efficient Machine Unlearning for Mobility Logs with Spatio-Temporal and Natural-Language Data	提出MobText-SISA以解决移动日志中的机器遗忘问题	multimodal

🔬 支柱二：RL算法与架构 (RL & Architecture) (11 篇)

#	题目	一句话要点	标签	🔗	⭐
16	Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning	提出反事实奖励模型以缓解多模态强化学习中的偏见问题	reinforcement learning RLHF representation learning
17	Data-Efficient Symbolic Regression via Foundation Model Distillation	提出EQUATE框架以解决小数据集下的符号回归问题	distillation foundation model
18	Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning	提出自适应缩放策略约束以解决离线强化学习中的超参数调优问题	reinforcement learning offline RL offline reinforcement learning	✅
19	Dynamics-Aligned Latent Imagination in Contextual World Models for Zero-Shot Generalization	提出DALI以解决零-shot泛化中的环境适应问题	reinforcement learning world model dreamer
20	Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning	提出RLTR框架以解决LLM代理规划能力不足问题	reinforcement learning large language model
21	Learning Game-Playing Agents with Generative Code Optimization	提出生成代码优化方法以学习游戏智能体	reinforcement learning deep reinforcement learning large language model
22	The Role of Teacher Calibration in Knowledge Distillation	提出教师模型校准方法以提升知识蒸馏效果	distillation
23	Reinforcement Learning for Search Tree Size Minimization in Constraint Programming: New Results on Scheduling Benchmarks	基于强化学习的约束编程搜索树大小最小化方法	reinforcement learning
24	Interestingness First Classifiers	提出EUREKA框架以构建有趣的分类器	Eureka large language model
25	MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation	提出MicroLad以解决3D微观结构重建问题	distillation
26	PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense	提出PoolFlip以解决网络防御中的决策自动化问题	reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
27	Multi-Agent Reinforcement Learning in Intelligent Transportation Systems: A Comprehensive Survey	综述多智能体强化学习在智能交通系统中的应用与挑战	sim-to-real reinforcement learning
28	Pruning Strategies for Backdoor Defense in LLMs	提出注意力头剪枝策略以防御大语言模型中的后门攻击	manipulation reinforcement learning

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
29	Experimental End-to-End Optimization of Directly Modulated Laser-based IM/DD Transmission	基于数据驱动模型优化直接调制激光的传输性能	PULSE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页