cs.LG（2025-08-12）

📊 共 25 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (13 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (10) 支柱八：物理动画 (Physics-based Animation) (2)

🔬 支柱二：RL算法与架构 (RL & Architecture) (13 篇)

#	题目	一句话要点	标签	🔗	⭐
1	$\text{M}^{2}$LLM: Multi-view Molecular Representation Learning with Large Language Models	提出M²LLM以解决分子属性预测的多视角问题	representation learning large language model
2	Bridging Formal Language with Chain-of-Thought Reasoning to Geometry Problem Solving	提出GF-Reasoner以解决几何问题求解中的推理不足	reinforcement learning chain-of-thought
3	Scaling Up Active Testing to Large Language Models	提出高效的主动测试方法以评估大型语言模型	predictive model large language model
4	Generative Modeling for Robust Deep Reinforcement Learning on the Traveling Salesman Problem	提出COGS以解决旅行商问题的分布鲁棒性挑战	reinforcement learning deep reinforcement learning
5	Distilling Reinforcement Learning into Single-Batch Datasets	提出强化学习蒸馏方法以生成单批次数据集	reinforcement learning distillation
6	Interpretable Reward Model via Sparse Autoencoder	提出稀疏自编码器增强的奖励模型以解决传统模型可解释性不足问题	reinforcement learning RLHF large language model	✅
7	Multi-level Collaborative Distillation Meets Global Workspace Model: A Unified Framework for OCIL	提出多层协作蒸馏以解决在线增量学习中的稳定性与适应性问题	distillation
8	A Personalized Exercise Assistant using Reinforcement Learning (PEARL): Results from a four-arm Randomized-controlled Trial	提出个性化运动助手PEARL以解决身体活动不足问题	reinforcement learning
9	Pattern-based Knowledge Component Extraction from Student Code Using Representation Learning	提出基于模式的知识组件提取框架以解决编程教育中的自动化问题	representation learning
10	Constrained Black-Box Attacks Against Multi-Agent Reinforcement Learning	提出约束黑箱攻击方法以解决多智能体强化学习的脆弱性问题	reinforcement learning
11	PersRM-R1: Enhance Personalized Reward Modeling with Reinforcement Learning	提出PersRM-R1以解决个性化奖励建模中的数据稀缺问题	reinforcement learning
12	GRAVITY: A Controversial Graph Representation Learning for Vertex Classification	提出GRAVITY以解决图节点分类中的动态聚合问题	representation learning
13	MCLPD:Multi-view Contrastive Learning for EEG-based PD Detection Across Datasets	提出MCLPD以解决跨数据集的帕金森病检测问题	contrastive learning

🔬 支柱九：具身大模型 (Embodied Foundation Models) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
14	KnowDR-REC: A Benchmark for Referring Expression Comprehension with Real-World Knowledge	提出KnowDR-REC以解决多模态推理能力不足问题	large language model multimodal visual grounding
15	A Generative Imputation Method for Multimodal Alzheimer's Disease Diagnosis	提出生成填补方法以解决阿尔茨海默病多模态数据缺失问题	multimodal
16	Oblivionis: A Lightweight Learning and Unlearning Framework for Federated Large Language Models	提出Oblivionis框架以解决联邦大语言模型的遗忘问题	large language model
17	Resurrecting the Salmon: Rethinking Mechanistic Interpretability with Domain-Specific Sparse Autoencoders	提出领域特定稀疏自编码器以提升语言模型的可解释性	large language model foundation model
18	Teaching Code Refactoring Using LLMs	利用大型语言模型提升代码重构教学效果	large language model
19	xRFM: Accurate, scalable, and interpretable feature learning models for tabular data	提出xRFM以解决表格数据特征学习问题	foundation model
20	LLM Empowered Prototype Learning for Zero and Few-Shot Tasks on Tabular Data	提出基于LLM的原型学习框架以解决表格数据的零样本和少样本问题	large language model
21	Differentiated Information Mining: A Semi-supervised Learning Framework for GNNs	提出差异化因子一致性半监督框架以解决GNN伪标签偏差问题	multimodal
22	MiGrATe: Mixed-Policy GRPO for Adaptation at Test-Time	提出MiGrATe以解决黑箱优化任务中的适应性问题	large language model
23	Classifier Language Models: Unifying Sparse Finetuning and Adaptive Tokenization for Specialized Classification Tasks	提出稀疏微调与自适应标记化结合的方法以解决专业分类任务问题	large language model

🔬 支柱八：物理动画 (Physics-based Animation) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
24	GSMT: Graph Fusion and Spatiotemporal TaskCorrection for Multi-Bus Trajectory Prediction	提出GSMT以解决城市公交轨迹预测问题	spatiotemporal multimodal
25	UQGNN: Uncertainty Quantification of Graph Neural Networks for Multivariate Spatiotemporal Prediction	提出UQGNN以解决多变量时空预测中的不确定性量化问题	spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页