cs.LG（2026-04-08）

📊 共 26 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (12 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (10) 支柱一：机器人控制 (Robot Control) (3) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (12 篇)

#	题目	一句话要点	标签	🔗	⭐
1	OmniTabBench: Mapping the Empirical Frontiers of GBDTs, Neural Networks, and Foundation Models for Tabular Data at Scale	OmniTabBench：大规模表格数据上GBDT、神经网络和基础模型的经验前沿探索	large language model foundation model
2	Frailty Estimation in Elderly Oncology Patients Using Multimodal Wearable Data and Multi-Instance Learning	提出基于多模态可穿戴数据和多示例学习的老年肿瘤患者虚弱程度评估框架	multimodal
3	STQuant: Spatio-Temporal Adaptive Framework for Optimizer Quantization in Large Multimodal Model Training	提出STQuant框架，通过时空自适应优化器量化降低大模型训练内存占用。	multimodal
4	Geometric Properties of the Voronoi Tessellation in Latent Semantic Manifolds of Large Language Models	研究大型语言模型潜在语义空间中的Voronoi tessellation几何特性，提出margin refinement procedures优化模型。	large language model
5	Bi-level Heterogeneous Learning for Time Series Foundation Models: A Federated Learning Approach	提出双层异构联邦学习方法，用于训练时间序列基础模型，提升异构环境下的泛化能力。	foundation model
6	On the Price of Privacy for Language Identification and Generation	研究差分隐私对语言识别与生成任务的影响，量化隐私保护的代价。	large language model
7	Beyond the Mean: Modelling Annotation Distributions in Continuous Affect Prediction	提出基于Beta分布的连续情感预测模型，建模标注分布以提升性能。	multimodal
8	Selective Neuron Amplification for Training-Free Task Enhancement	提出选择性神经元放大（SNA）方法，无需训练即可提升大语言模型在特定任务上的表现。	large language model
9	MoE Routing Testbed: Studying Expert Specialization and Routing Behavior at Small Scale	提出MoE路由测试平台，用于小规模研究专家特化和路由行为	large language model
10	ConceptTracer: Interactive Analysis of Concept Saliency and Selectivity in Neural Representations	ConceptTracer：交互式分析神经表征中概念显著性和选择性的工具	foundation model	✅
11	MoBiE: Efficient Inference of Mixture of Binary Experts under Post-Training Quantization	提出MoBiE以解决MoE模型量化效率问题	large language model	✅
12	ExplainFuzz: Explainable and Constraint-Conditioned Test Generation with Probabilistic Circuits	ExplainFuzz：利用概率电路实现可解释和约束条件的测试用例生成	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
13	Equivariant Multi-agent Reinforcement Learning for Multimodal Vehicle-to-Infrastructure Systems	提出一种基于等变多智能体强化学习的V2I系统资源优化方法。	reinforcement learning multimodal
14	Epistemic Robust Offline Reinforcement Learning	提出基于不确定性集合的离线强化学习框架，提升策略鲁棒性和泛化性	reinforcement learning SAC offline RL
15	Smart Commander: A Hierarchical Reinforcement Learning Framework for Fleet-Level PHM Decision Optimization	提出Smart Commander以优化军用航空舰队的PHM决策	reinforcement learning deep reinforcement learning DRL
16	Multi-Turn Reasoning LLMs for Task Offloading in Mobile Edge Computing	提出COMLLM框架，利用多轮推理LLM解决移动边缘计算中的任务卸载问题	reinforcement learning deep reinforcement learning DRL
17	The Theorems of Dr. David Blackwell and Their Contributions to Artificial Intelligence	总结David Blackwell理论对AI的贡献，涵盖信息压缩、序贯决策和信息比较。	reinforcement learning RLHF large language model
18	Predictive Representations for Skill Transfer in Reinforcement Learning	提出基于结果预测状态表示的技能迁移强化学习方法	reinforcement learning
19	Extraction of linearized models from pre-trained networks via knowledge distillation	提出基于知识蒸馏的线性化模型提取框架，提升线性模型分类精度。	distillation
20	TwinLoop: Simulation-in-the-Loop Digital Twins for Online Multi-Agent Reinforcement Learning	TwinLoop：面向在线多智能体强化学习的仿真环数字孪生，提升适应效率	reinforcement learning
21	Android Coach: Improve Online Agentic Training Efficiency with Single State Multiple Actions	Android Coach：通过单状态多动作提升在线Agent训练效率	reinforcement learning PPO
22	A First Guess is Rarely the Final Answer: Learning to Search in the Travelling Salesperson Problem	NICO-TSP：学习TSP问题的搜索策略，提升求解效率与泛化性	reinforcement learning imitation learning

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
23	Graph Neural ODE Digital Twins for Control-Oriented Reactor Thermal-Hydraulic Forecasting Under Partial Observability	提出GNN-ODE数字孪生模型，用于部分可观测下反应堆热工水力预测与控制	sim-to-real latent dynamics MAE
24	The Rhetoric of Machine Learning	揭示机器学习的修辞本质：从客观建模到说服艺术	manipulation world model world models
25	DDP-SA: Scalable Privacy-Preserving Federated Learning via Distributed Differential Privacy and Secure Aggregation	提出DDP-SA框架，通过分布式差分隐私和安全聚合实现可扩展的隐私保护联邦学习。	MPC

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
26	ELC: Evidential Lifelong Classifier for Uncertainty Aware Radar Pulse Classification	提出ELC：一种基于证据理论的终身分类器，用于不确定性感知的雷达脉冲分类	PULSE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页