cs.LG (2025-05-06)

📊 25 papers in total | 🔗 5 with code

🎯 Interest Area Navigation

Pillar 2: RL & Architecture (11 🔗1) · Pillar 9: Embodied Foundation Models (11 🔗4) · Pillar 1: Robot Control (2) · Pillar 8: Physics-based Animation (1)

🔬 Pillar 2: RL & Architecture (11 papers)

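| # | Title | One-sentence summary | Tags | 🔗 |
|---|---|---|---|---|
| 1 | Policy-labeled Preference Learning: Is Preference Enough for RLHF? | Proposes policy-labeled preference learning to address the insufficiency of preference signals alone in RLHF | reinforcement learning, offline RL, preference learning | |
| 2 | DYSTIL: Dynamic Strategy Induction with Large Language Models for Reinforcement Learning | Proposes DYSTIL to address strategy generation in reinforcement learning | reinforcement learning, large language model | |
| 3 | Sustainable Smart Farm Networks: Enhancing Resilience and Efficiency with Decision Theory-Guided Deep Reinforcement Learning | Proposes decision theory-guided deep reinforcement learning to improve the resilience and efficiency of smart farm networks | reinforcement learning, deep reinforcement learning, DRL | |
| 4 | Joint Resource Management for Energy-efficient UAV-assisted SWIPT-MEC: A Deep Reinforcement Learning Approach | Proposes a UAV-assisted SWIPT-MEC system to address energy efficiency and computing resource allocation | reinforcement learning, deep reinforcement learning, SAC | |
| 5 | Interpretable Learning Dynamics in Unsupervised Reinforcement Learning | Proposes an interpretability framework for understanding intrinsic motivation in unsupervised reinforcement learning | reinforcement learning, PPO, representation learning | |
| 6 | Ergodic Generative Flows | Proposes Ergodic Generative Flows to address training challenges of generative flow networks | reinforcement learning, imitation learning, flow matching | |
| 7 | A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM) | Proposes the LBRM algorithm to detect memorization in generative models | predictive model | |
| 8 | Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance | Proposes a cosine-distance-based knowledge distillation method to improve speech denoising performance | distillation | |
| 9 | Absolute Zero: Reinforced Self-play Reasoning with Zero Data | Proposes Absolute Zero to address reasoning in reinforcement learning with zero data | reinforcement learning, large language model | |
| 10 | Importance Analysis for Dynamic Control of Balancing Parameter in a Simple Knowledge Distillation Setting | Proposes dynamically adjusting the balancing parameter to optimize knowledge distillation | distillation | |
| 11 | Unraveling the Rainbow: can value-based methods schedule? | Proposes value-based methods to solve job scheduling problems | reinforcement learning, deep reinforcement learning | |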

🔬 Pillar 9: Embodied Foundation Models (11 papers)

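| # | Title | One-sentence summary | Tags | 🔗 |
|---|---|---|---|---|
| 12 | Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey | Surveys knowledge augmentation of large language models for complex problem solving | large language model, chain-of-thought | |
| 13 | Geospatial Mechanistic Interpretability of Large Language Models | Proposes a geospatial mechanistic interpretability framework to analyze how large language models process geographic information | large language model, foundation model | |
| 14 | Task-Oriented Multimodal Token Transmission in Resource-Constrained Multiuser Networks | Proposes a task-oriented multimodal token transmission scheme to address efficiency in resource-constrained networks | multimodal | |
| 15 | Automatic Calibration for Membership Inference Attack on Large Language Models | Proposes an automatically calibrated membership inference attack to address privacy issues in large language models | large language model | |
| 16 | Adversarial Attacks in Multimodal Systems: A Practitioner's Survey | Surveys adversarial attacks in multimodal systems to fill the gap in the practitioner's perspective | multimodal | |
| 17 | Revisiting Model Inversion Evaluation: From Misleading Standards to Reliable Privacy Assessment | Proposes a new evaluation framework to address evaluation problems for model inversion attacks | large language model, multimodal | |
| 18 | MARCO: Multi-Agent Code Optimization with Real-Time Knowledge Integration for High-Performance Computing | Proposes the MARCO framework to address code optimization in high-performance computing | large language model | |
| 19 | Mitigating mode collapse in normalizing flows by annealing with an adaptive schedule: Application to parameter estimation | Mitigates mode collapse in normalizing flows by annealing with an adaptive schedule | multimodal | |
| 20 | PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model | Proposes PARM to address multi-objective test-time alignment | large language model | |
| 21 | SPAP: Structured Pruning via Alternating Optimization and Penalty Methods | Proposes SPAP to address structured pruning of large language models | large language model | |
| 22 | Plug-and-Play AMC: Context Is King in Training-Free, Open-Set Modulation with LLMs | Proposes an LLM-based automatic modulation classification framework to cope with signal interference | foundation model | |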

🔬 Pillar 1: Robot Control (2 papers)

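| # | Title | One-sentence summary | Tags | 🔗 |
|---|---|---|---|---|
| 23 | Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning | Proposes HInt to address sparse rewards in goal-conditioned reinforcement learning | locomotion, reinforcement learning | |
| 24 | Causal Intervention Framework for Variational Auto Encoder Mechanistic Interpretability | Proposes a causal intervention framework to improve the mechanistic interpretability of variational autoencoders | manipulation | |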

🔬 Pillar 8: Physics-based Animation (1 paper)

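| # | Title | One-sentence summary | Tags | 🔗 |
|---|---|---|---|---|
| 25 | Quantum Feature Space of a Qubit Coupled to an Arbitrary Bath | Proposes a quantum feature space to efficiently classify qubit noise | PULSE | |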
