cs.LG（2023-12-28）

📊 共 18 篇论文

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (11) 支柱九：具身大模型 (Embodied Foundation Models) (3) 支柱一：机器人控制 (Robot Control) (2) 支柱四：生成式动作 (Generative Motion) (1) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (11 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?	主动采样减少离线强化学习中的因果混淆	reinforcement learning offline reinforcement learning
2	Generalizable Visual Reinforcement Learning with Segment Anything Model	提出SAM-G框架，利用SAM提升视觉强化学习在未知环境中的泛化能力	reinforcement learning foundation model
3	Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e	提出基于神经PID策略的PPO算法，用于Mu2e实验中的质子束强度控制	reinforcement learning PPO
4	Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity	提出表示复杂性层次以重构强化学习范式	reinforcement learning model-based RL
5	RLPlanner: Reinforcement Learning based Floorplanning for Chiplets with Fast Thermal Analysis	RLPlanner：基于强化学习的Chiplet Floorplanning，加速热分析	reinforcement learning MAE
6	Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources	提出一种基于EXP3的对抗算法以解决资源约束下的偏好学习问题	preference learning
7	Resilient Constrained Reinforcement Learning	提出弹性约束强化学习，解决约束条件未知下的强化学习问题	reinforcement learning
8	Improving Intrusion Detection with Domain-Invariant Representation Learning in Latent Space	提出基于领域不变表征学习的入侵检测方法，提升零日攻击检测能力	representation learning
9	FedSDD: Scalable and Diversity-enhanced Distillation for Model Aggregation in Federated Learning	FedSDD：面向联邦学习的可扩展、多样性增强的蒸馏模型聚合方法	distillation
10	Agnostic Interactive Imitation Learning: New Theory and Practical Algorithms	提出Agnostic交互式模仿学习算法MFTPL-P与Bootstrap-Dagger，解决专家策略非策略类问题。	imitation learning
11	Layer Attack Unlearning: Fast and Accurate Machine Unlearning via Layer Level Attack and Knowledge Distillation	提出层攻击卸载学习，通过层级攻击和知识蒸馏实现快速精确的机器卸载学习。	distillation

🔬 支柱九：具身大模型 (Embodied Foundation Models) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
12	Non-Vacuous Generalization Bounds for Large Language Models	为大型语言模型提供非平凡泛化界限，揭示其泛化能力	large language model
13	The LLM Surgeon	提出基于Kronecker因子曲率逼近的大语言模型剪枝方法，实现高效压缩。	large language model
14	Fast Inference of Mixture-of-Experts Language Models with Offloading	提出一种加速MoE语言模型推理的Offloading策略，可在消费级硬件上运行Mixtral-8x7B。	large language model

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
15	Gradient-based Planning with World Models	提出基于梯度的世界模型规划方法，提升在复杂环境中的控制性能。	MPC model predictive control world model
16	The Duck's Brain: Training and Inference of Neural Networks in Modern Database Engines	在数据库引擎中训练和推理神经网络：关系代数与SQL实现	manipulation

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
17	Fast gradient-free activation maximization for neurons in spiking neural networks	提出基于张量分解的无梯度激活最大化方法，用于分析脉冲神经网络神经元选择性。	VQ-VAE

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
18	An Adaptive Framework of Geographical Group-Specific Network on O2O Recommendation	提出GeoGrouse框架，解决O2O推荐中地理区域用户偏好建模的个性化问题	spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页