cs.LG(2025-06-20)

📊 共 18 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (8) 支柱九:具身大模型 (Embodied Foundation Models) (8) 支柱七:动作重定向 (Motion Retargeting) (1 🔗1) 支柱四:生成式动作 (Generative Motion) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)

#题目一句话要点标签🔗
1 A Survey of State Representation Learning for Deep Reinforcement Learning 综述状态表示学习以提升深度强化学习的效率 reinforcement learning deep reinforcement learning representation learning
2 Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning 提出静态网络稀疏性以提升深度强化学习的扩展潜力 reinforcement learning deep reinforcement learning DRL
3 Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity 提出Sparse-Reg以解决离线强化学习中的小样本过拟合问题 reinforcement learning offline RL offline reinforcement learning
4 TransDreamerV3: Implanting Transformer In DreamerV3 提出TransDreamerV3以提升复杂环境中的决策能力 reinforcement learning world model dreamer
5 No Free Lunch: Rethinking Internal Feedback for LLM Reasoning 提出内部反馈强化学习以提升大语言模型推理能力 reinforcement learning RLHF large language model
6 Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling? 探讨视觉语言模型在推理时间扩展中的自验证能力 reinforcement learning large language model
7 Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment 提出MARL-OD-DA以解决大规模交通分配问题 reinforcement learning
8 Metapath-based Hyperbolic Contrastive Learning for Heterogeneous Graph Embedding 提出基于元路径的双曲对比学习以解决异构图嵌入问题 contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (8 篇)

#题目一句话要点标签🔗
9 IsoNet: Causal Analysis of Multimodal Transformers for Neuromuscular Gesture Classification 提出IsoNet以解决多模态手势分类中的信息融合问题 multimodal
10 Universal Music Representations? Evaluating Foundation Models on World Music Corpora 评估基础模型在世界音乐语料库上的普适性 foundation model
11 Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs 利用大语言模型和概念图预测材料科学的新研究方向 large language model
12 SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification 提出SAFEx以解决MoE架构LLMs的安全对齐问题 large language model
13 Latent Concept Disentanglement in Transformer-based Language Models 提出潜在概念解耦方法以增强变换器语言模型的推理能力 large language model
14 TabArena: A Living Benchmark for Machine Learning on Tabular Data 提出TabArena以解决静态基准测试问题 foundation model
15 Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps 提出SeLoRA以解决LoRA参数冗余问题 foundation model
16 A Minimalist Optimizer Design for LLM Pretraining 提出SCALE优化器以提高LLM预训练效率 large language model

🔬 支柱七:动作重定向 (Motion Retargeting) (1 篇)

#题目一句话要点标签🔗
17 Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models 提出RATTPO以解决文本到图像生成中的提示优化问题 spatial relationship large language model

🔬 支柱四:生成式动作 (Generative Motion) (1 篇)

#题目一句话要点标签🔗
18 SIDE: Semantic ID Embedding for effective learning from sequences 提出SIDE方法以解决序列推荐系统中的嵌入规模问题 VQ-VAE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页