cs.LG(2025-06-11)

📊 共 13 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (6) 支柱二:RL算法与架构 (RL & Architecture) (5 🔗1) 支柱一:机器人控制 (Robot Control) (1) 支柱四:生成式动作 (Generative Motion) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (6 篇)

#题目一句话要点标签🔗
1 Athena: Enhancing Multimodal Reasoning with Data-efficient Process Reward Models 提出Athena-PRM以高效解决多模态推理中的奖励模型问题 multimodal
2 FedVLMBench: Benchmarking Federated Fine-Tuning of Vision-Language Models 提出FedVLMBench以解决联邦学习下视觉-语言模型微调评估问题 foundation model multimodal
3 AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent 提出AWP方法以解决大语言模型的压缩问题 large language model
4 Prompt Variability Effects On LLM Code Generation 提出合成评估管道以量化LLM代码生成的提示变异性影响 large language model
5 Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling 提出口头拒绝采样以解决LLM采样偏差问题 large language model
6 Learning Obfuscations Of LLM Embedding Sequences: Stained Glass Transform 提出Stained Glass Transform以解决LLM嵌入序列隐私问题 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
7 Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs 提出Omni-DPO以解决动态偏好学习中的数据利用问题 reinforcement learning preference learning RLHF
8 Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design 提出基于偏好的高效强化学习方法以解决查询选择问题 reinforcement learning
9 Probabilistic Variational Contrastive Learning 提出变分对比学习以解决不确定性量化问题 contrastive learning
10 Attention on flow control: transformer-based reinforcement learning for lift regulation in highly disturbed flows 提出基于变压器的强化学习以解决强干扰流中的升力调节问题 reinforcement learning
11 Canonical Latent Representations in Conditional Diffusion Models 提出CLAReps以解决条件扩散模型中的特征混淆问题 representation learning distillation

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
12 Provable Sim-to-Real Transfer via Offline Domain Randomization 提出离线领域随机化以解决仿真到现实转移问题 sim-to-real domain randomization reinforcement learning

🔬 支柱四:生成式动作 (Generative Motion) (1 篇)

#题目一句话要点标签🔗
13 AtmosMJ: Revisiting Gating Mechanism for AI Weather Forecasting Beyond the Year Scale 提出AtmosMJ以解决长时间天气预测的稳定性问题 physically plausible

⬅️ 返回 cs.LG 首页 · 🏠 返回主页