cs.LG（2025-06-05）

📊 共 38 篇论文 | 🔗 8 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (20 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (14 🔗5) 支柱一：机器人控制 (Robot Control) (1 🔗1) 支柱七：动作重定向 (Motion Retargeting) (1) 支柱五：交互与反应 (Interaction & Reaction) (1) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

#	题目	一句话要点	标签	🔗
1	Tuning the Right Foundation Models is What you Need for Partial Label Learning	提出PartialCLIP以解决部分标签学习中的模型选择问题	foundation model	✅
2	PCDVQ: Enhancing Vector Quantization for Large Language Models via Polar Coordinate Decoupling	提出PCDVQ以解决大语言模型量化精度不足问题	large language model
3	LSM-2: Learning from Incomplete Wearable Sensor Data	提出LSM-2以解决可穿戴传感器数据不完整问题	foundation model multimodal
4	Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets	提出缩放法则以比较语言-视觉模型与数据集	foundation model	✅
5	Conformal Prediction Adaptive to Unknown Subpopulation Shifts	提出适应未知子群体转变的保形预测方法	large language model
6	Ravan: Multi-Head Low-Rank Adaptation for Federated Fine-Tuning	提出Ravan以解决联邦微调中的低秩适应问题	large language model
7	Conformal Prediction Beyond the Seen: A Missing Mass Perspective for Uncertainty Quantification in Generative Models	提出CPQ框架以解决生成模型的不确定性量化问题	large language model
8	Power Law Guided Dynamic Sifting for Efficient Attention	提出SiftAttention以解决GPU上大语言模型的内存带宽限制问题	large language model
9	Sample Complexity and Representation Ability of Test-time Scaling Paradigms	提出测试时缩放范式以提升大语言模型的样本效率	large language model
10	Transformers Meet In-Context Learning: A Universal Approximation Theory	提出通用逼近理论以解释变换器的上下文学习能力	large language model
11	Membership Inference Attacks on Sequence Models	提出基于序列模型的成员推断攻击以提高隐私审计效果	large language model
12	QiMeng: Fully Automated Hardware and Software Design for Processor Chip	提出QiMeng以实现处理器芯片的全自动硬件和软件设计	large language model
13	BacPrep: An Experimental Platform for Evaluating LLM-Based Bacalaureat Assessment	提出BacPrep平台以解决罗马尼亚高考备考反馈不足问题	large language model
14	FPTQuant: Function-Preserving Transforms for LLM Quantization	提出FPTQuant以解决大语言模型量化效率问题	large language model
15	Agentic AI for Intent-Based Industrial Automation	提出意图驱动的Agentic AI框架以简化工业自动化	large language model
16	Sparse Autoencoders, Again?	提出混合模型以解决稀疏自编码器的局限性	large language model
17	Enhancing Delta Compression in LLMs via SVD-based Quantization Error Minimization	提出DeltaMix框架以解决LLMs的量化误差问题	large language model
18	Towards Better Generalization via Distributional Input Projection Network	提出分布式输入投影网络以提升模型泛化能力	large language model
19	MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs	提出MobiEdit以解决移动设备上个性化LLM知识编辑问题	large language model
20	Clustering and Median Aggregation Improve Differentially Private Inference	通过聚类与中位数聚合提升差分隐私推断质量	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (14 篇)

#	题目	一句话要点	标签	🔗
21	Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic	提出DoSAC以解决强化学习中的隐性混淆问题	reinforcement learning policy learning SAC
22	Dissecting Long-Chain-of-Thought Reasoning Models: An Empirical Study	系统分析长链推理模型以提升推理能力与效率	reinforcement learning chain-of-thought	✅
23	Aligning Multimodal Representations through an Information Bottleneck	通过信息瓶颈原理提出新方法以解决多模态表示对齐问题	representation learning multimodal
24	TabFlex: Scaling Tabular Learning to Millions with Linear Attention	提出TabFlex以解决大规模表格学习效率问题	linear attention large language model
25	Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data	提出Agentomics-ML以解决生物数据自动化建模问题	predictive model large language model multimodal	✅
26	Mixture-of-Experts Meets In-Context Reinforcement Learning	提出T2MIR框架以解决ICRL中的多模态与任务异质性问题	reinforcement learning contrastive learning	✅
27	StatsMerging: Statistics-Guided Model Merging via Task-Specific Teacher Distillation	提出StatsMerging以解决模型合并中的标签依赖问题	distillation
28	Two-dimensional Taxonomy for N-ary Knowledge Representation Learning Methods	提出二维分类法以解决n元知识表示学习的复杂性问题	representation learning
29	Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay	提出难度针对的在线数据选择与回放重放以提高LLM强化微调的数据效率	reinforcement learning large language model	✅
30	Mitigating Degree Bias Adaptively with Hard-to-Learn Nodes in Graph Contrastive Learning	提出HAR损失以适应性缓解图对比学习中的度偏差问题	contrastive learning
31	TreeRPO: Tree Relative Policy Optimization	提出TreeRPO以优化推理过程中的奖励信号	reinforcement learning large language model	✅
32	UnHiPPO: Uncertainty-aware Initialization for State Space Models	提出UnHiPPO以解决状态空间模型中的噪声问题	state space model
33	When Maximum Entropy Misleads Policy Optimization	分析最大熵强化学习在控制任务中的误导性	reinforcement learning reward design
34	Learning long range dependencies through time reversal symmetry breaking	提出RHEL算法以解决长程依赖学习问题	SSM state space model

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
35	A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search	提出SAILOR以解决行为克隆方法的局限性	manipulation imitation learning diffusion policy	✅

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
36	Multi-Point Proximity Encoding For Vector-Mode Geospatial Machine Learning	提出多点接近编码以解决向量模式地理空间机器学习问题	spatial relationship

🔬 支柱五：交互与反应 (Interaction & Reaction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
37	Spectral Graph Neural Networks are Incomplete on Graphs with a Simple Spectrum	提出旋转等变神经网络以提升谱图神经网络的表达能力	OMOMO

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
38	FaCTR: Factorized Channel-Temporal Representation Transformers for Efficient Time Series Forecasting	提出FaCTR以解决时间序列预测中的过度参数化问题	spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页

cs.LG（2025-06-05）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (20 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (14 篇)

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

🔬 支柱五：交互与反应 (Interaction & Reaction) (1 篇)

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册