| 31 | Structured Agent Distillation for Large Language Model | Proposes structured agent distillation to address large language model compression. | imitation learning, distillation, large language model | |
| 32 | Modality-Balancing Preference Optimization of Large Multimodal Models by Adversarial Negative Mining | Proposes modality-balancing preference optimization to address modality imbalance. | preference learning, large language model, multimodal | |
| 33 | InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models | Proposes InfiFPO to address preference alignment in large language model fusion. | DPO, direct preference optimization, large language model | |
| 34 | FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning | Proposes FlowQ to address the guidance problem in offline reinforcement learning. | reinforcement learning, offline reinforcement learning, flow matching | |
| 35 | Time to Embed: Unlocking Foundation Models for Time Series with Channel Descriptions | Proposes CHARM to address the limitations of time-series modeling. | representation learning, foundation model | |
| 36 | Energy-Efficient Deep Reinforcement Learning with Spiking Transformers | Proposes a Spike-Transformer reinforcement learning algorithm to reduce energy consumption. | reinforcement learning, deep reinforcement learning | |
| 37 | AAPO: Enhancing the Reasoning Capabilities of LLMs with Advantage Momentum | Proposes AAPO to address the inefficiency of existing RL methods at improving reasoning capabilities. | reinforcement learning, PPO, large language model | |
| 38 | Imitation Learning via Focused Satisficing | Proposes a focused-satisficing imitation learning method to improve the acceptability of learned behavior. | reinforcement learning, deep reinforcement learning, imitation learning | |
| 39 | The Evolution of Alpha in Finance: Harnessing Human Insight and LLM Agents | Proposes a five-stage taxonomy to advance intelligent investment systems in finance. | representation learning, large language model, multimodal | |
| 40 | Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks | Proposes Kolmogorov-Arnold networks for interpretable reinforcement learning in load balancing. | reinforcement learning, PPO | |
| 41 | Preference Learning with Lie Detectors can Induce Honesty or Evasion | Uses preference learning with lie detectors to improve the honesty of AI systems. | preference learning, DPO | |
| 42 | Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation | Proposes an efficient continuous-time reinforcement learning algorithm to address sample and computational efficiency. | reinforcement learning | |
| 43 | Text embedding models can be great data engineers | Proposes ADEPT to automate data engineering pipelines. | predictive model, TAMP | |
| 44 | TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning | Proposes TinyV to address false negatives in verifiers. | reinforcement learning, large language model | ✅ |
| 45 | Performance Optimization of Energy-Harvesting Underlay Cognitive Radio Networks Using Reinforcement Learning | Uses reinforcement learning to optimize the performance of energy-harvesting underlay cognitive radio networks. | reinforcement learning | |
| 46 | KIPPO: Koopman-Inspired Proximal Policy Optimization | Proposes KIPPO to address policy optimization in environments with complex dynamics. | reinforcement learning, policy learning, PPO | |
| 47 | Bellman operator convergence enhancements in reinforcement learning algorithms | Proposes Bellman-operator enhancements to improve the convergence of reinforcement learning algorithms. | reinforcement learning | |
| 48 | Personalised Insulin Adjustment with Reinforcement Learning: An In-Silico Validation for People with Diabetes on Intensive Insulin Treatment | Proposes an adaptive basal-bolus dose advisory system to optimize insulin adjustment for people with diabetes. | reinforcement learning | |
| 49 | FlowTSE: Target Speaker Extraction with Flow Matching | Proposes FlowTSE to address target speaker extraction. | flow matching | |
| 50 | Self Distillation via Iterative Constructive Perturbations | Proposes a cyclic optimization framework to improve the generalization of deep learning models. | distillation | |
| 51 | From Reasoning to Code: GRPO Optimization for Underrepresented Languages | Proposes GRPO-based optimization to address code generation for underrepresented programming languages. | reinforcement learning, large language model | |
| 52 | Riemannian Flow Matching for Brain Connectivity Matrices via Pullback Geometry | Proposes DiffeoCFM to address the generation of brain connectivity matrices. | flow matching | ✅ |
| 53 | When to retrain a machine learning model | Proposes an uncertainty-based retraining method to cope with evolving data. | reinforcement learning, offline reinforcement learning | |