cs.LG(2025-05-31)

📊 共 32 篇论文 | 🔗 6 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (18 🔗3) 支柱九:具身大模型 (Embodied Foundation Models) (14 🔗3)

🔬 支柱二:RL算法与架构 (RL & Architecture) (18 篇)

#题目一句话要点标签🔗
1 QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training 提出QoQ-Med以解决多模态临床决策中的数据不平衡问题 reinforcement learning foundation model multimodal
2 A Brain Graph Foundation Model: Pre-Training and Prompt-Tuning for Any Atlas and Disorder 提出脑图基础模型以解决神经科学领域的多样性问题 masked autoencoder contrastive learning large language model
3 MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning 提出MMedAgent-RL以解决多模态医疗推理中的协作问题 reinforcement learning curriculum learning multimodal
4 From Rules to Rewards: Reinforcement Learning for Interest Rate Adjustment in DeFi Lending 应用离线强化学习优化DeFi借贷中的利率调整 reinforcement learning TD3 offline reinforcement learning
5 Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning 提出AdaPR以解决4D流MRI重建中的适应性问题 reinforcement learning deep reinforcement learning DRL
6 Prompt-Tuned LLM-Augmented DRL for Dynamic O-RAN Network Slicing 提出基于提示调优的LLM增强DRL方法以解决动态O-RAN网络切片问题 reinforcement learning deep reinforcement learning DRL
7 A New Spatiotemporal Correlation Anomaly Detection Method that Integrates Contrastive Learning and Few-Shot Learning in Wireless Sensor Networks 提出MTAD-RD以解决无线传感器网络异常检测中的样本不足问题 contrastive learning spatiotemporal
8 Optimizing Sensory Neurons: Nonlinear Attention Mechanisms for Accelerated Convergence in Permutation-Invariant Neural Networks for Reinforcement Learning 提出非线性注意机制以加速强化学习收敛 reinforcement learning linear attention
9 RLAE: Reinforcement Learning-Assisted Ensemble for LLMs 提出RLAE以解决LLM集成动态权重调整问题 reinforcement learning PPO large language model
10 Dynamic Domain Adaptation-Driven Physics-Informed Graph Representation Learning for AC-OPF 提出DDA-PIGCN以解决AC-OPF约束建模问题 representation learning MAE spatiotemporal
11 Optimized Local Updates in Federated Learning via Reinforcement Learning 通过强化学习优化联邦学习中的本地更新 reinforcement learning deep reinforcement learning DRL
12 ORAN-GUIDE: RAG-Driven Prompt Learning for LLM-Augmented Reinforcement Learning in O-RAN Network Slicing 提出ORAN-GUIDE以解决O-RAN网络切片中的动态资源分配问题 reinforcement learning deep reinforcement learning DRL
13 Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments 提出一种新的度量学习方法以应对强化学习中的干扰问题 reinforcement learning deep reinforcement learning
14 Reinforcement Learning for Hanabi 探索强化学习在Hanabi游戏中的应用与表现 reinforcement learning deep reinforcement learning
15 CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries 提出CLARIFY以解决模糊查询的偏好强化学习问题 reinforcement learning contrastive learning
16 Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn 通过减少波动性提出C-CHAIN以缓解持续强化学习中的可塑性损失 reinforcement learning
17 AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs 提出AutoMixAlign以解决多任务偏好优化问题 DPO large language model
18 Comparing Traditional and Reinforcement-Learning Methods for Energy Storage Control 比较传统与强化学习方法在能源存储控制中的应用 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (14 篇)

#题目一句话要点标签🔗
19 A Foundation Model for Non-Destructive Defect Identification from Vibrational Spectra 提出DefectNet以解决非破坏性缺陷识别问题 foundation model
20 Existing Large Language Model Unlearning Evaluations Are Inconclusive 提出新评估原则以解决大语言模型去学习评估不确定性问题 large language model
21 Probabilistic Forecasting for Building Energy Systems using Time-Series Foundation Models 提出时间序列基础模型以提升建筑能源系统预测精度 foundation model
22 M2WLLM: Multi-Modal Multi-Task Ultra-Short-term Wind Power Prediction Algorithm Based on Large Language Model 提出M2WLLM以解决超短期风电预测精度不足问题 large language model
23 Spectral Insights into Data-Oblivious Critical Layers in Large Language Models 提出数据无关方法识别大型语言模型中的关键层 large language model
24 Power-of-Two (PoT) Weights in Large Language Models (LLMs) 提出PoT权重以降低大语言模型的复杂性 large language model
25 Pitfalls in Evaluating Language Model Forecasters 提出评估语言模型预测能力的新方法以解决评估挑战 large language model
26 Linear Representation Transferability Hypothesis: Leveraging Small Models to Steer Large Models 提出线性表示可转移假设以引导大模型行为 large language model
27 FLoE: Fisher-Based Layer Selection for Efficient Sparse Adaptation of Low-Rank Experts 提出FLoE以解决低秩专家适应中的层选择问题 large language model
28 It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs 提出基于广义高斯先验的优化框架以提升大语言模型训练效率 large language model
29 BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation 提出BenchHub以解决LLM评估基准分散问题 large language model
30 Revisiting LLMs as Zero-Shot Time-Series Forecasters: Small Noise Can Break Large Models 评估LLMs在零-shot时间序列预测中的有效性及其噪声敏感性问题 large language model
31 Channel Normalization for Time Series Channel Identification 提出通道归一化以解决时间序列通道可识别性问题 foundation model
32 BatteryBERT for Realistic Battery Fault Detection Using Point-Masked Signal Modeling 提出BatteryBERT以解决电池故障检测中的时序数据问题 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页