cs.LG (2025-08-26)

📊 25 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 2: RL Algorithms and Architecture (12 🔗1) · Pillar 9: Embodied Foundation Models (10) · Pillar 8: Physics-based Animation (3)

🔬 Pillar 2: RL Algorithms and Architecture (12 papers)

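| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 1 | (DEMO) Deep Reinforcement Learning Based Resource Allocation in Distributed IoT Systems | Proposes a deep reinforcement learning based resource allocation framework for distributed IoT systems | reinforcement learning, deep reinforcement learning, DRL | |
| 2 | DRMD: Deep Reinforcement Learning for Malware Detection under Concept Drift | Proposes DRMD to address concept drift in malware detection | reinforcement learning, deep reinforcement learning, DRL | |
| 3 | HAEPO: History-Aggregated Exploratory Policy Optimization | Proposes HAEPO to address insufficient exploration in long-horizon tasks | reinforcement learning, PPO, DPO | |
| 4 | History Rhymes: Accelerating LLM Reinforcement Learning with RhymeRL | Proposes RhymeRL to address low GPU utilization in reinforcement learning for large language models | reinforcement learning, large language model | |
| 5 | Re:Frame -- Retrieving Experience From Associative Memory | Proposes Re:Frame to address the scarcity of expert data in offline reinforcement learning | reinforcement learning, offline RL, offline reinforcement learning | |
| 6 | Beyond Tokens: Enhancing RTL Quality Estimation via Structural Graph Learning | Proposes the StructRTL framework to improve RTL design quality estimation | representation learning, distillation, large language model | |
| 7 | Latent Variable Modeling in Multi-Agent Reinforcement Learning via Expectation-Maximization for UAV-Based Wildlife Protection | Proposes expectation-maximization-based latent variable modeling for UAV-based wildlife protection | reinforcement learning, PPO | |
| 8 | Stability and Generalization for Bellman Residuals | Proposes Bellman residual minimization to address consistency issues in offline reinforcement learning | reinforcement learning, offline reinforcement learning, inverse reinforcement learning | |
| 9 | Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks | Proposes optimal sparsity for Mixture-of-Experts models to improve performance on reasoning tasks | reinforcement learning, large language model | |
| 10 | Atrial Fibrillation Prediction Using a Lightweight Temporal Convolutional and Selective State Space Architecture | Proposes a lightweight deep learning model for early prediction of atrial fibrillation | Mamba, state space model | |
| 11 | Revisiting associative recall in modern recurrent models | Examines associative recall in modern recurrent models and strategies for optimizing it | Mamba, SSM | |
| 12 | Dual-Distilled Heterogeneous Federated Learning with Adaptive Margins for Trainable Global Prototypes | Proposes dual-distilled heterogeneous federated learning to address prototype margin shrinkage | contrastive learning, distillation | |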

🔬 Pillar 9: Embodied Foundation Models (10 papers)

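| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 13 | Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in Multimodal LLMs | Proposes a spectral-graph framework for quantifying hallucinations in multimodal LLMs | multimodal | |
| 14 | FFT-MoE: Efficient Federated Fine-Tuning for Foundation Models via Large-scale Sparse MoE under Heterogeneous Edge | Proposes FFT-MoE for federated fine-tuning in heterogeneous edge environments | foundation model | |
| 15 | The Sound of Risk: A Multimodal Physics-Informed Acoustic Model for Forecasting Market Volatility and Enhancing Market Interpretability | Proposes a multimodal physics-informed acoustic model to improve market volatility forecasting | multimodal | |
| 16 | Fine-Tuning Vision-Language Models for Neutrino Event Analysis in High-Energy Physics Experiments | Proposes a vision-language-model-based method for classifying neutrino events | large language model, multimodal | |
| 17 | Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding | Proposes LRTab to improve large language model reasoning in tabular understanding | large language model, chain-of-thought | |
| 18 | Understanding Tool-Integrated Reasoning | Proposes tool-integrated reasoning to enhance large language model capabilities | large language model | |
| 19 | APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration | Proposes APT-LLM to accelerate large language models | large language model | |
| 20 | PAX-TS: Model-agnostic multi-granular explanations for time series forecasting via localized perturbations | Proposes PAX-TS to address the explainability of time series forecasting models | large language model | |
| 21 | Enhancing Model Privacy in Federated Learning with Random Masking and Quantization | Proposes FedQSN to protect model privacy in federated learning | large language model | |
| 22 | Rethinking Caching for LLM Serving Systems: Beyond Traditional Heuristics | Proposes SISO to optimize caching strategies in large language model serving systems | large language model | |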

🔬 Pillar 8: Physics-based Animation (3 papers)

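| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 23 | GENIE-ASI: Generative Instruction and Executable Code for Analog Subcircuit Identification | Proposes GENIE-ASI for analog subcircuit identification | AMP, large language model, foundation model | |
| 24 | Data-Augmented Few-Shot Neural Emulator for Computer-Model System Identification | Proposes a data-augmented few-shot neural emulator for computer-model system identification | spatiotemporal | |
| 25 | Universal Dynamics with Globally Controlled Analog Quantum Simulators | Proposes globally controlled analog quantum simulators to realize universal quantum dynamics | PULSE | |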
