cs.LG(2025-05-11)
📊 共 8 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models | 提出知识蒸馏方法以提升沃尔玛电商搜索相关性 | distillation large language model | ||
| 2 | Multi-Objective-Guided Discrete Flow Matching for Controllable Biological Sequence Design | 提出多目标引导离散流匹配以解决可控生物序列设计问题 | flow matching | ||
| 3 | Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures | 提出层次语言模型的扩展理论以比较卷积与变换器架构 | representation learning | ||
| 4 | Reinforcement Learning (RL) Meets Urban Climate Modeling: Investigating the Efficacy and Impacts of RL-Based HVAC Control | 提出基于强化学习的HVAC控制框架以应对城市气候建模挑战 | reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | 提出GuidedQuant以解决大语言模型量化中的特征重要性问题 | large language model | ✅ | |
| 6 | Turning LLM Activations Quantization-Friendly | 提出量化友好的激活方法以降低LLM服务成本 | large language model | ||
| 7 | MMiC: Mitigating Modality Incompleteness in Clustered Federated Learning | 提出MMiC框架以解决多模态联邦学习中的模态不完整问题 | multimodal | ✅ | |
| 8 | Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety | 提出Self-Inf-N以识别良性样本中的异常点,提升LLM安全性 | large language model | ✅ |