cs.LG(2026-01-06)
📊 共 15 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (9 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | Adversarial Contrastive Learning for LLM Quantization Attacks | 提出对抗对比学习ACL,提升LLM量化攻击的成功率 | contrastive learning large language model | ||
| 11 | Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression | 提出稀疏知识蒸馏框架以优化模型压缩与温度缩放问题 | distillation | ||
| 12 | Decentralized Autoregressive Generation | 提出去中心化自回归生成方法,解决多模态语言模型训练中的专家协作问题。 | flow matching multimodal | ||
| 13 | Causal Manifold Fairness: Enforcing Geometric Invariance in Representation Learning | 提出因果流形公平性(CMF),通过几何不变性实现表征学习中的公平性。 | representation learning | ||
| 14 | In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior | SPICE:通过上下文和价值先验的贝叶斯融合实现上下文强化学习 | reinforcement learning | ||
| 15 | Stratified Hazard Sampling: Minimal-Variance Event Scheduling for CTMC/DTMC Discrete Diffusion and Flow Models | 提出分层风险抽样(SHS),最小化CTMC/DTMC离散扩散模型的事件调度方差,提升生成质量。 | flow matching multimodal |