cs.LG(2025-08-07)

📊 共 5 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (3) 支柱九:具身大模型 (Embodied Foundation Models) (2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
1 SPaRFT: Self-Paced Reinforcement Fine-Tuning for Large Language Models 提出SPaRFT以解决大语言模型训练效率低下问题 reinforcement learning curriculum learning large language model
2 R-Zero: Self-Evolving Reasoning LLM from Zero Data 提出R-Zero以解决自我进化推理模型的数据依赖问题 reinforcement learning large language model
3 Anti-Jamming Sensing with Distributed Reconfigurable Intelligent Metasurface Antennas 提出分布式可重构智能超表面天线以解决抗干扰感知问题 reinforcement learning deep reinforcement learning DRL

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
4 Disentangling Bias by Modeling Intra- and Inter-modal Causal Attention for Multimodal Sentiment Analysis 提出MMCI模型以解决多模态情感分析中的偏差问题 multimodal
5 A Metric for MLLM Alignment in Large-scale Recommendation 提出泄漏影响评分以解决多模态推荐系统对齐问题 large language model multimodal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页