cs.LG (2025-08-24)

📊 2 papers in total

#	Title	One-sentence summary	Tags
1	TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling	Proposes TreePO to resolve the tension between inference efficiency and effectiveness in reinforcement learning	reinforcement learning, large language model
2	LLM Assertiveness can be Mechanistically Decomposed into Emotional and Logical Components	Decomposes LLM assertiveness into emotional and logical components to address overconfidence	large language model