| 1 |
Discrete Prompt Tuning via Recursive Utilization of Black-box Multimodal Large Language Model for Personalized Visual Emotion Recognition |
提出离散提示调优以解决个性化视觉情感识别问题 |
large language model multimodal |
|
|
| 2 |
GIER: Gap-Driven Self-Refinement for Large Language Models |
提出GIER框架以提升大型语言模型输出质量 |
large language model chain-of-thought |
|
|
| 3 |
The Resurgence of GCG Adversarial Attacks on Large Language Models |
提出GCG对大语言模型的对抗攻击评估方法 |
large language model |
|
|
| 4 |
Wage Sentiment Indices Derived from Survey Comments via Large Language Models |
提出工资情感指数以预测日本工资动态 |
large language model |
|
|
| 5 |
No Clustering, No Routing: How Transformers Actually Process Rare Tokens |
揭示Transformer如何处理稀有词汇以提升预测能力 |
large language model |
|
|
| 6 |
Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting |
提出角色提示优化方法以解决对话代理过度发言问题 |
large language model |
✅ |
|
| 7 |
TECP: Token-Entropy Conformal Prediction for LLMs |
提出TECP以解决大语言模型的不确定性量化问题 |
large language model |
|
|
| 8 |
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute |
提出ParaThinker以解决大语言模型推理效率瓶颈问题 |
large language model |
|
|
| 9 |
Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling |
提出基准测试以提升LLM推理效率 |
large language model |
|
|
| 10 |
Can Multi-turn Self-refined Single Agent LMs with Retrieval Solve Hard Coding Problems? |
提出多轮自我精炼单代理语言模型以解决复杂编程问题 |
chain-of-thought |
✅ |
|
| 11 |
Probe-Rewrite-Evaluate: A Workflow for Reliable Benchmarks and Quantifying Evaluation Awareness |
提出Probe-Rewrite-Evaluate方法以解决评估意识问题 |
large language model |
|
|
| 12 |
When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment |
提出机制性洞察以解决推理引发的失调问题 |
large language model |
|
|
| 13 |
Modeling Motivated Reasoning in Law: Evaluating Strategic Role Conditioning in LLM Summarization |
提出基于角色条件的LLM摘要评估框架以应对法律动机推理问题 |
large language model |
|
|
| 14 |
GraphKV: Breaking the Static Selection Paradigm with Graph-Based KV Cache Eviction |
提出GraphKV以解决KV缓存管理中的动态选择问题 |
large language model |
|
|
| 15 |
KG-RAG: Enhancing GUI Agent Decision-Making via Knowledge Graph-Driven Retrieval-Augmented Generation |
提出KG-RAG框架以提升GUI代理的决策能力 |
large language model |
|
|