cs.CL(2025-09-07)
📊 2 papers in total
🎯 Interest area navigation
🔬 Pillar 9: Embodied Foundation Models (2 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Beyond I'm Sorry, I Can't: Dissecting Large Language Model Refusal | Uses sparse autoencoders to dissect refusal behavior in large language models and to achieve jailbreaks | large language model | | |
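| 2 | Let's Roleplay: Examining LLM Alignment in Collaborative Dialogues | Proposes a roleplay-based framework for evaluating LLM alignment, improving decision quality in multi-party collaborative dialogue | large language model | | |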