| 1 |
Alignment Revisited: Are Large Language Models Consistent in Stated and Revealed Preferences? |
提出偏好一致性测量方法以解决LLM行为与人类价值不一致问题 |
large language model |
|
|
| 2 |
FinBERT2: A Specialized Bidirectional Encoder for Bridging the Gap in Finance-Specific Deployment of Large Language Models |
提出FinBERT2以解决金融领域大语言模型应用不足问题 |
large language model |
|
|
| 3 |
PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective Decoding |
提出PMF-CEC以解决ASR错误纠正中的同音词问题 |
multimodal |
|
|
| 4 |
CMT-LLM: Contextual Multi-Talker ASR Utilizing Large Language Models |
提出CMT-LLM框架以解决多说话者ASR与上下文偏置问题 |
large language model |
|
|
| 5 |
ChartGen: Scaling Chart Understanding Via Code-Guided Synthetic Chart Generation |
提出ChartGen以解决图表理解中的合成数据生成问题 |
large language model multimodal |
✅ |
|
| 6 |
Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment |
提出机器对抗机器的方法以应对生成式AI在评估中的威胁 |
large language model multimodal |
|
|
| 7 |
Position: Olfaction Standardization is Essential for the Advancement of Embodied Artificial Intelligence |
呼吁标准化嗅觉研究以推动具身人工智能发展 |
multimodal |
|
|
| 8 |
CodeSense: a Real-World Benchmark and Dataset for Code Semantic Reasoning |
提出CodeSense以解决代码语义推理基准不足问题 |
chain-of-thought |
✅ |
|
| 9 |
RFCAudit: An LLM Agent for Functional Bug Detection in Network Protocols |
提出RFCAudit以解决网络协议功能性错误检测问题 |
large language model |
|
|
| 10 |
Organizational Adaptation to Generative AI in Cybersecurity: A Systematic Review |
提出生成性人工智能集成框架以应对网络安全挑战 |
large language model |
|
|
| 11 |
AgentAuditor: Human-Level Safety and Security Evaluation for LLM Agents |
提出AgentAuditor以解决LLM代理安全性评估问题 |
chain-of-thought |
✅ |
|
| 12 |
MIRROR: Modular Internal Processing for Personalized Safety in LLM Dialogue |
提出MIRROR以解决大型语言模型对用户安全的忽视问题 |
large language model |
|
|
| 13 |
Wide Reflective Equilibrium in LLM Alignment: Bridging Moral Epistemology and AI Safety |
提出广泛反射平衡方法以增强大型语言模型的对齐安全性 |
large language model |
|
|