| 1 |
Hidden in Plain Sight: Reasoning in Underspecified and Misspecified Scenarios for Multimodal LLMs |
提出隐性推理方法以解决多模态大语言模型的不足问题 |
large language model multimodal |
|
|
| 2 |
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge |
提出MELT以解决情感数据标注的自动化问题 |
large language model multimodal |
|
|
| 3 |
Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents |
提出Open CaptchaWorld以解决多模态LLM代理在CAPTCHA挑战中的不足 |
multimodal |
|
|
| 4 |
Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models |
提出AdaCVD以解决心血管疾病风险预测中的数据异质性问题 |
large language model |
|
|
| 5 |
The World As Large Language Models See It: Exploring the reliability of LLMs in representing geographical features |
评估大型语言模型在地理特征表示中的可靠性 |
large language model |
|
|
| 6 |
Gated Multimodal Graph Learning for Personalized Recommendation |
提出RLMultimodalRec以解决多模态推荐中的融合挑战 |
multimodal |
|
|
| 7 |
Towards Scalable Schema Mapping using Large Language Models |
提出基于大语言模型的可扩展模式映射方法以解决数据集成挑战 |
large language model |
|
|
| 8 |
Generative AI for Urban Design: A Stepwise Approach Integrating Human Expertise with Multimodal Diffusion Models |
提出分步生成框架以提升城市设计中的人机协作 |
multimodal |
|
|
| 9 |
FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation |
提出FABLE基准以评估大语言模型的数据流推理能力 |
large language model |
|
|
| 10 |
Evaluation of LLMs for mathematical problem solving |
评估大型语言模型在数学问题求解中的表现 |
large language model chain-of-thought |
|
|
| 11 |
Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success |
提出随机规则森林以解决创业成功预测问题 |
large language model |
|
|
| 12 |
Chances and Challenges of the Model Context Protocol in Digital Forensics and Incident Response |
提出模型上下文协议以解决数字取证中的透明性和可解释性问题 |
large language model |
|
|
| 13 |
MIR: Methodology Inspiration Retrieval for Scientific Research Problems |
提出方法论灵感检索以解决科学研究问题 |
large language model |
|
|
| 14 |
Whispers of Many Shores: Cultural Alignment through Collaborative Cultural Expertise |
提出软提示微调框架以解决文化对齐问题 |
large language model |
|
|
| 15 |
Tournament of Prompts: Evolving LLM Instructions Through Structured Debates and Elo Ratings |
提出DEEVO框架以优化大语言模型的提示工程 |
large language model |
|
|
| 16 |
A survey of using EHR as real-world evidence for discovering and validating new drug indications |
综述电子健康记录在新药适应症发现中的应用与挑战 |
large language model |
|
|
| 17 |
Memory OS of AI Agent |
提出MemoryOS以解决大语言模型的长期记忆管理问题 |
large language model |
✅ |
|
| 18 |
Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction |
提出NextLocMoE以解决个性化和语义感知的下一个位置预测问题 |
large language model |
|
|
| 19 |
Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation |
提出基于知识图谱和大型语言模型的结构化虚假信息生成方法 |
large language model |
|
|
| 20 |
Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning |
优化知识图谱与大语言模型接口以提升复杂推理能力 |
large language model |
|
|
| 21 |
LPASS: Linear Probes as Stepping Stones for vulnerability detection using compressed LLMs |
提出LPASS以提高压缩LLM在漏洞检测中的效率 |
large language model |
|
|
| 22 |
RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation |
提出RMoA以优化多智能体系统的效率与可靠性 |
large language model |
✅ |
|
| 23 |
GridRoute: A Benchmark for LLM-Based Route Planning with Cardinal Movement in Grid Environments |
提出GridRoute基准以提升LLM在网格环境中的路径规划能力 |
large language model |
✅ |
|
| 24 |
TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents |
提出TRAPDOC以解决用户对大型语言模型的过度依赖问题 |
large language model |
✅ |
|
| 25 |
Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules |
提出QuAda以解决大语言模型中的引用意识对话问题 |
large language model |
|
|
| 26 |
E^2GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness |
提出E^2GraphRAG以解决图基RAG效率低下问题 |
large language model |
|
|
| 27 |
Learning API Functionality from In-Context Demonstrations for Tool-based Agents |
提出从上下文示例中学习API功能以解决文档缺失问题 |
large language model |
|
|