| 1 |
Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments |
提出Dy3DGS-SLAM以解决动态环境下单目SLAM问题 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 2 |
Pts3D-LLM: Studying the Impact of Token Structure for 3D Scene Understanding With Large Language Models |
提出Pts3D-LLM以提升3D场景理解的效果 |
scene understanding large language model multimodal |
|
|
| 3 |
GS4: Generalizable Sparse Splatting Semantic SLAM |
提出GS4以解决传统SLAM在语义映射中的不足问题 |
gaussian splatting splatting semantic mapping |
|
|
| 4 |
Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models |
提出基于迁移学习和零样本模型的纺织品回收自动化分析方法 |
open-vocabulary open vocabulary foundation model |
|
|
| 5 |
STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving |
提出STSBench以解决多模态大语言模型在自动驾驶中的时空推理问题 |
scene understanding large language model |
|
|
| 6 |
Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection |
提出迭代视觉基础框架以解决视觉关系检测的泛化问题 |
scene understanding embodied AI large language model |
|
|
| 7 |
Reconstructing Heterogeneous Biomolecules via Hierarchical Gaussian Mixtures and Part Discovery |
提出CryoSPIRE以解决冷冻电子显微镜中生物分子重建问题 |
gaussian splatting splatting scene reconstruction |
|
|
| 8 |
HMVLM: Multistage Reasoning-Enhanced Vision-Language Model for Long-Tailed Driving Scenarios |
提出HMVLM以解决长尾驾驶场景中的决策问题 |
scene understanding chain-of-thought |
|
|
| 9 |
Aerial Multi-View Stereo via Adaptive Depth Range Inference and Normal Cues |
提出自适应深度范围MVS以解决航空多视图立体重建问题 |
depth estimation feature matching |
|
|
| 10 |
Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration |
提出Token Transforming框架以加速视觉Transformer并减少信息损失 |
depth estimation |
|
|