| 10 |
VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction |
VGD:用于前馈环视驾驶场景重建的视觉几何高斯溅射 |
gaussian splatting splatting scene reconstruction |
|
|
| 11 |
Extreme Views: 3DGS Filter for Novel View Synthesis from Out-of-Distribution Camera Poses |
提出基于梯度的3DGS滤波方法,解决极端视角下新视角合成的伪影问题 |
3D gaussian splatting 3DGS gaussian splatting |
✅ |
|
| 12 |
Advances in 4D Representation: Geometry, Motion, and Interaction |
针对4D生成与重建,提出基于几何、运动和交互的4D表征方法综述。 |
3D gaussian splatting 3DGS gaussian splatting |
✅ |
|
| 13 |
A Training-Free Framework for Open-Vocabulary Image Segmentation and Recognition with EfficientNet and CLIP |
提出一种基于EfficientNet和CLIP的无训练开放词汇图像分割与识别框架 |
open-vocabulary open vocabulary |
|
|
| 14 |
Toward A Better Understanding of Monocular Depth Evaluation |
提出单目深度估计评估新指标,提升与人类感知的对齐性 |
depth estimation monocular depth |
✅ |
|
| 15 |
AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance Fields |
AegisRF:利用敏感度引导的对抗扰动保护NeRF的知识产权 |
NeRF neural radiance field |
✅ |
|
| 16 |
A Matter of Time: Revealing the Structure of Time in Vision-Language Models |
提出TIME10k基准,揭示视觉-语言模型中时间信息的低维非线性结构,并构建时间轴表示。 |
open-vocabulary open vocabulary multimodal |
✅ |
|
| 17 |
Exploring Scale Shift in Crowd Localization under the Context of Domain Generalization |
针对人群定位中尺度偏移问题,提出因果特征解耦和异构处理方法,提升领域泛化能力。 |
scene understanding |
|
|