| 23 |
Learning Unified Representation of 3D Gaussian Splatting |
提出基于连续子流形场的3D高斯溅射统一表征方法,提升神经网络学习效率。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 24 |
Polysemous Language Gaussian Splatting via Matching-based Mask Lifting |
提出MUSplat,通过匹配的掩码提升实现多义语言高斯溅射,无需场景重训练。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 25 |
Lightweight Structured Multimodal Reasoning for Clinical Scene Understanding in Robotics |
提出轻量级结构化多模态推理框架,用于机器人临床场景理解 |
scene understanding multimodal chain-of-thought |
|
|
| 26 |
Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach |
提出一种开放词汇、多方面、可扩展的视觉情感评估方法,用于评估多模态大语言模型的情感理解能力。 |
open-vocabulary open vocabulary large language model |
✅ |
|
| 27 |
Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting |
利用2D高斯溅射压缩图像表示实现视觉-语言对齐 |
gaussian splatting splatting multimodal |
|
|
| 28 |
EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model |
EfficientDepth:一种快速且保留细节的单目深度估计模型 |
depth estimation monocular depth geometric consistency |
|
|
| 29 |
GS-2M: Gaussian Splatting for Joint Mesh Reconstruction and Material Decomposition |
GS-2M:基于高斯溅射的联合网格重建与材质分解方法 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 30 |
CCNeXt: An Effective Self-Supervised Stereo Depth Estimation Approach |
提出CCNeXt,一种高效的自监督立体深度估计方法,在计算成本和精度间取得平衡。 |
depth estimation stereo depth |
✅ |
|
| 31 |
Spatial Reasoning in Foundation Models: Benchmarking Object-Centric Spatial Understanding |
提出系统基准以解决视觉模型空间理解不足问题 |
scene understanding foundation model |
|
|
| 32 |
UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective |
UrbanFeel:提出一个综合性城市街景理解benchmark,关注时序变化和人类感知。 |
scene understanding large language model multimodal |
|
|
| 33 |
DeLiVR: Differential Spatiotemporal Lie Bias for Efficient Video Deraining |
DeLiVR:利用时空Lie群微分偏置实现高效视频去雨 |
optical flow spatiotemporal |
|
|
| 34 |
SingRef6D: Monocular Novel Object Pose Estimation with a Single RGB Reference |
SingRef6D:基于单张RGB参考图像的新物体单目6D位姿估计 |
Depth Anything 6D pose estimation spatial relationship |
|
|
| 35 |
Large Material Gaussian Model for Relightable 3D Generation |
提出Large Material Gaussian Model,实现可动态光照的3D内容生成,解决现有方法材质属性缺失问题。 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 36 |
Drag4D: Align Your Motion with Text-Driven 3D Scene Generation |
Drag4D:提出文本驱动的3D场景生成框架,实现交互式物体运动控制 |
gaussian splatting splatting |
|
|
| 37 |
Dynamic Novel View Synthesis in High Dynamic Range |
提出HDR-4DGS,解决高动态范围动态场景的新视角合成问题。 |
gaussian splatting splatting |
|
|
| 38 |
DualFocus: Depth from Focus with Spatio-Focal Dual Variational Constraints |
DualFocus:利用空域-焦域双重变分约束的景深估计方法 |
depth estimation |
|
|