| 21 |
Transforming Omnidirectional RGB-LiDAR data into 3D Gaussian Splatting |
提出一种RGB-LiDAR到3D高斯溅射的转换方法,用于高效构建大规模数字孪生。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 22 |
EntON: Eigenentropy-Optimized Neighborhood Densification in 3D Gaussian Splatting |
EntON:基于特征熵优化的3D高斯溅射邻域稠密化,提升几何精度与渲染质量。 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 23 |
CylinderSplat: 3D Gaussian Splatting with Cylindrical Triplanes for Panoramic Novel View Synthesis |
CylinderSplat:利用柱面Triplane的3D高斯溅射实现全景新视角合成 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 24 |
VG3S: Visual Geometry Grounded Gaussian Splatting for Semantic Occupancy Prediction |
VG3S:利用视觉几何先验的高斯溅射实现语义占据预测 |
3D gaussian splatting gaussian splatting splatting |
|
|
| 25 |
FTSplat: Feed-forward Triangle Splatting Network |
提出FTSplat,通过前馈三角形splatting网络实现高效三维重建 |
3D gaussian splatting 3DGS gaussian splatting |
|
|
| 26 |
NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving |
NOVA:面向自动驾驶,提出基于开放词汇自回归的3D多目标跟踪方法 |
open-vocabulary open vocabulary large language model |
✅ |
|
| 27 |
JOPP-3D: Joint Open Vocabulary Semantic Segmentation on Point Clouds and Panoramas |
JOPP-3D:联合点云与全景图的开放词汇语义分割框架 |
scene understanding open-vocabulary open vocabulary |
|
|
| 28 |
DeepSight: Bridging Depth Maps and Language with a Depth-Driven Multimodal Model |
DeepSight:首个深度驱动的多模态模型,弥合深度图与语言之间的鸿沟,提升三维场景理解。 |
scene understanding large language model multimodal |
|
|
| 29 |
Exploring Open-Vocabulary Object Recognition in Images using CLIP |
提出基于CLIP的开放词汇目标识别框架,无需复杂训练且泛化性强 |
open-vocabulary open vocabulary |
|
|
| 30 |
Spectral Probing of Feature Upsamplers in 2D-to-3D Scene Reconstruction |
提出频谱诊断框架,评估2D-to-3D重建中特征上采样方法对3D感知的贡献 |
scene reconstruction geometric consistency foundation model |
|
|
| 31 |
FreeOcc: Training-free Panoptic Occupancy Prediction via Foundation Models |
FreeOcc:利用预训练模型实现免训练的全景占据预测 |
scene understanding foundation model |
|
|
| 32 |
EventGeM: Global-to-Local Feature Matching for Event-Based Visual Place Recognition |
EventGeM:用于事件相机视觉定位的全局到局部特征匹配方法 |
depth estimation feature matching foundation model |
|
|
| 33 |
RePer-360: Releasing Perspective Priors for 360$^\circ$ Depth Estimation via Self-Modulation |
RePer-360:通过自调制释放透视先验,用于360°深度估计 |
depth estimation foundation model |
|
|
| 34 |
AV-Unified: A Unified Framework for Audio-visual Scene Understanding |
提出AV-Unified统一框架,用于多任务联合学习的音视频场景理解 |
scene understanding spatiotemporal |
|
|
| 35 |
CHMv2: Improvements in Global Canopy Height Mapping using DINOv3 |
CHMv2:利用DINOv3改进全球冠层高度图绘制,提升精度与细节 |
depth estimation height map |
|
|
| 36 |
PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction |
PixARMesh:提出一种自回归网格原生单视图场景重建方法 |
scene reconstruction |
|
|
| 37 |
Pano3DComposer: Feed-Forward Compositional 3D Scene Generation from Single Panoramic Image |
Pano3DComposer:基于单张全景图像的前馈式可组合3D场景生成 |
VGGT geometric consistency |
✅ |
|
| 38 |
Rewis3d: Reconstruction Improves Weakly-Supervised Semantic Segmentation |
Rewis3d:利用3D重建提升弱监督语义分割性能 |
scene reconstruction |
|
|