cs.CV(2025-11-23)

📊 共 13 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱三:空间感知 (Perception & SLAM) (9 🔗3) 支柱二:RL算法与架构 (RL & Architecture) (3) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱三:空间感知 (Perception & SLAM) (9 篇)

#题目一句话要点标签🔗
1 ReCoGS: Real-time ReColoring for Gaussian Splatting scenes ReCoGS:高斯溅射场景的实时重新着色方法 gaussian splatting NeRF novel view synthesis
2 PhysGS: Bayesian-Inferred Gaussian Splatting for Physical Property Estimation PhysGS:基于贝叶斯推断的高斯溅射实现物理属性估计 3D gaussian splatting gaussian splatting
3 SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation SegSplat:提出一种前馈高斯溅射和开放集语义分割框架 3D gaussian splatting gaussian splatting
4 Alias-free 4D Gaussian Splatting 提出4D尺度自适应滤波与尺度损失,解决4D高斯溅射动态场景重建中的混叠伪影问题。 gaussian splatting scene reconstruction
5 Functional Localization Enforced Deep Anomaly Detection Using Fundus Images 利用眼底图像和功能定位增强的深度异常检测方法 localization
6 Gaze Beyond the Frame: Forecasting Egocentric 3D Visual Span EgoSpanLift:预测第一人称视角下的3D视觉范围,提升AR/VR体验。 SLAM scene understanding localization
7 4D-VGGT: A General Foundation Model with SpatioTemporal Awareness for Dynamic Scene Geometry Estimation 提出4D-VGGT,用于动态场景几何估计的时空感知通用基础模型 VGGT
8 SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes SwiftVGGT:一种可扩展的视觉几何约束Transformer,用于大规模场景三维重建。 VGGT
9 UniFlow: Towards Zero-Shot LiDAR Scene Flow for Autonomous Vehicles via Cross-Domain Generalization UniFlow:通过跨域泛化实现自动驾驶车辆的零样本LiDAR场景流估计 point cloud

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
10 CrossJEPA: Cross-Modal Joint-Embedding Predictive Architecture for Efficient 3D Representation Learning from 2D Images 提出CrossJEPA以解决3D表示学习中的2D图像数据稀缺问题 representation learning point cloud
11 RNN as Linear Transformer: A Closer Investigation into Representational Potentials of Visual Mamba Models 分析Mamba视觉模型表征能力,揭示其与线性Transformer的关联 Mamba linear attention
12 HiFi-MambaV2: Hierarchical Shared-Routed MoE for High-Fidelity MRI Reconstruction HiFi-MambaV2:用于高保真MRI重建的分层共享路由MoE Mamba架构 Mamba

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
13 MimiCAT: Mimic with Correspondence-Aware Cascade-Transformer for Category-Free 3D Pose Transfer MimiCAT:基于对应感知级联Transformer的无类别3D姿态迁移 quadruped humanoid

⬅️ 返回 cs.CV 首页 · 🏠 返回主页