cs.CV(2025-10-26)
📊 共 20 篇论文 | 🔗 4 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (12 🔗4)
支柱三:空间感知与语义 (Perception & Semantics) (4)
支柱二:RL算法与架构 (RL & Architecture) (4)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (12 篇)
🔬 支柱三:空间感知与语义 (Perception & Semantics) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | LVD-GS: Gaussian Splatting SLAM for Dynamic Scenes via Hierarchical Explicit-Implicit Representation Collaboration Rendering | LVD-GS:面向动态场景,基于分层显隐式表达协同渲染的Gaussian Splatting SLAM | 3D gaussian splatting gaussian splatting splatting | ||
| 14 | Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views | 提出Look and Tell数据集,用于研究以自我为中心和以外部为中心视角下的多模态指示交流。 | scene reconstruction egocentric multimodal | ||
| 15 | DynaPose4D: High-Quality 4D Dynamic Content Generation via Pose Alignment Loss | DynaPose4D:提出基于姿态对齐损失的高质量4D动态内容生成方法 | 3D gaussian splatting gaussian splatting splatting | ||
| 16 | Seeing the Unseen: Towards Zero-Shot Inspection for Wind Turbine Blades using Knowledge-Augmented Vision Language Models | 提出基于知识增强视觉语言模型的零样本风力涡轮机叶片缺陷检测方法 | open-vocabulary open vocabulary multimodal |
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Edge Collaborative Gaussian Splatting with Integrated Rendering and Communication | 提出ECO-GS,通过边缘协同高斯溅射提升低成本设备渲染质量 | imitation learning gaussian splatting splatting | ||
| 18 | Mutual Information guided Visual Contrastive Learning | 提出互信息引导的视觉对比学习,提升表征学习在开放环境下的泛化性 | representation learning contrastive learning | ||
| 19 | Alias-Free ViT: Fractional Shift Invariance via Linear Attention | 提出Alias-Free ViT,通过线性注意力实现分数平移不变性,提升ViT的鲁棒性。 | linear attention | ||
| 20 | Single-Teacher View Augmentation: Boosting Knowledge Distillation via Angular Diversity | 提出基于单教师视角增强的知识蒸馏方法,通过角度多样性提升学生模型性能。 | distillation |