cs.CV (2025-05-09)
📊 6 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 2: RL Algorithms and Architecture (3)
Pillar 9: Embodied Foundation Models (2 🔗1)
Pillar 3: Spatial Perception and Semantics (1)
🔬 Pillar 2: RL Algorithms and Architecture (3 papers)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Temperature-Driven Robust Disease Detection in Brain and Gastrointestinal Disorders via Context-Aware Adaptive Knowledge Distillation | Proposes a temperature-driven knowledge distillation framework to improve the robustness of brain and gastrointestinal disease detection | teacher-student distillation | | |
| 2 | Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation | Proposes Topo-VM-UNetV2 to address the capture of topological features in polyp segmentation | Mamba state space model | | |
| 3 | VIN-NBV: A View Introspection Network for Next-Best-View Selection | Proposes VIN-NBV to address next-best-view selection in complex scenes | reinforcement learning, deep reinforcement learning | | |
🔬 Pillar 9: Embodied Foundation Models (2 papers)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Adapting a Segmentation Foundation Model for Medical Image Classification | Proposes a new framework that adapts SAM for medical image classification | foundation model | | |
| 5 | MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | Proposes MM-Skin to address the shortage of multimodal data in dermatology | multimodal instruction following | ✅ | |
🔬 Pillar 3: Spatial Perception and Semantics (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles | Proposes a camera-only bird's-eye-view perception framework to remove the dependence on LiDAR | depth estimation, monocular depth, Depth Anything | | |