cs.CV (2025-10-14)

📊 8 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (3) · Pillar 3: Spatial Perception & Semantics (2, 🔗 1) · Pillar 2: RL Algorithms & Architecture (1) · Pillar 4: Generative Motion (1) · Pillar 8: Physics-based Animation (1)

🔬 Pillar 9: Embodied Foundation Models (3 papers)

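| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 1 | CrossRay3D: Geometry and Distribution Guidance for Efficient Multimodal 3D Detection | CrossRay3D: improves the efficiency of multimodal 3D detection through geometry and distribution guidance | multimodal | |
| 2 | IL3D: A Large-Scale Indoor Layout Dataset for LLM-Driven 3D Scene Generation | IL3D: a large-scale indoor layout dataset for LLM-driven 3D scene generation | large language model, multimodal | |
| 3 | MultiFoodChat: A potential new paradigm for intelligent food quality inspection | Proposes MultiFoodChat, a dialogue-driven multi-agent reasoning framework for zero-shot food recognition | large language model | |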

🔬 Pillar 3: Spatial Perception & Semantics (2 papers)

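| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 4 | DrivingScene: A Multi-Task Online Feed-Forward 3D Gaussian Splatting Method for Dynamic Driving Scenes | Proposes DrivingScene, a multi-task, online, feed-forward 3D Gaussian splatting method for reconstructing dynamic driving scenes | 3D gaussian splatting, gaussian splatting, splatting | |
| 5 | G4Splat: Geometry-Guided Gaussian Splatting with Generative Prior | G4Splat: high-quality Gaussian splatting scene reconstruction using a generative prior and geometric guidance | gaussian splatting, splatting, scene reconstruction | |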

🔬 Pillar 2: RL Algorithms & Architecture (1 paper)

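| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 6 | DRL: Discriminative Representation Learning with Parallel Adapters for Class Incremental Learning | Proposes the DRL framework to address representation shift in incremental learning | DRL, representation learning | |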

🔬 Pillar 4: Generative Motion (1 paper)

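| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 7 | Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback | Playmate2: training-free, multi-character, audio-driven animation based on a diffusion Transformer with reward feedback | classifier-free guidance, character animation, foundation model | |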

🔬 Pillar 8: Physics-based Animation (1 paper)

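| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 8 | Hardware-aware Coding Function Design for Compressive Single-Photon 3D Cameras | Proposes a hardware-aware coding function design method that accounts for the hardware constraints of single-photon 3D cameras | PULSE | |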
