cs.CV（2023-12-27）

📊 共 7 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (2) 支柱一：机器人控制 (Robot Control) (1 🔗1) 支柱二：RL算法与架构 (RL & Architecture) (1 🔗1) 支柱七：动作重定向 (Motion Retargeting) (1) 支柱四：生成式动作 (Generative Motion) (1 🔗1) 支柱三：空间感知与语义 (Perception & Semantics) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey	视觉指令调优综述：迈向通用多模态模型	multimodal instruction following
2	Blind Image Quality Assessment: A Brief Survey	综述性分析：对无参考图像质量评估（BIQA）的最新进展进行全面分析与讨论。	multimodal

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
3	SVGDreamer: Text Guided SVG Generation with Diffusion Model	SVGDreamer：提出一种基于扩散模型的文本引导SVG生成方法，提升可编辑性、视觉质量和多样性。	manipulation dreamer distillation	✅

🔬 支柱二：RL算法与架构 (RL & Architecture) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
4	X Modality Assisting RGBT Object Tracking	提出X-Net，通过跨模态辅助提升RGBT目标跟踪的鲁棒性和精度。	distillation optical flow interaction transformer	✅

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
5	HMP: Hand Motion Priors for Pose and Shape Estimation from Video	提出基于手部运动先验的HMP模型，用于视频中的手部姿态和形状估计。	latent optimization

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
6	In-Hand 3D Object Reconstruction from a Monocular RGB Video	提出基于单目RGB视频的手持物体三维重建方法，解决接触区域遮挡问题	penetration	✅

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
7	City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web	City-on-Web：首个Web端大规模场景实时神经渲染方法	neural radiance field

⬅️ 返回 cs.CV 首页 · 🏠 返回主页