cs.CV（2023-12-30）

📊 共 8 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (3 🔗1) 支柱三：空间感知与语义 (Perception & Semantics) (3) 支柱二：RL算法与架构 (RL & Architecture) (1) 支柱六：视频提取与匹配 (Video Extraction) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Pushing Boundaries: Exploring Zero Shot Object Classification with Large Multimodal Models	探索大型多模态模型在零样本目标分类中的应用潜力	large language model multimodal
2	GazeCLIP: Enhancing Gaze Estimation Through Text-Guided Multimodal Learning	提出GazeCLIP以解决视觉注视估计中的语言信息不足问题	multimodal
3	Promoting Segment Anything Model towards Highly Accurate Dichotomous Image Segmentation	提出DIS-SAM，提升SAM在二分图像分割任务中的精度，尤其在边界细节方面。	foundation model	✅

🔬 支柱三：空间感知与语义 (Perception & Semantics) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
4	3D Human Pose Perception from Egocentric Stereo Videos	提出基于Transformer的框架，利用场景信息和时序信息提升自中心立体视频中的3D人体姿态感知。	scene reconstruction egocentric
5	Inpaint4DNeRF: Promptable Spatio-Temporal NeRF Inpainting with Generative Diffusion Models	Inpaint4DNeRF：利用生成扩散模型实现可控的时空NeRF图像修复	NeRF neural radiance field
6	PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields	PlanarNeRF：提出一种在线学习的神经辐射场平面基元检测方法	neural radiance field

🔬 支柱二：RL算法与架构 (RL & Architecture) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
7	Explainability-Driven Leaf Disease Classification Using Adversarial Training and Knowledge Distillation	提出结合对抗训练、可解释性和知识蒸馏的叶片病害分类方法	distillation

🔬 支柱六：视频提取与匹配 (Video Extraction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
8	SHARE: Single-view Human Adversarial REconstruction	提出SHARE对抗微调框架，提升单视角人体姿态与形状重建对不同相机角度的鲁棒性	HMR

⬅️ 返回 cs.CV 首页 · 🏠 返回主页