| 1 |
Generalist versus Specialist Vision Foundation Models for Ocular Disease and Oculomics |
领域专精的RETFound在眼科疾病和眼基因组学任务中优于通用视觉基础模型 |
MAE foundation model |
|
|
| 2 |
RTGMFF: Enhanced fMRI-based Brain Disorder Diagnosis via ROI-driven Text Generation and Multimodal Feature Fusion |
提出RTGMFF框架以提升fMRI脑部疾病诊断准确性 |
Mamba multimodal |
✅ |
|
| 3 |
Resilient Multimodal Industrial Surface Defect Detection with Uncertain Sensors Availability |
提出一种鲁棒的多模态工业表面缺陷检测方法,解决传感器可用性不确定问题。 |
contrastive learning multimodal |
✅ |
|
| 4 |
Empowering Lightweight MLLMs with Reasoning via Long CoT SFT |
长CoT SFT赋能轻量级MLLM推理能力 |
reinforcement learning multimodal chain-of-thought |
|
|
| 5 |
AIVA: An AI-based Virtual Companion for Emotion-aware Interaction |
AIVA:一种基于AI的情感感知交互虚拟助手 |
contrastive learning large language model multimodal |
|
|
| 6 |
Teacher-Student Model for Detecting and Classifying Mitosis in the MIDOG 2025 Challenge |
提出基于Teacher-Student模型的有丝分裂检测与分类方法,提升领域泛化性。 |
representation learning teacher-student |
|
|
| 7 |
PPORLD-EDNetLDCT: A Proximal Policy Optimization-Based Reinforcement Learning Framework for Adaptive Low-Dose CT Denoising |
提出基于近端策略优化的强化学习框架PPORLD-EDNetLDCT,用于自适应低剂量CT降噪。 |
reinforcement learning PPO |
|
|
| 8 |
Multi Attribute Bias Mitigation via Representation Learning |
提出GMBM框架,通过表征学习缓解视觉模型中的多重属性偏差问题 |
representation learning |
✅ |
|
| 9 |
PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection |
PointAD+:学习分层表示,实现零样本3D异常检测 |
representation learning spatial relationship |
|
|
| 10 |
Towards Efficient General Feature Prediction in Masked Skeleton Modeling |
提出通用特征预测框架,加速并提升掩码骨骼建模的动作识别性能。 |
masked autoencoder MAE |
|
|