| 1 |
Fine-Scale Soil Mapping in Alaska with Multimodal Machine Learning |
提出MISO模型以解决阿拉斯加细尺度土壤制图问题 |
contrastive learning foundation model multimodal |
✅ |
|
| 2 |
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs |
提出SpeechRefer以解决噪声和模糊语音输入下的3D视觉定位问题 |
contrastive learning multimodal visual grounding |
|
|
| 3 |
Earth Observation Foundation Model PhilEO: Pretraining on the MajorTOM and FastTOM Datasets |
提出PhilEO以提升地球观测模型的预训练效率 |
Mamba SSM foundation model |
|
|
| 4 |
DETONATE: A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization |
提出DETONATE基准以优化文本到图像模型的对齐问题 |
DPO direct preference optimization large language model |
|
|
| 5 |
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning |
提出PeRL以解决多图像推理中的空间关系理解问题 |
reinforcement learning multimodal |
|
|
| 6 |
KDMOS:Knowledge Distillation for Motion Segmentation |
提出KDMOS以解决运动物体分割中的实时性与准确性问题 |
distillation scene flow |
✅ |
|
| 7 |
Align Your Flow: Scaling Continuous-Time Flow Map Distillation |
提出连续时间流图蒸馏方法以提升生成模型效率 |
flow matching distillation |
|
|
| 8 |
SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks |
提出SIRI-Bench以评估视觉语言模型的空间智能 |
reinforcement learning large language model |
|
|
| 9 |
Model compression using knowledge distillation with integrated gradients |
提出基于集成梯度的知识蒸馏方法以实现模型压缩 |
distillation |
|
|
| 10 |
HydroChronos: Forecasting Decades of Surface Water Change |
提出HydroChronos以解决水体动态预测数据不足问题 |
MAE spatiotemporal |
|
|