TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment

Abstract

TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment

Publication
In Conference on Computer Vision and Pattern Recognition (CVPR) 2026
Date