TULIP (Token-length Upgraded CLIP) is a method to upgrade the caption length of CLIP-like models to perform long caption understanding. This repository contains the code associated with the paper: For ...
The official implementation of CLIP-EBC, proposed in the paper CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification. @article{ma2024clip, title={CLIP-EBC: CLIP Can Count ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results