CVPR 2022 | X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval

Computer Vision and Pattern Recognition Conference
Download PDF