Today we know what the world looks like. We've taken pictures of it from space! We know there are many countries, some far away, and that there are lots of different people. But for thousands of ...
Given that recent research has demonstrated the efficacy of large-scale vision and language pretrained (VLP) models in various tasks, we rely on features from the OpenAI CLIP model to tackle the ...