in Zeroshot_with_CLIP.ipynb we build a zero-shop classifier using the pretrained CLIP network and improve its performance with descriptors generated with GPT.
CLIP Learning Transferable Visual Models From Natural Language Supervision (ICML 2021) Alec Radford et al.
Visual Classification via Description from Large Language Models (ICLR 2023) Menon, Sachit and Vondrick, Carl