Visual contrastive learning is a self-supervised deep-learning approach, popularised by frameworks such as SimCLR (Chen et al., 2020) and MoCo (He et al., 2020), that learns image representations without labels: the embeddings of different augmentations of the same image are pulled together, while the embeddings of different images are pushed apart. In this way a large pool of unlabelled images is turned into a feature extractor that can be reused for downstream tasks such as classification.
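
The pull-together, push-apart idea can be illustrated with a minimal NumPy sketch of one common form of the objective, the normalised temperature-scaled cross-entropy (NT-Xent) loss used by SimCLR. It is an illustration under stated assumptions, not a reference implementation: the function name `nt_xent_loss`, the default temperature of 0.5, and the use of random vectors in place of encoder outputs are all choices made here for the example.

```python
import numpy as np

def nt_xent_loss(z_a, z_b, temperature=0.5):
    """NT-Xent loss for a batch of paired embeddings.

    z_a[i] and z_b[i] are assumed to be embeddings of two augmentations
    of the same image i; every other embedding in the batch is a negative.
    The temperature default is an illustrative choice.
    """
    n = z_a.shape[0]
    z = np.concatenate([z_a, z_b], axis=0).astype(np.float64)
    # L2-normalise so the dot product below is cosine similarity.
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    sim = (z @ z.T) / temperature
    # An embedding must not count as its own positive or negative.
    np.fill_diagonal(sim, -np.inf)
    # Row i's positive is the other view of the same image.
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])
    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    shifted = sim - row_max
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    # Cross-entropy with the positive as the target class.
    return -log_prob[np.arange(2 * n), pos].mean()

# Stand-ins for encoder outputs: 8 images, 16-dimensional embeddings.
rng = np.random.default_rng(0)
view_a = rng.normal(size=(8, 16))
view_b = view_a + 0.01 * rng.normal(size=(8, 16))  # nearly identical views
unrelated = rng.normal(size=(8, 16))               # views that do not match

loss_matched = nt_xent_loss(view_a, view_b)
loss_unmatched = nt_xent_loss(view_a, unrelated)
print(loss_matched, loss_unmatched)
```

The loss is lower when the two views of each image have similar embeddings than when they are unrelated, which is the signal that training an encoder on this objective exploits. MoCo optimises a closely related contrastive loss but draws its negatives from a queue of past embeddings rather than from the current batch alone; that mechanism is not shown in this sketch.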