Machine learningDeep learning / NLP / CV
Self-supervised Object Detection
Self-supervised object detection uses unlabeled image data to pre-train a visual backbone through pretext tasks such as contrastive learning or masked image modeling, then fine-tunes the backbone with a detection head on a smaller labeled dataset. This approach dramatically reduces reliance on expensive bounding-box annotations while matching or approaching fully supervised detection performance.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- He, K., Fan, H., Wu, Y., Xie, S., & Girshick, R. (2020). Momentum Contrast for Unsupervised Visual Representation Learning. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 9729–9738. DOI: 10.1109/CVPR42600.2020.00975 ↗
- Caron, M., Touvron, H., Misra, I., Jégou, H., Mairal, J., Bojanowski, P., & Joulin, A. (2021). Emerging Properties in Self-Supervised Vision Transformers. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 9650–9660. DOI: 10.1109/ICCV48922.2021.00951 ↗