Compara mètodes
Revisa els mètodes seleccionats l'un al costat de l'altre; les files que difereixen es ressalten.
| Aprenentatge per transferència amb segmentació d'instàncies× | Aprenentatge per transferència amb classificació d'imatges× | |
|---|---|---|
| Camp | Aprenentatge profund | Aprenentatge profund |
| Família | Machine learning | Machine learning |
| Any d'origen≠ | 2017 (Mask R-CNN); transfer learning paradigm: 2010 | 2010–2012 |
| Autor original≠ | He, K. et al. (Mask R-CNN); transfer learning framework: Pan & Yang | Pan, S. J. & Yang, Q. (transfer learning framework); Krizhevsky, Sutskever & Hinton (deep CNN backbone) |
| Tipus≠ | Transfer learning applied to instance segmentation | Transfer learning / supervised classification |
| Font seminal≠ | He, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗ | Pan, S. J., & Yang, Q. (2010). A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering, 22(10), 1345–1359. DOI ↗ |
| Àlies | pretrained instance segmentation, fine-tuned Mask R-CNN, transfer learning for panoptic segmentation, domain-adapted instance segmentation | pretrained CNN image classification, fine-tuned image classifier, domain-adapted image classifier, TL-IC |
| Relacionats | 4 | 4 |
| Resum≠ | Transfer learning with instance segmentation reuses a backbone convolutional network pretrained on a large image corpus (typically ImageNet or COCO) as the feature extractor for an instance segmentation model such as Mask R-CNN, then fine-tunes the full pipeline on a smaller target dataset. This approach delivers state-of-the-art per-object mask accuracy with a fraction of the labeled data and compute that training from scratch would require. | Transfer Learning with Image Classification reuses a deep neural network backbone — typically a CNN or Vision Transformer — pretrained on a large dataset such as ImageNet, and adapts it to classify images in a new target domain. By inheriting general visual features from the source task, the approach achieves high accuracy with far fewer labeled images than training from scratch. |
| ScholarGateConjunt de dades ↗ |
|
|