ScholarGate
Assistent

Compara mètodes

Revisa els mètodes seleccionats l'un al costat de l'altre; les files que difereixen es ressalten.

Detecció d'objectes explicable×Explainable Vision Transformer×
CampAprenentatge profundAprenentatge profund
FamíliaMachine learningMachine learning
Any d'origen2016–20172021
Autor originalSelvaraju et al. (Grad-CAM); Ribeiro et al. (LIME); Lundberg & Lee (SHAP)Chefer, H., Gur, S., & Wolf, L. (attribution framework); Dosovitskiy et al. (base ViT)
TipusPost-hoc explainability applied to object detectionPost-hoc explainability applied to Vision Transformer
Font seminalSelvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017). Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 618–626. DOI ↗Chefer, H., Gur, S., & Wolf, L. (2021). Transformer interpretability beyond attention visualization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 782–791. DOI ↗
ÀliesXAI Object Detection, Interpretable Object Detection, Transparent Object Detection, Explainable ODXViT, Interpretable ViT, Explainable ViT, Transparent Vision Transformer
Relacionats55
ResumExplainable object detection combines a deep-learning object detector — such as YOLO, Faster R-CNN, or DETR — with post-hoc or built-in explainability methods (Grad-CAM, LIME, SHAP, D-RISE) that visualize why the model placed a bounding box at a particular location and assigned a particular class label, making its decisions auditable by humans.Explainable Vision Transformer combines the strong image-recognition performance of Vision Transformers (ViT) with attribution techniques — such as relevance propagation, attention rollout, or gradient-weighted attention — that highlight which image regions drive each prediction. The approach enables researchers and practitioners to audit model decisions and satisfy transparency requirements without sacrificing accuracy.
ScholarGateConjunt de dades
  1. v1
  2. 2 Fonts
  3. PUBLISHED
  1. v1
  2. 2 Fonts
  3. PUBLISHED

Ves a la cerca Baixa les diapositives

ScholarGateCompara mètodes: Explainable Object Detection · Explainable Vision Transformer. Recuperat el 2026-06-15 de https://scholargate.app/ca/compare