手法を比較
選択した手法を並べて確認できます。異なる行はハイライト表示されます。
| 説明可能な拡散モデル× | 説明可能なVision Transformer× | |
|---|---|---|
| 分野 | 深層学習 | 深層学習 |
| 系統 | Machine learning | Machine learning |
| 提唱年≠ | 2020–2022 | 2021 |
| 提唱者≠ | Ho, J., Jain, A., & Abbeel, P. (DDPM, 2020); XAI augmentation by subsequent researchers | Chefer, H., Gur, S., & Wolf, L. (attribution framework); Dosovitskiy et al. (base ViT) |
| 種類≠ | Generative model with post-hoc or intrinsic explainability | Post-hoc explainability applied to Vision Transformer |
| 原典≠ | Ho, J., Jain, A., & Abbeel, P. (2020). Denoising Diffusion Probabilistic Models. Advances in Neural Information Processing Systems, 33, 6840–6851. link ↗ | Chefer, H., Gur, S., & Wolf, L. (2021). Transformer interpretability beyond attention visualization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 782–791. DOI ↗ |
| 別名 | XAI-DDPM, interpretable diffusion model, transparent diffusion model, explainable DDPM | XViT, Interpretable ViT, Explainable ViT, Transparent Vision Transformer |
| 関連≠ | 6 | 5 |
| 概要≠ | An Explainable Diffusion Model couples a denoising diffusion probabilistic model with post-hoc or intrinsic explainability techniques — such as SHAP, gradient-based saliency, attention analysis, or concept-based probing — so that each generative or predictive decision can be audited and justified rather than treated as a black box. | Explainable Vision Transformer combines the strong image-recognition performance of Vision Transformers (ViT) with attribution techniques — such as relevance propagation, attention rollout, or gradient-weighted attention — that highlight which image regions drive each prediction. The approach enables researchers and practitioners to audit model decisions and satisfy transparency requirements without sacrificing accuracy. |
| ScholarGateデータセット ↗ |
|
|