Machine learningDeep learning / NLP / CV

Explainable GAN

Explainable GAN applies interpretability techniques to Generative Adversarial Networks to reveal which internal units and latent directions cause specific visual or structural features in generated outputs. It combines GAN training with post-hoc analysis tools — such as unit dissection, saliency maps, or disentangled latent spaces — to make generative model behaviour transparent and auditable.

Open in MethodMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Bau, D., Zhu, J.-Y., Strobelt, H., Zhou, B., Tenenbaum, J. B., Freeman, W. T., & Torralba, A. (2019). GAN Dissection: Visualizing and Understanding Generative Adversarial Networks. In Proceedings of the International Conference on Learning Representations (ICLR 2019). link
  2. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2014). Generative Adversarial Nets. In Advances in Neural Information Processing Systems (NeurIPS 2014), 27. link

Related methods

Referenced by

ScholarGateExplainable GAN (Explainable Generative Adversarial Network). Retrieved 2026-06-04 from https://scholargate.app/en/deep-learning/explainable-gan