Machine learningDeep learning / NLP / CV

Explainable Sentence Embeddings

Explainable sentence embeddings combine dense sentence representation learning with post-hoc or intrinsic interpretability tools — such as probing classifiers, LIME, SHAP, or attention attribution — to reveal what linguistic and semantic information is encoded in a sentence vector and why a downstream model makes a given prediction. The goal is to retain the representational power of modern encoders while making their behavior auditable.

MethodMind'de açSoonVideoSoon

Tam yöntemi oku

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Conneau, A., Kruszewski, G., Lample, G., Barrault, L., & Baroni, M. (2018). What you can cram into a single $\vec{v}$ector: Probing sentence embeddings for linguistic properties. In Proceedings of ACL 2018, pp. 2126–2136. link
  2. Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). "Why Should I Trust You?": Explaining the predictions of any classifier. In Proceedings of KDD 2016, pp. 1135–1144. DOI: 10.1145/2939672.2939778

Related methods

ScholarGateExplainable Sentence Embeddings (Explainable Sentence Embeddings (Interpretable Dense Sentence Representations)). Retrieved 2026-06-04 from https://scholargate.app/tr/deep-learning/explainable-sentence-embeddings