Method comparison
The selected methods are shown side by side; rows that differ are marked with ≠.
| | Explainable Semantic Segmentation | Attention Mechanism |
|---|---|---|
| Field | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Year introduced ≠ | 2019–2021 | 2015 |
| Originator ≠ | Combination: Long et al. (FCN) + Selvaraju et al. (Grad-CAM); formalized as a unified paradigm ~2019–2021 | Bahdanau, D.; Luong, M.T. |
| Type ≠ | Explainable deep learning pipeline | Neural attention layer (encoder-decoder) |
| Original work ≠ | Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017). Grad-CAM: Visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 618–626. | Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural Machine Translation by Jointly Learning to Align and Translate. ICLR. |
| Also known as ≠ | XSS, interpretable semantic segmentation, explainable scene parsing, transparent pixel-wise classification | Dikkat Mekanizması (Turkish; Bahdanau / Luong Attention), neural attention, additive attention |
| Related ≠ | 4 | 5 |
| Summary ≠ | Explainable Semantic Segmentation (XSS) couples pixel-wise scene parsing (assigning a class label to every pixel in an image) with post-hoc or intrinsic explanation methods such as Grad-CAM, attention maps, or SHAP, so that the network's class decisions can be audited, visualized, and justified to domain experts in medical imaging, autonomous driving, and remote sensing. | The attention mechanism, introduced by Bahdanau, Cho and Bengio in 2015 and refined by Luong, Pham and Manning the same year, lets a sequence decoder dynamically learn which of the encoder's outputs to focus on at each step. Before the Transformer, it substantially improved machine-translation quality by freeing models from compressing an entire input into a single fixed vector. |
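
To make the attention summary in the table concrete, here is a minimal sketch of one decoder step of additive (Bahdanau-style) attention in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the function name `additive_attention`, the parameter names `W_dec`, `W_enc`, and `v`, and the random toy dimensions are all hypothetical, and no training is shown.

```python
import math
import random


def additive_attention(decoder_state, encoder_outputs, W_dec, W_enc, v):
    """One decoder step of additive attention (illustrative sketch).

    decoder_state:   list of d_dec floats (current decoder hidden state)
    encoder_outputs: list of T vectors, each with d_enc floats
    W_dec, W_enc:    projection matrices with d_attn rows (hypothetical names)
    v:               scoring vector with d_attn floats (hypothetical name)
    Returns (context, weights).
    """
    def matvec(matrix, vector):
        return [sum(m * x for m, x in zip(row, vector)) for row in matrix]

    dec_proj = matvec(W_dec, decoder_state)

    # score_i = v . tanh(W_dec s + W_enc h_i), one score per encoder output
    scores = []
    for h in encoder_outputs:
        enc_proj = matvec(W_enc, h)
        hidden = [math.tanh(a + b) for a, b in zip(dec_proj, enc_proj)]
        scores.append(sum(vi * hi for vi, hi in zip(v, hidden)))

    # Softmax over the scores (shifted by the max for numerical stability)
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: attention-weighted sum of the encoder outputs
    d_enc = len(encoder_outputs[0])
    context = [
        sum(w * h[j] for w, h in zip(weights, encoder_outputs))
        for j in range(d_enc)
    ]
    return context, weights


# Toy example with random, untrained parameters (assumed sizes)
random.seed(0)
T, d_enc, d_dec, d_attn = 5, 8, 6, 4


def rand_matrix(rows, cols):
    return [[random.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]


encoder_outputs = rand_matrix(T, d_enc)
decoder_state = [random.gauss(0, 1) for _ in range(d_dec)]
W_dec = rand_matrix(d_attn, d_dec)
W_enc = rand_matrix(d_attn, d_enc)
v = [random.gauss(0, 1) for _ in range(d_attn)]

context, weights = additive_attention(decoder_state, encoder_outputs, W_dec, W_enc, v)
print("attention weights:", [round(w, 3) for w in weights])
```

The weights form a probability distribution over the encoder outputs, so the decoder receives a different context vector at each step instead of a single fixed summary of the input.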