So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Mở rộng từ viết tắt× | Trích xuất thông tin× | |
|---|---|---|
| Lĩnh vực | Khai phá văn bản | Khai phá văn bản |
| Họ | Process / pipeline | Process / pipeline |
| Năm ra đời≠ | 2003 | — |
| Người khởi xướng≠ | Schwartz & Hearst (2003) — seminal algorithm for biomedical abbreviation detection | — |
| Loại≠ | NLP disambiguation pipeline | NLP structured-information task |
| Công trình gốc≠ | Schwartz, A.S. & Hearst, M.A. (2003). A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text. Pacific Symposium on Biocomputing (PSB), 8, 451-462. link ↗ | Cowie, J. & Lehnert, W. (1996). Information Extraction. Communications of the ACM. DOI ↗ |
| Tên gọi khác≠ | acronym resolution, abbreviation disambiguation, short-form expansion, Kısaltma ve Akronim Çözümleme | IE, structured information extraction, Bilgi Çıkarma (Information Extraction) |
| Liên quan | 4 | 4 |
| Tóm tắt≠ | Abbreviation and acronym resolution is a natural-language-processing pipeline that maps each short form in a text to its full-length definition using contextual cues from the surrounding text. It is especially important in medical, legal, and technical documents, where the same acronym may carry entirely different meanings across domains. The field's foundational algorithm was published by Schwartz and Hearst (2003) for biomedical literature and has since been extended by neural and transformer-based approaches. | Information extraction (IE) is a natural-language-processing task that converts unstructured text into structured information — such as events, relations, and attributes — so that facts buried in free-form documents become machine-readable records. The task was consolidated in early surveys by Cowie and Lehnert (1996) and later by Grishman (2012). |
| ScholarGateBộ dữ liệu ↗ |
|
|