Porovnať metódy
Prezrite si vybrané metódy vedľa seba; riadky, ktoré sa líšia, sú zvýraznené.
| Rozšírenie skratiek× | Extrakcia informácií× | |
|---|---|---|
| Odbor | Dolovanie textu | Dolovanie textu |
| Rodina | Process / pipeline | Process / pipeline |
| Rok vzniku≠ | 2003 | — |
| Tvorca≠ | Schwartz & Hearst (2003) — seminal algorithm for biomedical abbreviation detection | — |
| Typ≠ | NLP disambiguation pipeline | NLP structured-information task |
| Pôvodný zdroj≠ | Schwartz, A.S. & Hearst, M.A. (2003). A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text. Pacific Symposium on Biocomputing (PSB), 8, 451-462. link ↗ | Cowie, J. & Lehnert, W. (1996). Information Extraction. Communications of the ACM. DOI ↗ |
| Ďalšie názvy≠ | acronym resolution, abbreviation disambiguation, short-form expansion, Kısaltma ve Akronim Çözümleme | IE, structured information extraction, Bilgi Çıkarma (Information Extraction) |
| Príbuzné | 4 | 4 |
| Zhrnutie≠ | Abbreviation and acronym resolution is a natural-language-processing pipeline that maps each short form in a text to its full-length definition using contextual cues from the surrounding text. It is especially important in medical, legal, and technical documents, where the same acronym may carry entirely different meanings across domains. The field's foundational algorithm was published by Schwartz and Hearst (2003) for biomedical literature and has since been extended by neural and transformer-based approaches. | Information extraction (IE) is a natural-language-processing task that converts unstructured text into structured information — such as events, relations, and attributes — so that facts buried in free-form documents become machine-readable records. The task was consolidated in early surveys by Cowie and Lehnert (1996) and later by Grishman (2012). |
| ScholarGateDátová sada ↗ |
|
|