Porovnat metody
Prohlédněte si vybrané metody vedle sebe; řádky, které se liší, jsou zvýrazněny.
| Rozšíření zkratek× | Extrakce informací× | |
|---|---|---|
| Obor | Dolování textu | Dolování textu |
| Rodina | Process / pipeline | Process / pipeline |
| Rok vzniku≠ | 2003 | — |
| Tvůrce≠ | Schwartz & Hearst (2003) — seminal algorithm for biomedical abbreviation detection | — |
| Typ≠ | NLP disambiguation pipeline | NLP structured-information task |
| Původní zdroj≠ | Schwartz, A.S. & Hearst, M.A. (2003). A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text. Pacific Symposium on Biocomputing (PSB), 8, 451-462. link ↗ | Cowie, J. & Lehnert, W. (1996). Information Extraction. Communications of the ACM. DOI ↗ |
| Další názvy≠ | acronym resolution, abbreviation disambiguation, short-form expansion, Kısaltma ve Akronim Çözümleme | IE, structured information extraction, Bilgi Çıkarma (Information Extraction) |
| Příbuzné | 4 | 4 |
| Shrnutí≠ | Abbreviation and acronym resolution is a natural-language-processing pipeline that maps each short form in a text to its full-length definition using contextual cues from the surrounding text. It is especially important in medical, legal, and technical documents, where the same acronym may carry entirely different meanings across domains. The field's foundational algorithm was published by Schwartz and Hearst (2003) for biomedical literature and has since been extended by neural and transformer-based approaches. | Information extraction (IE) is a natural-language-processing task that converts unstructured text into structured information — such as events, relations, and attributes — so that facts buried in free-form documents become machine-readable records. The task was consolidated in early surveys by Cowie and Lehnert (1996) and later by Grishman (2012). |
| ScholarGateDatová sada ↗ |
|
|