Porovnat metody
Prohlédněte si vybrané metody vedle sebe; řádky, které se liší, jsou zvýrazněny.
| Sledování entit napříč dokumenty× | Extrakce informací× | |
|---|---|---|
| Obor | Dolování textu | Dolování textu |
| Rodina | Process / pipeline | Process / pipeline |
| Rok vzniku≠ | 1998 (scoring foundations); 2019 (neural joint model) | — |
| Tvůrce | — | — |
| Typ≠ | NLP pipeline — cross-document coreference resolution | NLP structured-information task |
| Původní zdroj≠ | Bagga, A. & Baldwin, B. (1998). Algorithms for Scoring Coreference Chains. In Proceedings of the LREC 1998 Linguistic Coreference Workshop, pp. 563–566. link ↗ | Cowie, J. & Lehnert, W. (1996). Information Extraction. Communications of the ACM. DOI ↗ |
| Další názvy | cross-document coreference resolution, cross-doc entity linking, Belge Ötesi Varlık Takibi | IE, structured information extraction, Bilgi Çıkarma (Information Extraction) |
| Příbuzné | 4 | 4 |
| Shrnutí≠ | Cross-document entity tracking, formally known as cross-document coreference resolution, identifies and merges all references to the same real-world entity scattered across a collection of documents. Rooted in the B3 evaluation framework introduced by Bagga and Baldwin (1998) and substantially advanced by the neural joint model of Barhom et al. (2019), the method builds entity clusters that span document boundaries — enabling multi-document understanding, knowledge-base population, and corpus-wide entity analysis. | Information extraction (IE) is a natural-language-processing task that converts unstructured text into structured information — such as events, relations, and attributes — so that facts buried in free-form documents become machine-readable records. The task was consolidated in early surveys by Cowie and Lehnert (1996) and later by Grishman (2012). |
| ScholarGateDatová sada ↗ |
|
|