Uporedite metode
Pregledajte izabrane metode jednu pored druge; redovi koji se razlikuju su istaknuti.
| Praćenje entiteta kroz dokumenta× | Ekstrakcija informacija× | |
|---|---|---|
| Oblast | Rudarenje teksta | Rudarenje teksta |
| Porodica | Process / pipeline | Process / pipeline |
| Godina nastanka≠ | 1998 (scoring foundations); 2019 (neural joint model) | — |
| Tvorac | — | — |
| Tip≠ | NLP pipeline — cross-document coreference resolution | NLP structured-information task |
| Temeljni izvor≠ | Bagga, A. & Baldwin, B. (1998). Algorithms for Scoring Coreference Chains. In Proceedings of the LREC 1998 Linguistic Coreference Workshop, pp. 563–566. link ↗ | Cowie, J. & Lehnert, W. (1996). Information Extraction. Communications of the ACM. DOI ↗ |
| Drugi nazivi | cross-document coreference resolution, cross-doc entity linking, Belge Ötesi Varlık Takibi | IE, structured information extraction, Bilgi Çıkarma (Information Extraction) |
| Srodne | 4 | 4 |
| Sažetak≠ | Cross-document entity tracking, formally known as cross-document coreference resolution, identifies and merges all references to the same real-world entity scattered across a collection of documents. Rooted in the B3 evaluation framework introduced by Bagga and Baldwin (1998) and substantially advanced by the neural joint model of Barhom et al. (2019), the method builds entity clusters that span document boundaries — enabling multi-document understanding, knowledge-base population, and corpus-wide entity analysis. | Information extraction (IE) is a natural-language-processing task that converts unstructured text into structured information — such as events, relations, and attributes — so that facts buried in free-form documents become machine-readable records. The task was consolidated in early surveys by Cowie and Lehnert (1996) and later by Grishman (2012). |
| ScholarGateSkup podataka ↗ |
|
|