방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| 교차 문서 개체 추적× | 정보 추출× | |
|---|---|---|
| 분야 | 텍스트 마이닝 | 텍스트 마이닝 |
| 계열 | Process / pipeline | Process / pipeline |
| 기원 연도≠ | 1998 (scoring foundations); 2019 (neural joint model) | — |
| 창시자 | — | — |
| 유형≠ | NLP pipeline — cross-document coreference resolution | NLP structured-information task |
| 원전≠ | Bagga, A. & Baldwin, B. (1998). Algorithms for Scoring Coreference Chains. In Proceedings of the LREC 1998 Linguistic Coreference Workshop, pp. 563–566. link ↗ | Cowie, J. & Lehnert, W. (1996). Information Extraction. Communications of the ACM. DOI ↗ |
| 별칭 | cross-document coreference resolution, cross-doc entity linking, Belge Ötesi Varlık Takibi | IE, structured information extraction, Bilgi Çıkarma (Information Extraction) |
| 관련 | 4 | 4 |
| 요약≠ | Cross-document entity tracking, formally known as cross-document coreference resolution, identifies and merges all references to the same real-world entity scattered across a collection of documents. Rooted in the B3 evaluation framework introduced by Bagga and Baldwin (1998) and substantially advanced by the neural joint model of Barhom et al. (2019), the method builds entity clusters that span document boundaries — enabling multi-document understanding, knowledge-base population, and corpus-wide entity analysis. | Information extraction (IE) is a natural-language-processing task that converts unstructured text into structured information — such as events, relations, and attributes — so that facts buried in free-form documents become machine-readable records. The task was consolidated in early surveys by Cowie and Lehnert (1996) and later by Grishman (2012). |
| ScholarGate데이터셋 ↗ |
|
|