Porovnat metody
Prohlédněte si vybrané metody vedle sebe; řádky, které se liší, jsou zvýrazněny.
| Těžba klinických textů× | Dolování vědeckých textů× | |
|---|---|---|
| Obor | Dolování textu | Dolování textu |
| Rodina | Process / pipeline | Process / pipeline |
| Rok vzniku≠ | 2000s–2020s (established domain; BioBERT milestone 2020) | 2019–2020 (modern transformer era); roots in earlier computational linguistics |
| Tvůrce≠ | Community-driven; foundational work by i2b2/UTHealth NLP challenges and BioBERT (Lee et al., 2020) | Community-developed; SciBERT (Beltagy et al., 2019) and SPECTER (Cohan et al., 2020) are landmark models |
| Typ≠ | NLP information-extraction pipeline | NLP pipeline for scientific literature |
| Původní zdroj≠ | Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C. H., & Kang, J. (2020). BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics, 36(4), 1234–1240. DOI ↗ | Beltagy, I., Lo, K., & Cohan, A. (2019). SciBERT: A Pretrained Language Model for Scientific Text. EMNLP 2019. link ↗ |
| Další názvy≠ | clinical NLP, clinical information extraction, Klinik Metin Madenciliği | Bilimsel Metin Madenciliği, scholarly NLP, academic text mining, scientific literature mining |
| Příbuzné≠ | 5 | 4 |
| Shrnutí≠ | Clinical text mining is a specialised branch of natural language processing that extracts structured clinical facts — diagnoses, symptoms, medications, treatments, and ICD codes — from unstructured healthcare documents such as discharge summaries, progress notes, and radiology reports. Grounded in biomedical NLP models like BioBERT (Lee et al., 2020) and the i2b2/UTHealth shared-task benchmarks (Stubbs & Uzuner, 2015), it converts free-text clinical narratives into machine-readable data suitable for clinical decision support and health analytics. | Scientific text mining is a natural-language-processing pipeline applied to academic literature. Grounded in domain-specific pretrained models such as SciBERT (Beltagy et al., 2019) and SPECTER (Cohan et al., 2020), it automatically extracts hypotheses, methodologies, findings, and scholarly contributions from full-text papers or abstracts, enabling systematic review automation, research-trend analysis, and science mapping at scale. |
| ScholarGateDatová sada ↗ |
|
|