Process / pipeline
Clinical Text Mining — Clinical NLP Information Extraction
Clinical text mining is a specialised branch of natural language processing that extracts structured clinical facts — diagnoses, symptoms, medications, treatments, and ICD codes — from unstructured healthcare documents such as discharge summaries, progress notes, and radiology reports. Grounded in biomedical NLP models like BioBERT (Lee et al., 2020) and the i2b2/UTHealth shared-task benchmarks (Stubbs & Uzuner, 2015), it converts free-text clinical narratives into machine-readable data suitable for clinical decision support and health analytics.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Lee, J., Yoon, W., Kim, S., Kim, D., Kim, S., So, C. H., & Kang, J. (2020). BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics, 36(4), 1234–1240. DOI: 10.1093/bioinformatics/btz682 ↗
- Stubbs, A. & Uzuner, Ö. (2015). Annotating risk factors for heart disease in clinical narratives for the 2014 i2b2/UTHealth shared task. Journal of the American Medical Informatics Association, 22(e1), e30–e39. link ↗