Extracting alcohol and substance abuse status from clinical notes: The added value of nursing data

Extracting alcohol and substance abuse status from clinical notes: The added value of nursing data

מאת: Topaz M., Murga L., Bar-Bachar O., Cato K., Collins S.
פורסם ב: Studies in Health Technology and Informatics
תיאור: We applied an open source natural language processing (NLP) system “NimbleMiner” to identify clinical notes with mentions of alcohol and substance abuse. NimbleMiner allows users to rapidly discover clinical vocabularies (using word embedding model) and then implement machine learning for text classification. We used a large inpatient dataset with over 50,000 intensive care unit admissions (MIMIC II). Clinical notes included physician-written discharge summaries (n = 51,201) and nursing notes (n = 412,343). We first used physician-written discharge summaries to train the system's algorithm and then added nursing notes to the physician-written discharge summaries and evaluated algorithms prediction accuracy. Adding nursing notes to the physician-written discharge summaries resulted in almost two-fold vocabulary expansion. NimbleMiner slightly outperformed other state-of-the-art NLP systems (average F-score = .84), while requiring significantly less time for the algorithms development.: Our findings underline the importance of nursing data for the analysis of electronic patient records. © 2019 International Medical Informatics Association (IMIA) and IOS Press.
SDGs : SDG 03  |  יחידות:   | מועד: 2019 |  קישור