classification of emergency department ct imaging reports using natural language processing and...

7
Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith, Hyeong-Ah Choi This project supported by the NIH National Center for Research Resources (UL1RR031988 and KL2RR031987)

Upload: jeffry-merritt

Post on 04-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith,

Classification of Emergency Department CT Imaging Reports

using Natural Language Processing and Machine Learning

Efsun Sarioglu, Kabir Yadav, Meaghan Smith, Hyeong-Ah Choi

This project supported by the NIH National Center for Research Resources(UL1RR031988 and KL2RR031987)

Page 2: Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith,

Background, Objective & Methods

Use of electronic medical record data for clinical research and quality improvement requires free-text data interpretation for outcomes of interest. Natural language processing has shown

promise for this purpose

To demonstrate real-world performance of a hybrid NLP-machine learning system for automated classification of radiology reports

Page 3: Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith,

Approach Overview Multicenter review of consecutive CT

reports obtained for facial trauma using a trained reference standard Medical Language Extraction and Encoding (MedLEE) WEKA 3.7.5 Salford Systems CART 6.6

Page 4: Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith,

Results

Total reports: 3710

Positive cases: 460 (12.4%)

Manual coding had high reliability Kappa=0.97 [95% CI 0.94-0.99]

Page 5: Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith,

CART Decision Trees (50:50)Raw Text (8-node) NLP (9-node)

Page 6: Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith,

Classification Performance

Raw Text

NLP

Precision

0.949 0.968

Recall 0.932 0.964

F-score 0.940 0.966

Unexpectedly high performance of machine learning without NLP

Comparable to inter-rater performance and prior studies of inter-physician agreement

Comparable to prior real-world and simulation studies

Page 7: Classification of Emergency Department CT Imaging Reports using Natural Language Processing and Machine Learning Efsun Sarioglu, Kabir Yadav, Meaghan Smith,

Concluding Remarks How’s it novel?

One of only a handful of real-world NLP studies using validated reference standard

Translating existing NLP and machine learning technologies to support CER

Next step: validation Test approach using other clinical cases Evaluate different features or

classification algorithms