scaling up image annotation for deep learning: … · scaling up image annotation for deep...

32
S CALING UP IMAGE ANNOTATION FOR DEEP LEARNING : S TANDARDS , LABELS FROM TEXT , AND LEVERAGING MULTI - INSTITUTIONAL DATA Daniel L. Rubin, MD, MS Professor of Biomedical Data Science, Radiology, Medicine (Biomedical Informatics), and Ophthalmology (by courtesy) Stanford University

Upload: others

Post on 28-May-2020

35 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

SCALING UP IMAGE ANNOTATION FORDEEP LEARNING: STANDARDS, LABELSFROM TEXT, AND LEVERAGING MULTI-

INSTITUTIONAL DATA

Daniel L. Rubin, MD, MS

Professor of Biomedical Data Science, Radiology, Medicine (Biomedical Informatics), and

Ophthalmology (by courtesy)Stanford University

Page 2: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

AcknowledgementsStudents, Post-docs, Residents, Staff, and Collaborators

– Bao Do

– Selen Bozkurt

– Assaf Hoogi

Funding Support– NCI QIN grants

U01CA142555,1U01CA190214, 1U01CA187947

– Stanford-AstraZeneca Collaboration Grant– NVIDIA Academic Hardware Grant Program– Stanford Philips and GE BlueSky

– Alfiia Galimzianova

– Imon Banerjee

– Christopher Re

– Sandy Napel

– Chris Beaulieu– Darvin Yi

– Xuerong Xiao

– Carson Lam

– Blaine Rister

– Hersh Sagreiya

– Emel Alkim

– Ann Leung

– Matthew Lungren

– Jared Dunnmon

– David Conn

– Mete Akdogan

– Niranjan Balachandar

– Curt Langlotz

– Ted Leng

– Joelle Hallak

– Luis de Sisternes

– Zaid Nabulsi

– Michael Gensheimer

Page 3: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Challenges to scaling up image annotation for deep learning Varying data/file formats for saving image

annotations Difficulty leveraging free text radiology

reports as a source for labels for images Hurdles to sharing data across institutions

to build more robust AI models

Page 4: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Challenges to scaling up image annotation for deep learning Varying data/file formats for saving image

annotations Difficulty leveraging free text radiology

reports as a source for labels for images Hurdles to sharing data across institutions

to build more robust AI models

Page 5: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Detection,Segmentation

Classification,Diagnosis

Image annotations are crucial for AI

ROI1

ROI2

Page 6: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Varying file formats for image annotations Regions of interest

(ROIs) and image labels◦ DICOM-PS◦ Burned-in image◦ Proprietary formats

Clinical labels (diagnoses, findings, patient outcomes◦ EMR◦ Spreadsheets◦ Delimited files◦ Proprietary formats

Page 7: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Vendor 4

Lack of image annotation standards thwarts interoperability

Vendor 1 Vendor 3

Vendor 2 3D Slicer

Copyright © Daniel Rubin 2015

Page 8: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Annotation and Image Markup (AIM) XML schema to make the information that

humans and machines see in images machine-accessible in standard format

Enables interoperability of this information across systems and computer applications

Developed by National Cancer Imaging Program at NCI

Harmonized/incorporated into DICOM-SRRubin DL, et. al: Medical Imaging on the Semantic Web: Annotation and Image Markup, AAAI 2008.https://wiki.nci.nih.gov/display/AIM/Annotation+and+Image+Markup+-+AIM

Copyright © Daniel Rubin 2018

Page 9: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

AIM captures annotations in XML

Copyright © Daniel Rubin 2017

QUALITATIVE

QUANTITATIVE

Page 10: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Anatomic Entity: Upper lobe of left lung (RID1327)Observation: Mass (RID3874)

Characteristic: Microlobulated margin (RID5712)Geometric Shape: Polyline

2D coordinates: {(x,y), (x,y)….}Calculation: Largest diameter result: 2.8 cmDiagnosis: Lung cancer

DICOM SR (TID 1500)

XML

HL7 CDA/FHIR

AIM annotations interoperate with other standards

Copyright © Daniel Rubin 2017

https://github.com/NCIP/annotation-and-image-markup/tree/master/AIMToolkit_v3.0.2_rv11/examples/ANIVATR

Page 11: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

eLectronic Physician Annotation Device ePAD: free, open source Web-based image viewer and annotator AIM-compliant annotation; supports AIM templates Plugins for quantifying lesion features

Template

ROI

Values

Rubin, Willrett, O'Connor, Hage, Kurtz, Moreira, Translational Oncology 7(1):23-35, 2014http://epad.stanford.edu

Quantitative image features

Annotations linked to images

Qualitative image features

Page 12: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

AIM being used for public sharing of image annotations The Cancer Genome Atlas (TCGA) imaging

projects◦ Brain cancer◦ Breast cancer◦ Bladder Cancer

The Cancer Imaging Archive (TCIA) Quantitative Imaging Network (QIN) of NCI

Page 13: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Challenges to scaling up image annotation for deep learning Varying data/file formats for saving image

annotations Difficulty leveraging free text radiology

reports as a source for labels for images Hurdles to sharing data across institutions

to build more robust AI models

Page 14: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Copyright © Stanford University 2018

Motivating challenges for needing to use free text reports• Scarcity of annotated images -

need millions of images to train a complex neural network

• Annotation is a laborious, time consuming and expensive

• Radiology reports are associated with routine clinical images that could be leveraged

Page 15: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Radiological image annotation: leveraging clinical notes• PACS contains millions of images “labeled” in the form of

unstructured notes.• Why not to use the notes for annotating the images?

• Unstructured free text cannot be directly interpreted by a machine due to the ambiguity and subtlety of natural language.

• How to extract the semantic information from the clinical notes?

Radiologist’s noteCT image

Copyright © Stanford University 2018

Page 16: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Word embeddings to identify annotation labels from narrative text

Unsupervised deep learning algorithms (e.g., word2vec) can learn a feature representation from texts without the need of supplying specific domain knowledge

Word embedding using deep learning (4,442 words) projected in two dimensions

Imon Banerjee, JDI 30:506-518, 2017

Page 17: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Ontocrawler: Generating domain dictionaries for annotation tasks Created an ontology crawler using SPARQL that

grabs the sub-classes and synonyms of the domain-specific terms from NCBO bio-portal.

Generate a focused dictionary for each domain of radiology.

• {‘apoplexy’, ‘contusion’, ‘hematoma’, ...} ‘hemorrhage’

Copyright © Stanford University 2018

Page 18: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Intelligent word embedding pipeline

Copyright © Stanford University 2018

Page 19: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Word embedding + classification model Stores each word in as a point in vector space Unsupervised, built just by reading huge corpus Can be used as features to train a supervised model with a

small subset of annotations Reusable/extensible to many text extraction use cases

Word embedding

CorpusDocument embedding Classifier

Positive

Negative

Document classificationMikolov, Distributed representations of words and phrases and their compositionality

Copyright © Stanford University 2018 Imon Banerjee, In preparation

Page 20: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Example 1: Head CT Task: Label intracranial hemorrhage based on radiology

report Dataset: ◦ 10,000 CT reports from Stanford◦ ~900 CT reports from UPMC

Gold-standard annotation:◦ Subset of 1,188 of reports labeled independently by two

radiologists (agreement ~0.98 kappa score) Classification labels:◦ No intracranial hemorrhage◦ Diagnosis of intracranial hemorrhage unlikely, though cannot be

completely excluded◦ Diagnosis of intracranial hemorrhage possible◦ Diagnosis of intracranial hemorrhage probable, but not definitive◦ Definite intracranial hemorrhage

Copyright © Stanford University 2018Banerjee, Imon, Sriraman Madhavan, Roger Eric Goldman, and Daniel L. Rubin, AMIA Annual Symposium Proceedings, vol. 2017, p. 411. American Medical Informatics Association, 2017.

Page 21: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Comparative performance1. Out-of-box word2vec – without semantic

mapping2. Proposed model - with semantic mapping

21

Out-of-box word2vec Proposed model

Classifier Precision Recall F1-score Precision Recall F1-score

Random Forest 87.59% 89.17% 87.78% 88.64% 90.42% 89.08%

KNN (n = 10) 86.73% 88.90% 87.47% 88.60% 89.91% 88.88%

KNN (n = 5) 87.52% 88.65% 87.74% 88.54% 89.62% 88.76%

SVM (Radial kernel) 63.98% 79.96% 71.07% 64.19% 80.09% 71.25%

SVM (Polynomial kernel) 62.40% 78.97% 69.70% 63.25% 79.49% 70.43%

Copyright © Stanford University 2018

Page 22: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Example 2: Chest CT Task: Label pulmonary embolism based on

radiology report Dataset: ◦ 100k+ de-identified chest CT reports (Stanford and

UPMC) Baseline comparison:◦ Compare to published state-of-the-art rule-based

method for PE extraction (PeFinder) Classification labels:◦ PE acute (positive)◦ PE present (positive)◦ PE subsegmental only (negative)

Copyright © Stanford University 2018

Banerjee, Imon, Matthew C. Chen, Matthew P. Lungren, and Daniel L. Rubin. "Radiology report annotation using intelligent word embeddings: Applied to multi-institutional chest CT cohort." Journal of biomedical informatics 77 (2018): 11-20.

Page 23: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

ROC curve measures

Stanford dataset UPMC dataset

Copyright © Stanford University 2018

Page 24: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Example 3: Mammography Task: Label BI-RADS final assessment

category based on findings of radiology report

Dataset: ◦ 300K mammography reports

Baseline comparison:◦ Published rule-based information extraction

method (J Biomed Inform 62:224-31, 2016) Classification labels:◦ BI-RADS Class 0 - 6

Copyright © Stanford University 2018

Page 25: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Results: Comparison with a Rule-based method

*Rule-based system: J Biomed Inform. 62:224-31, 2016

Copyright © Stanford University 2018

Page 26: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Challenges to scaling up image annotation for deep learning Varying data/file formats for saving image

annotations Difficulty leveraging free text radiology

reports as a source for labels for images Hurdles to sharing data across institutions

to build more robust AI models

Page 27: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Centralized approach to AI model development

AI Model

Legal issuesIntellectual Property

Copyright © Stanford University 2018

Page 28: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

P(Data|coefficients);Update parameters

P(Data|coefficients);Update parameters

P(Data|coefficients);Update parameters

Big Data aggregation without data sharing

Initiating site

Site 1

No data sharing required

Site 2

Site 3

Fit model with input parameters; return coefficientsIterate…

Courtesy Phil LavoriCopyright © Stanford University 2018

Page 29: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

A B

C D

Centrally hosted

J Am Med Inform Assoc 25(8):945-954, 2018

Ensemble single institution

Alternative models for training distributed deep learning models

Single weight transfer Cyclical weight transfer

Page 30: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Centrally hosted dataN = 6000 patients

A B

Cyclical weight transfer has similar performance to centrally-hosted training

Random classification

Accuracy increases with number of collaborating institutions

Results based on having 4 institutions

J Am Med Inform Assoc 25(8):945-954, 2018

Page 31: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

SummaryThree challenges to scaling up image annotation for deep learning◦ Varying data/file formats for saving image

annotations Image annotation standards (AIM) and tools (ePAD)

◦ Difficulty leveraging free text radiology reports as a source for labels for images Word embeddings and classification models for

information extraction◦ Hurdles to sharing data across institutions to

build more robust AI models Distributed computation of deep learning models

Page 32: SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: … · SCALING UP IMAGE ANNOTATION FOR DEEP LEARNING: STANDARDS, LABELS FROM TEXT, AND LEVERAGING MULTI- INSTITUTIONAL DATA Daniel L

Thank you.

Contact info:[email protected]