visual exploration of temporal data in electronic medical ... · visual exploration of temporal...

1
Visual Exploration of Temporal Data in Electronic Medical Records Josua Krause, Narges Razavian, PhD, Enrico Bertini, PhD, David Sontag, PhD New York University, New York, NY Electronic medical records (EMRs) and administrative data contain a large amount of distinct events, like diagnoses, laboratory tests, etc., making it difficult to “tell” the story of a patient. We propose patient-viz to address this issue by using a visual representation of the data. This allows us to provide the large number of distinct event types and additional information like costs and hospital stays in a manageable form. Using both an anonymized public and an unaltered private dataset we explore the usefulness of our tool. Abstract Longitudinal studies and insurance claims data generate a large amount of electronic medical records (EMRs) and administrative data. This data contains a large number of distinct events throughout the observed time window of the life of patients. Understanding, interpreting, and finding relations in those records is a challenging task that is hard to achieve using a tabular or similar representation. There- fore, visualization is needed to e.g. better understand predictive models built on top of the data, impact of comorbidities, progression of chronic diseases, or contributors of health care costs. Our proposed tool patient-viz has a visually rich design aimed to make the huge amount of administrative data manageable for data scientists and medical doctors. The goal of the tool is to provide a quick overview of one patient which can be further explored to inspect detailed information. The input can be any temporal event data with a large number of differently typed events. We test our tool with online accessible semi-synthetic data provided by CMS [1] and privately collected data from a major US insurance company. Introduction Josua Krause ( [email protected] ) Narges Razavian ( [email protected] ) Enrico Bertini ( [email protected] ) David Sontag ( [email protected] ) Website: https://github.com/nyuvis/patient-viz Contact Information 1. CMS medicare claims synthetic public use files. www.cms.gov/Research-Statistics-Data-and- Systems/Downloadable-Public-Use-Files/SynPUFs/, 2008–2010. [Online; accessed 16-July-2015]. 2. HCUP. Clinical Classifications Software (CCS) for ICD-9-CM. www.hcup-us.ahrq.gov/toolssoftware/ ccs/ccsfactsheet.jsp, 2012. [Online; accessed 16-July-2015]. 3. C. Plaisant, B. Milash, A. Rose, S. Widoff, and B. Shneiderman. Lifelines: Visualizing personal histories. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI ’96, pages 221–227, New York, NY, USA, 1996. ACM. 4. D. Klimov, Y. Shahar, and M. Taieb-Maimon. Intelligent visualization and exploration of time- oriented data of multiple patients. Artificial Intelligence in Medicine, 49(1):11 – 31, 2010. References Vertical selection can be used to see all events of a particular day. Here a set of lab-tests was performed. Out-of-range values are depicted in red. Steep ascends in the timeline caused by many new event types are an indicator for an incident happening in the live of a patient (left). The CMS provided data (right) merges three patients for anonymization purposes which mitigates the impact of incidents letting steep ascends mostly disappear. A user can use a lens to get information about interesting events. The horizontal granularity of single events is by day and the vertical position is determined by the first occurrence of the particular event in the history of the patient. The tool proved to be helpful for analyzing the output of machine learning algorithms in predictive healthcare analysis. It provides a quick way to identify the problems of an automatically generated model by looking at patients that were classified wrong or turn out to be outliers in the cohort. Additionally, the tool enables clinicians to quickly “tell the story” of the patient using just the administrative data often found in insurance claims. As future work we plan to work more closely with physicians and investigate how we can improve our tool to integrate more with their tasks. Conclusions CCS [2] hierarchy of event types General patient information Selected events Costs per day Time First occurrence One event (eg. Diagnosis 593 on June 5th) Events of same type (eg. Diagnosis 250) Hospital stays Our design is similar to LifeLines [3] with additional information shown in overlays and using a smaller vertical footprint which allows for a larger number of distinct types to be shown at the same time. Another notable similar work is the tool VISITORS [4] which requires even more vertical real estate per type. Every event is represented as rectangle whose color indicates its type, ie. a diagnosis (green), a performed procedure (orange), a laboratory test result (blue), a prescribed medication (purple), a physician (pink), or a hospital (brown). Visual Design

Upload: others

Post on 23-Mar-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Visual Exploration of Temporal Data in Electronic Medical ... · Visual Exploration of Temporal Data in Electronic Medical Records Josua Krause, Narges Razavian, PhD, Enrico Bertini,

VisualExplorationofTemporalDatainElectronicMedicalRecords

JosuaKrause,NargesRazavian,PhD,EnricoBertini,PhD,DavidSontag,PhD

NewYorkUniversity,NewYork,NY

Electronic medical records (EMRs) andadministrativedatacontaina largeamountofdistinct events, like diagnoses, laboratorytests,etc.,makingitdifficultto“tell”thestoryofapatient.Weproposepatient-viztoaddressthis issue by using a visual representation ofthe data. This allows us to provide the largenumberofdistincteventtypesandadditionalinformation like costs and hospital stays in amanageable form.Usingbothananonymizedpublic and an unaltered private dataset weexploretheusefulnessofourtool.

Abstract

Longitudinalstudiesandinsuranceclaimsdatageneratealargeamountofelectronicmedicalrecords (EMRs) and administrative data. Thisdatacontainsalargenumberofdistincteventsthroughout theobservedtimewindowof thelife of patients. Understanding, interpreting,and finding relations in those records is achallengingtaskthatishardtoachieveusingatabularorsimilarrepresentation.There-fore, visualization is needed to e.g. betterunderstand predictivemodels built on top ofthedata,impactofcomorbidities,progressionof chronic diseases, or contributors of healthcarecosts.Our proposed tool patient-viz has a visuallyrichdesignaimedtomakethehugeamountofadministrative data manageable for datascientistsandmedicaldoctors.Thegoalofthetool is to provide a quick overview of onepatient which can be further explored toinspectdetailedinformation.Theinputcanbeanytemporaleventdatawitha largenumberof differently typed events. We test our toolwith online accessible semi-synthetic dataprovided by CMS [1] and privately collecteddatafromamajorUSinsurancecompany.

Introduction

JosuaKrause([email protected]) NargesRazavian([email protected]) EnricoBertini([email protected])DavidSontag([email protected]) Website:https://github.com/nyuvis/patient-viz

ContactInformation 1. CMSmedicareclaimssyntheticpublicusefiles.www.cms.gov/Research-Statistics-Data-and-Systems/Downloadable-Public-Use-Files/SynPUFs/,2008–2010.[Online;accessed16-July-2015].

2. HCUP.ClinicalClassificationsSoftware(CCS)forICD-9-CM.www.hcup-us.ahrq.gov/toolssoftware/ccs/ccsfactsheet.jsp,2012.[Online;accessed16-July-2015].

3. C.Plaisant,B.Milash,A.Rose,S.Widoff,andB.Shneiderman.Lifelines:Visualizingpersonalhistories.InProceedingsoftheSIGCHIConferenceonHumanFactorsinComputingSystems,CHI’96,pages221–227,NewYork,NY,USA,1996.ACM.

4. D.Klimov,Y.Shahar,andM.Taieb-Maimon.Intelligentvisualizationandexplorationoftime-orienteddataofmultiplepatients.ArtificialIntelligenceinMedicine,49(1):11–31,2010.

References

Verticalselectioncanbeusedtoseealleventsofaparticular day. Here a set of lab-tests wasperformed.Out-of-rangevaluesaredepictedinred.

Steepascendsinthetimelinecausedbymanyneweventtypesareanindicatorforanincidenthappeningintheliveofapatient(left).TheCMSprovideddata(right)mergesthreepatientsforanonymizationpurposeswhichmitigatestheimpactofincidentslettingsteepascendsmostlydisappear.

A user can use a lens to get information aboutinteresting events. The horizontal granularity ofsingle events is by day and the vertical position isdeterminedbythefirstoccurrenceoftheparticulareventinthehistoryofthepatient.

Thetoolprovedtobehelpfulforanalyzingtheoutputofmachine learning algorithms in predictive healthcareanalysis.Itprovidesaquickwaytoidentifytheproblemsof an automatically generated model by looking atpatients that were classified wrong or turn out to beoutliers in the cohort. Additionally, the tool enablesclinicians to quickly “tell the story” of the patient usingjust the administrative data often found in insuranceclaims.Asfutureworkweplantoworkmorecloselywithphysiciansand investigatehowwecan improveour tooltointegratemorewiththeirtasks.

Conclusions

CCS[2]hierarchyofeventtypes Generalpatientinformation Selectedevents

Costsperday

Time

Firstoccurrence

Oneevent(eg.Diagnosis593onJune5th)

Eventsofsametype(eg.Diagnosis250)

Hospitalstays

Our design is similar to LifeLines [3] withadditional information shown in overlays andusingasmallerverticalfootprintwhichallowsforalargernumberofdistincttypestobeshownatthe same time. Another notable similar work isthe tool VISITORS [4]which requires evenmoreverticalrealestatepertype.Every event is represented as rectangle whosecolor indicates its type, ie.adiagnosis (green),aperformedprocedure (orange), a laboratory testresult (blue),aprescribedmedication(purple),aphysician(pink),orahospital(brown).

VisualDesign