arturo gonzalez izquierdo - university college londondr arturo gonzález-izquierdo ucl institute of...

12
UCL Institute of Health Informatics UCL Cancer Domain - April 2018 UCL Cancer Domain AI Resources and Methodologies Use of Large Scale Electronic Health Records in Cancer Research Dr Arturo González-Izquierdo UCL Institute of Health Informatics

Upload: others

Post on 02-Aug-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

UCL Cancer Domain AI Resources and Methodologies

Use of Large Scale Electronic Health Records in Cancer Research

Dr Arturo González-Izquierdo

UCL Institute of Health Informatics

Page 2: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

The CALIBER research platform as a resource for cancer research with potential for machine

learning applications

Page 3: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

UCL Institute of Health Informatics

“We aim to conduct high quality research that leverages large scale databases and health informatics approaches to improve health at local, national and international levels”

Page 4: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

Scotland 5 University of Edinburgh, University of Aberdeen, University of Dundee, University of Glasgow, University of St Andrews, University of Strathclyde 6

Midlands 4 University of Birmingham, University of Leicester, University of Nottingham, University of Warwick

Cambridge 3 Wellcome Sanger Institute, European Bioinformatics Institute, University of Cambridge

London 5 UCL, Imperial, King’s, LSHTM, QMUL

Oxford 1 University of Oxford

Wales/Northern Ireland 2 Swansea University, Queen’s University Belfast

1 mission 1 director 4 research priorities 6 sites 20 research organisations

Page 5: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

Electronic Health Records

Clinical Practice Research Datalink

National Cancer Registration and Analysis Service

Hospital Episode Statistics

Office for National Statistics

Page 6: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

Chronic cough Weight loss

Chronic obstructive airway disease Recurrent mild chest infection

Pneumonia

Lung Cancer Diagnosis

Linked Electronic Health Records

PRIMARY CARE

CANCER REGISTRATIONS

SECONDARY CARE

DEPRIVATION AND MORTALITY

Page 7: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

Linked Electronic Health Records

Chronic cough Weight loss

Chronic obstructive airway disease Recurrent mild chest infection

Pneumonia hospitalisation

Lung Cancer Diagnosis

Registration date Consultation date Consultation date Admission date Admission date Date of death

Date of birth Blood tests (routine) Sputum tests Diagnosis Diagnosis (cancer type, stage, metastases)

Underlying cause

Blood pressure Spirometry Diagnosis Additional diagnosis (deep vein thrombosis Pneumonia) Procedures (surgery)

Subsidiary causes

Weight Chest x-rays Procedures

Procedure for biopsy of lesion (bronchoscopy, Chest drain)

Treatment (chemotherapy, radiotherapy)

Height CT chest

Chest imaging (x-ray, CT, PET CT, chest drain insertion)

Physical activity Treatment (antibiotic, Inhalers)

Discharge date

Health history

(heart, diabetes,

stroke)

Smoking

Alcohol

Contraception

Immunisations

Discharge date

Page 8: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

The CALIBER Research Platform

Patient Population

Cohort identification methods

Deep phenotyping algorithms

Longitudinal clinical trajectories

Precise temporal allocation of Exposures and outcomes

Page 9: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

New cancer related research in CALIBER

• The CALIBER Research Platform – 5 Newly approved (or awaiting approval) cancer

specific projects • Disease epidemiology

– Characterisation and causal inference

– Incidence and progression of disease

– Diagnosis and risk of cancer-specific outcomes

• Healthcare utilisation

• Drug repurposing

Page 10: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

Challenges EHR’s observation window

Start End

Relevant clinical event

Exposure to factor of interest

Outcome of interest

Death

Page 11: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

Opportunities

• Recent willingness by data custodians to research health data using machine learning based methodologies

• Wide range of exploratory or hypothesis generation/test studies – Patient classification (Machine Learning sub-phenotyping) – Detailed healthcare utilisation patterns (multi-state trajectory

flows) – Sophisticated epidemiological/statistical methods

computationally feasible for causal inference – EHR based decision/early detection tools (automation)

Page 12: Arturo Gonzalez Izquierdo - University College LondonDr Arturo González-Izquierdo UCL Institute of Health Informatics UCL Institute of Health Informatics UCL Cancer Domain - April

UCL Institute of Health Informatics UCL Cancer Domain - April 2018

The Data Lab

Natalie Fitzpatrick Data Science Facilitator [email protected] CALIBER portal https://www.caliberresearch.org/portal Denaxas Lab http://denaxaslab.org/