can natural language processing fulfill the promise of electronic medical records?

7
Accepted Manuscript Can Natural Language Processing Fulfill the Promise of Electronic Medical Records? Paul A. Heidenreich, MD, MS PII: S1071-9164(14)00188-2 DOI: 10.1016/j.cardfail.2014.04.020 Reference: YJCAF 3296 To appear in: Journal of Cardiac Failure Received Date: 28 April 2014 Accepted Date: 28 April 2014 Please cite this article as: Heidenreich PA, Can Natural Language Processing Fulfill the Promise of Electronic Medical Records?, Journal of Cardiac Failure (2014), doi: 10.1016/j.cardfail.2014.04.020. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Upload: paul-a

Post on 02-Jan-2017

212 views

Category:

Documents


0 download

TRANSCRIPT

Accepted Manuscript

Can Natural Language Processing Fulfill the Promise of Electronic Medical Records?

Paul A. Heidenreich, MD, MS

PII: S1071-9164(14)00188-2

DOI: 10.1016/j.cardfail.2014.04.020

Reference: YJCAF 3296

To appear in: Journal of Cardiac Failure

Received Date: 28 April 2014

Accepted Date: 28 April 2014

Please cite this article as: Heidenreich PA, Can Natural Language Processing Fulfill the Promise ofElectronic Medical Records?, Journal of Cardiac Failure (2014), doi: 10.1016/j.cardfail.2014.04.020.

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service toour customers we are providing this early version of the manuscript. The manuscript will undergocopyediting, typesetting, and review of the resulting proof before it is published in its final form. Pleasenote that during the production process errors may be discovered which could affect the content, and alllegal disclaimers that apply to the journal pertain.

MANUSCRIP

T

ACCEPTED

ACCEPTED MANUSCRIPT JCF Editorial: Heidenreich

1

Can Natural Language Processing Fulfill the Promise of Electronic Medical Records?

Paul A. Heidenreich MD, MS

Veterans Affairs Palo Alto Healthcare System, Palo Alto, CA, 94304

Paul Heidenreich MD, MS

VA Palo Alto HCS, 111C

3801 Miranda Avenue, Palo Alto, CA 94306

650-849-1205

Fax 650-852-3473

Email [email protected]

MANUSCRIP

T

ACCEPTED

ACCEPTED MANUSCRIPT JCF Editorial: Heidenreich

2

Electronic medical records (EMRs) are now common in US health care settings though their

implementation has not always been welcomed by providers. [1] The promised large

improvements in efficiency and quality of care have not occurred, and many have argued that

such benefits will only be realized when provider notes are structured using check boxes or

drop down menus. While such a method of documenting will allow easier analysis, the effort

needed to enter all relevant data in this way is substantial and unattractive to clinicians.

However, such electronic data are already needed to report quality metrics to payers and

accrediting organizations, and facilities have resorted to hiring abstracters to collect data from

chart review. The cost of data abstraction by hand is high and markedly limits the ability of

facilities to participate in quality improvement activities, registries, and research collaborations.

A solution to this growing need for electronic data that limits the burden on providers and

hospitals is the use of natural language processing (NLP).

The process of NLP starts with unstructured (raw) text and determines the presence and

meaning of certain words or word combinations. [2] Neighboring words are also examined to

determine if they negate the finding such as “no” in “no edema”, or falsely negate the finding

such as the “no” in “no change in edema”. The accuracy is determined using trained chart

reviewers and revisions are made until an acceptable sensitivity (known as recall in NLP

terminology) and specificity are reached. Given the potential for such iterations to lead to

“overfitting” the optimized NLP algorithm is then tested in an independent dataset.

Prior work has demonstrated that NLP can be used to extract heart failure related findings from

radiography reports with high accuracy. In one study ,[2] NLP was found to be significantly

more accurate than hospital staff coders using expert review as the gold standard.

MANUSCRIP

T

ACCEPTED

ACCEPTED MANUSCRIPT JCF Editorial: Heidenreich

3

Cardiomegaly was identified by NLP with a sensitivity of 96% and a specificity of 100%

compared to 91% (sensitivity) and 90% (specificity) for hospital staff coders. Pleural effusions

were also more reliably identified with NLP compared to hospital staff (sensitivity 98% vs 78%,

both with 100% specificity). The Veterans Affairs Health Care System has recently used NLP

to extract the left ventricular ejection fraction from echocardiography reports with a high degree

of accuracy. [3] The goal is to obtain LVEF values from all reports system wide, make these

available in fields within the medical record, and then use the data to monitor and improve

performance (e.g. identify potential candidates for beta-blockers or mineralocorticoid

antagonists).

The study by Steinhubl and colleagues in this issue of the Journal [4] shows the additional

ability of NLP to identify heart failure signs and symptoms, and demonstrates the potential for

early identification of patients with heart failure. The investigators used NLP to scan over

three years of provider’s notes for over 4000 incident heart failure cases to determine the time

from first mention of a sign or symptom to subsequent diagnosis of heart failure. By including

a control group without a diagnosis of heart failure they were able to determine the predictive

value of certain symptoms, signs and findings of heart failure. One provocative finding was

that paroxysmal nocturnal dyspnea was present in 41% of patients ultimately diagnosed with

heart failure and was noted almost two years before a heart failure diagnosis was made. One

can see many potential clinical uses for this technology in identifying potential patients with

heart failure. Simple lists of patients with possible heart failure can be provided to providers in

the patient centered medical home. More sophisticated decision support can prompt clinicians

to consider certain diagnoses, order tests (e.g. echocardiography), or initiate treatment.

MANUSCRIP

T

ACCEPTED

ACCEPTED MANUSCRIPT JCF Editorial: Heidenreich

4

For patients with established heart failure, NLP has the potential to allow us to complement the

current referral system of specialty care with one of population health where proactive

involvement in patient care can occur through assessment of existing data. NLP can occur in

real time providing a continuous oversight of a population to identify patients who may benefit

from additional treatment including life prolonging medications and devices.

Once candidate patients are identified, referrals to specialists can be suggested and/or

computerized decision support can be made available to the primary provider.

The potential of NLP to reduce the cost of research and registry participation is also significant.

One of the greatest barriers to participation in research and clinical registries is the personnel

needed to collect data. NLP can drop this cost dramatically making it feasible for all hospitals

to participate in data collection collaborations (e.g. registries). Identification of potential

research subjects will be much faster with NLP and the ability to easily extract baseline data

and outcomes will allow pragmatic trials for a large number of conditions.

Another area of potential benefit of NLP in heart failure is improved prediction of

decompensation. Identification of readmissions is of particular interest to US hospitals as they

face large financial penalties for high readmission rates. Unfortunately, available models for

predicting readmission using data available from existing electronic records perform poorly (c

statistics < 0.65). Studies suggest that additional data on social status can improve prediction

of admission [5] and NLP can be helpful in capturing this data that often appears only in text.

Unfortunately, there are still significant barriers to developing and using NLP. Large upfront

personnel costs are needed to produce NLP algorithms. Expert chart reviews are required to

MANUSCRIP

T

ACCEPTED

ACCEPTED MANUSCRIPT JCF Editorial: Heidenreich

5

create the gold standard against which NLP is tested and most facilities will not have the ability

to develop these. Not all symptoms, signs and diagnoses can be accurately captured from

existing notes and separate NLP algorithms will need to be developed and tested for each data

element. There is also a lack of standardization. It is usually difficult to adapt an NLP that

has been developed by an outside institution given the variety of software used and the

differences in electronic medical records. Finally, NLP is only as good as the provider

documentation on which it is based. Any omissions or inaccuracies in provider notes will be

incorporated into the NLP results.

In summary, the electronic medical record contains a wealth of medical information as

narrative text which is difficult to find and use for clinical care and research. Extracting this

information from the text document has traditionally required manual chart review which is time

consuming and expensive. The study by Steinhubl using NLP suggests how the promised

economic and health benefits of electronic medical records may yet be achieved.

Disclosures: None

References

1) Babbott S, Manwell LB, Brown R, Montague E, Williams E, Schwartz M, Hess E,Linzer

M. Electronic medical records and physician stress in primary care: results from the

MEMO Study. J Am Med Inform Assoc. 2014 Feb;21(e1):e100-6. doi: 10.1136/amiajnl-

2013-001875.

2) Friedlin J, McDonald CJ. A natural language processing system to extract and code

concepts relating to congestive heart failure from chest radiology reports. AMIA Annu

Symp Proc. 2006:269-73.

MANUSCRIP

T

ACCEPTED

ACCEPTED MANUSCRIPT JCF Editorial: Heidenreich

6

3) Garvin JH, DuVall SL, South BR, Bray BE, Bolton D, Heavirland J, Pickard S,

Heidenreich P, Shen S, Weir C, Samore M, Goldstein MK. Automated extraction of

ejection fraction for quality measurement using regular expressions in Unstructured

Information Management Architecture (UIMA) for heart failure. J Am Med Inform Assoc.

2012 Sep-Oct;19(5):859-66.

4) CURRENT STUDY: Steinhubl Prevalence of Heart Failure Signs and Symptoms in a

Large Primary Care Population Identified Through the Use of Text and Data Mining of

the Electronic Health Record

5) Calvillo-King L, Arnold D, Eubank KJ, Lo M, Yunyongying P, Stieglitz H, Halm EA.

Impact of social factors on risk of readmission or mortality in pneumonia and heart

failure: systematic review. J Gen Intern Med. 2013 Feb;28(2):269-82.