current evidence and emerging methods regarding validation of the ahrq qis patrick s. romano, md mph...

48
Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research AHRQ 2012 Annual Conference

Upload: cuthbert-fisher

Post on 17-Dec-2015

220 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Current evidence and emerging methods regarding validation of the AHRQ QIs

Patrick S. Romano, MD MPHUC Davis Center for Healthcare Policy and Research

AHRQ 2012 Annual Conference

Page 2: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Outline and learning objectives

No disclosures (except support from AHRQ) To describe key domains of validity for health

care quality measures To illustrate these key domains of validity with

examples based on the AHRQ Quality Indicators

To summarize methods and opportunities for AHRQ QI users who may wish to be involved in validation activities

Page 3: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Inpatient QIsInpatient QIs

MortalityMortalityUtilizationUtilization

VolumeVolume

AHRQ Quality Indicators

Prevention QIsPrevention QIs

(Area Level)(Area Level)Avoidable HospitalizationsAvoidable HospitalizationsOther Avoidable ConditionsOther Avoidable Conditions

Patient Safety IndicatorsPatient Safety Indicators

ComplicationsComplicationsFailure-to-rescueFailure-to-rescueUnexpected deathUnexpected death

Pediatric / Pediatric / Neonatal Neonatal

QIsQIs

Page 4: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

ValidityValidity

The degree to which a measure The degree to which a measure accurately represents the true state of accurately represents the true state of the phenomenon of interest (i.e., “free of the phenomenon of interest (i.e., “free of systematic error”)systematic error”)

Does this measure what it purports to Does this measure what it purports to measure (quality)?measure (quality)?

Page 5: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Validation – A conceptual frameworkValidation – A conceptual framework Face validity is the degree to which a measure Face validity is the degree to which a measure

“appears” to measure the phenomenon of interest“appears” to measure the phenomenon of interest– Content validity is a related concept, focused on whether the Content validity is a related concept, focused on whether the

content of a measure adequately samples all relevant domains content of a measure adequately samples all relevant domains of the concept of interest (coverage)of the concept of interest (coverage)

Criterion (concurrent) validity is the degree to which Criterion (concurrent) validity is the degree to which a measure generates data that agree with data from a measure generates data that agree with data from a better (“gold standard”) approach.a better (“gold standard”) approach.

Predictive validity is the degree to which a measure Predictive validity is the degree to which a measure successfully predicts an outcome of interest. successfully predicts an outcome of interest.

Construct (convergent) validity is the degree to which Construct (convergent) validity is the degree to which a measure correlates with other measures, based on a measure correlates with other measures, based on a construct that is grounded in prior literature or a a construct that is grounded in prior literature or a sound conceptual frameworksound conceptual framework

Page 6: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Face validity: perspectivesFace validity: perspectives

DevelopersDevelopers Expert panelsExpert panels Users and stakeholdersUsers and stakeholders

Page 7: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Face validity: NQFFace validity: NQF Health outcomes: “A rationale supports the relationship of Health outcomes: “A rationale supports the relationship of

the health outcome to at least one healthcare structure, the health outcome to at least one healthcare structure, process, intervention, or service.”process, intervention, or service.”

Intermediate outcomes: “Quantity, quality, and consistency Intermediate outcomes: “Quantity, quality, and consistency of… evidence that the measured intermediate clinical of… evidence that the measured intermediate clinical outcome leads to a desired health outcome.”outcome leads to a desired health outcome.”

Processes or structures: “Quantity, quality, and consistency Processes or structures: “Quantity, quality, and consistency of… evidence that the measured healthcare process leads of… evidence that the measured healthcare process leads to desired health outcomes in the target population with to desired health outcomes in the target population with benefits that outweigh harms to patients.”benefits that outweigh harms to patients.”

Patient experience: “Evidence that the measured aspects Patient experience: “Evidence that the measured aspects of care are those valued by patients and for which the of care are those valued by patients and for which the patient is the best and/or only source (OR evidence that patient is the best and/or only source (OR evidence that patient experience… is correlated with desired outcomes”patient experience… is correlated with desired outcomes”

Page 8: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Face validity: NQFFace validity: NQF ““The The measure specifications are consistent with the are consistent with the

evidence presented to support the focus of evidence presented to support the focus of measurement… The measure is specified to capture measurement… The measure is specified to capture the most inclusive target population indicated by the the most inclusive target population indicated by the evidence, and exclusions are supported by the evidence, and exclusions are supported by the evidence.”evidence.”

““Measure specifications include the target population include the target population (denominator) to whom the measure applies, (denominator) to whom the measure applies, identification of those from the target population who identification of those from the target population who achieved the specific measure focus…, measurement achieved the specific measure focus…, measurement time window, exclusions, risk adjustment/stratification, time window, exclusions, risk adjustment/stratification, definitions, data source, code lists with descriptors, definitions, data source, code lists with descriptors, sampling, scoring/computation.”sampling, scoring/computation.”

Page 9: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

AHRQ expert panel process

Intended to establish consensual (face) validity Modified RAND/UCLA Appropriateness Method Physicians of various specialties/subspecialties, nurses,

other professionals (e.g., midwife, pharmacist) Potential PSIs were rated by 8 multispecialty panels;

surgical PSIs were also rated by 3 surgical panels All panelists rated all assigned indicators (1-9) on:

– Overall usefulness– Likelihood of being preventable – Likelihood of being due to medical error– Likelihood of being clearly charted in the medical record– Susceptibility to bias due to case mix

Page 10: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

AHRQ expert panel process

Pre-conference ratings and comments Individual ratings returned to panelists with

distribution of ratings and other panelists’ comments Telephone conference call(s) focusing on high-

variability items and panelists’ suggestions Suggestions adopted only by consensus Post-conference ratings and comments Exclude indicators rated “Unclear,” “Unclear-,” or

“Unacceptable”: – Median score <7, OR– At least 2 panelists rated the indicator in each of the extreme 3-

point ranges

Page 11: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Potential PSIs not adoptedOnly 18 “accepted” from original list of 48

“Experimental” PSIs “Rejected” PSIsAspiration pneumoniaCABG after PTCADecubitus ulcer in high risk patientsIn-hospital fractures possibly related to fallsIntraoperative nerve compression injuriesMalignant hyperthermiaPostoperative acute myocardial infarctionPostoperative iatrogenic complications – cardiac system

Postoperative iatrogenic complications – nervous system

Reopening of surgical siteSuture of lacerationObstetric wound complications- cesareanObstetric wound complications- vaginalOther obstetric complicationsPostpartum urinary tract infectionUterine rupture

Dosage complications Iatrogenic hypotension

Intestinal infection due to C. difficile

Postop iatrogenic complications – digestive complications

Postop iatrogenic complications – respiratory complications

Postop iatrogenic complications – urinary complications

Postop iatrogenic complications – vascular complications

Postoperative pneumonia Unexpected LOS/Conditional LOS

Obstetric thrombosis or embolism Puerperal infection

Page 12: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Face validity varies by proposed purposeFace validity varies by proposed purpose

Page 13: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Face validity varies by methodFace validity varies by method

Page 14: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Criterion validity: perspectivesCriterion validity: perspectives

Is the purported “gold standard” really a Is the purported “gold standard” really a gold standard? How do we know?gold standard? How do we know?

Administrative data perspective versus Administrative data perspective versus registry data perspectiveregistry data perspective

Coding perspective versus clinical Coding perspective versus clinical perspective (whose truth)perspective (whose truth)

Validity can change… dramaticallyValidity can change… dramatically Finding false positives is easy, but what Finding false positives is easy, but what

about false negatives?about false negatives?

Page 15: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Criterion validity: NQFCriterion validity: NQF

““Empirical evidence of validity of Empirical evidence of validity of BOTH BOTH data elements AND measure score data elements AND measure score within acceptable norms; ANDwithin acceptable norms; AND

Identified threats to validity (lack of risk Identified threats to validity (lack of risk adjustment/stratification, multiple data adjustment/stratification, multiple data types/methods, systematic missing or types/methods, systematic missing or “incorrect” data) are empirically assessed “incorrect” data) are empirically assessed and adequately addressed so that results and adequately addressed so that results are not biased.”are not biased.”

Could be satisifed by either criterion or Could be satisifed by either criterion or construct testingconstruct testing

Page 16: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Criterion validity:Present on admission coding vs. chart review

PDI

Percentage not POA (%)

NACHRI Mich CA NY Mayo

PSI 1: Complications of Anesthesia 100 100 100 94

PSI 3: Decubitus Ulcer 60 42 11 14 18

PSI 5: Foreign Body Left During Proc 80 80 64 76 54

PSI 6: Iatrogenic Pneumothorax 89 100 73 65 78

PSI 7: Infection Due To Medical Care 57 36 65 65 60

PSI 8: Postop Hip Fracture 0 21 26 22

PSI 9: Postop Hemorrhage or Hematoma 97 100 79 71 87

PSI 10: Postop Physiologic or Metabolic 91 77 64 46

PSI 11: Postop Respiratory Failure 83 100 94 93 74

PSI 12: Postop DVT or PE 67 46 43 40

PSI 13: Postoperative Sepsis 60 60 73 70 76

PSI 14: Postop Wound Dehiscence 90 100

PSI 15: Accidental Puncture/Laceration 93 84 87 87 85

PSI 16: Transfusion Reaction 71 N/A 58 78 50

Page 17: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Criterion validity of PSIs linked to NSQIPRomano PS, et al. HSR 2009; 44(1):182

Cima RR, et al. Surg 2011; 150:943Koch CG, et al. J Am Coll Surg 2012

IndicatorSensitivity PPV LR+

V2 V3/V4 V2 V3/V4

Postoperative sepsis 37% (VA)5-10% (C/M)

45% (VA)19-44% (C/M)

131

Postoperative thromboembolism

56%58-72% (C/M)

22%42-53% (C/M)

65

Postoperative respiratory failure

63%21-22% (C/M)

68%42-61% (C/M)

147

Postop physiologic/ metabolic derangement

48%12% (M)

63%89% (M)

744

Postop abdominopelvic wound dehiscence

29%22% (M)

72%47% (M)

160

VA=Veterans Affairs; C/M=Cleveland Clinic/Mayo Clinic RochesterSensitivity = TP/(TP+FN) – are all the real cases captured?PPV = TP/(TP+FP) – are all the flagged cases real?LR = Sensitivity/(100-Specificity) – how many times more likely is the event?

Page 18: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Criterion validity of PSIs linked to NSQIPRomano PS, et al. HSR 2009; 44(1):182

Cima RR, et al. Surg 2011; 150:943Koch CG, et al. J Am Coll Surg 2012

Indicator AHRQ NSQIP Registry

Postoperative sepsis Sepsis or septicemia diagnosed by MD (excluding pts w infection)

SIRS w “definitive” evidence of any infection

Postoperative thromboembolism

DVT or PE diagnosed by MD (implied treatment)

DVT or PE treated by MD (PE requires imaging )

Postoperative respiratory failure

Diagnosed by MD or unplanned reintub >0 daysor postop vent >1 day

Postop vent >48 hrs or unplanned reintub (any)

Postop physiologic/ metabolic derangement

Diagnosed by MD and new onset of dialysis (or DKA or hyperosmolar)

New onset of dialysis or ultrafiltration

Postop abdominopelvic wound dehiscence

Surgery to repair postop wound disruption

Postop wound separation w fascial disruption

Page 19: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

PPV of the PSIs based on chart PPV of the PSIs based on chart review by nurse abstractorsreview by nurse abstractors

IndicatorIndicator VAVA AHRQAHRQ UHCUHC%PPV (95%CI)%PPV (95%CI) %PPV (95%CI) %PPV (95%CI) %PPV (95%CI)%PPV (95%CI)

Pressure Ulcer Pressure Ulcer 30 (22-40) 30 (22-40) —— 32 (30-35)32 (30-35)

Foreign Body Left InForeign Body Left In 46 (36-55)46 (36-55) —— ——

Iatrogenic Pneumothorax Iatrogenic Pneumothorax 73 (64-81) 73 (64-81) 78 (73-82) 78 (73-82) ——

CVC-related Bloodstream Infection CVC-related Bloodstream Infection 38 (29-47) 38 (29-47) 61 (51-71) 61 (51-71) ——

Postop Hip Fracture Postop Hip Fracture 28 (15-43)28 (15-43) — — ——

Postop Hemorrhage/Hematoma Postop Hemorrhage/Hematoma 75 (66-83)75 (66-83) 78 (62-95)78 (62-95) ——

Postop Phys/Met Derangement Postop Phys/Met Derangement 63 (54-72)63 (54-72) 85 (78-92)85 (78-92) ——

Postop Respiratory Failure Postop Respiratory Failure 67 (57-76)67 (57-76) —— 83 (77-89)83 (77-89)

Postop PE or DVT Postop PE or DVT 43 (34-53)43 (34-53) 47 (42-52) 47 (42-52) 44 (37-51)44 (37-51)

Postop Sepsis Postop Sepsis 53 (42-64)53 (42-64) 41 (28-54) 41 (28-54) ——

Postop Wound Dehiscence Postop Wound Dehiscence 87 (79-92)87 (79-92) —— ——

Accidental Puncture or Laceration Accidental Puncture or Laceration 85 (77-91)85 (77-91) 91 (86-94)91 (86-94) ——

Rosen, Rosen, Med CareMed Care, 2012; Sadeghi, , 2012; Sadeghi, Am J Med QualAm J Med Qual, 2010; Zrelak, , 2010; Zrelak, J Healthc QualJ Healthc Qual, 2011;, 2011;White, White, Med CareMed Care, 2009; Utter, , 2009; Utter, Ann SurgAnn Surg, 2009; Utter, , 2009; Utter, J Am Coll SurgJ Am Coll Surg, 2010, 2010

Page 20: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Limitations of chart abstraction for Limitations of chart abstraction for criterion validationcriterion validation

Information needed to verify complication Information needed to verify complication may not be available via chart reviewmay not be available via chart review– Complication was not properly evaluated or Complication was not properly evaluated or

described by physicians; vicarious processdescribed by physicians; vicarious process– Absence of evidence vs. evidence of absenceAbsence of evidence vs. evidence of absence

Time constraints limit abstractor’s ability to Time constraints limit abstractor’s ability to assess some aspects of care (e.g., urinary assess some aspects of care (e.g., urinary catheter), may lead to premature terminationcatheter), may lead to premature termination

Much cheaper to look for FPs than FNsMuch cheaper to look for FPs than FNs Inter-hospital variation in physician Inter-hospital variation in physician

documentation and nurse abstractiondocumentation and nurse abstraction Volunteer samples (except VA) Volunteer samples (except VA)

Page 21: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Comparing coding vs. clinical Comparing coding vs. clinical perspective for Postop DVT/PEperspective for Postop DVT/PE

UHC Cohort (n=450) Coding Clinical Sensitivity 80% (46-100%) 100%

Specificity 99.5% (99.3-99.6%) 98.6% (98.6-99.2%)

Positive Predictive Value 72% (67-79%) 44% (36-52%)

Negative Predictive Value 99.6% (98.9-100%) 100%

VA Cohort (n=112) Positive Predictive Value 43% (34-53%)

AHRQ Cohort (n=121) Positive Predictive Value 84% (72-95%) 48% (42-67%)University HealthSystem Consortium cohort includes 505 flagged, randomly sampled surgical cases from 33 volunteer hospitals in 21 states; 450 cases were fully abstracted and submitted to UHC.

Page 22: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

453.4Acute venous embolism and thrombosis of deep vessels of lower extremity453.40 Acute venous embolism and thrombosis of unspecified deep vessels of lower extremityDeep vein thrombosis NOS453.41 Acute venous embolism and thrombosis of deep vessels of proximal lower extremityFemoral, Iliac, Popliteal, Thigh, Upper leg NOS453.42 Acute venous embolism and thrombosis of deep vessels of distal lower extremityCalf, Lower leg NOS, Peroneal, Tibial

453.5Chronic venous embolism and thrombosis of deep vessels of lower extremityExcludes: personal history of venous thrombosis and embolism (V12.51)

453.50 Chronic venous embolism and thrombosis of unspecified deep vessels of lower extremity453.51 Chronic venous embolism and thrombosis of deep vessels of proximal lower extremity453.52 Chronic venous embolism and thrombosis of deep vessels of distal lower extremity

453.6Venous embolism and thrombosis of superficial vessels of lower extremity453.7Chronic venous embolism and thrombosis of other specified vesselsExcludes: personal history of venous thrombosis and embolism (V12.51)

453.71 Chronic venous embolism and thrombosis of superficial veins of upper extremity453.72 Chronic venous embolism and thrombosis of deep veins of upper extremity453.73 Chronic venous embolism and thrombosis of upper extremity, unspecified453.74 Chronic venous embolism and thrombosis of axillary veins453.75 Chronic venous embolism and thrombosis of subclavian veins453.76 Chronic venous embolism and thrombosis of internal jugular veins453.77 Chronic venous embolism and thrombosis of other thoracic veins453.79 Chronic venous embolism and thrombosis of other specified veins

453.8Acute venous embolism and thrombosis of other specified veinsExcludes: cerebral, coronary, intracranial sinus, nonpyogenic, mesenteric, portal, precerebral, pulmonary

453.81 Acute venous embolism and thrombosis of superficial veins of upper extremity453.82 Acute venous embolism and thrombosis of deep veins of upper extremity453.83 Acute venous embolism and thrombosis of upper extremity, unspecified453.84 Acute venous embolism and thrombosis of axillary veins453.85 Acute venous embolism and thrombosis of subclavian veins453.86 Acute venous embolism and thrombosis of internal jugular veins453.87 Acute venous embolism and thrombosis of other thoracic veins453.89 Acute venous embolism and thrombosis of other specified veins

453.9Of unspecified site (embolism of vein, thrombosis (vein))

Page 23: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Methods to reassess Methods to reassess criterion validity of PSI 12criterion validity of PSI 12

Two parallel studies were conducted to update previous Two parallel studies were conducted to update previous PPV estimates for PSI 12 and to identify actionable PPV estimates for PSI 12 and to identify actionable opportunities to improve care:opportunities to improve care:– 7 volunteer hospitals recruited through AHRQ QI listserve, 7 volunteer hospitals recruited through AHRQ QI listserve,

including flagged cases onlyincluding flagged cases only– 15 academic health systems recruited through UHC, including 15 academic health systems recruited through UHC, including

both flagged and unflagged cases with TKA surgeryboth flagged and unflagged cases with TKA surgery

AHRQ PSI 12 Version 4.1 software was applied to AHRQ PSI 12 Version 4.1 software was applied to eligible cases from participating hospitals, using eligible cases from participating hospitals, using “present on admission” (POA) flags.“present on admission” (POA) flags.– Hospital’s own data (AHRQ) or Clinical Database (UHC)Hospital’s own data (AHRQ) or Clinical Database (UHC)

Page 24: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Methods to reassess Methods to reassess criterion validity of PSI 12criterion validity of PSI 12

Flagged cases were reviewed by trained QI nurses at each Flagged cases were reviewed by trained QI nurses at each hospital, using detailed chart abstraction tool and hospital, using detailed chart abstraction tool and guidelines.guidelines.

Detailed review of discrepant cases to identify possible Detailed review of discrepant cases to identify possible reasons for the discrepancy:reasons for the discrepancy:– Present on admission (note one hospital did not apply Present on admission (note one hospital did not apply

denominator exclusions)denominator exclusions)– Location of thrombosis (upper extremity and superficial Location of thrombosis (upper extremity and superficial

thromboses are clinical FPs)thromboses are clinical FPs)– Chronic vs. acute embolism (based on radiographic criteria)Chronic vs. acute embolism (based on radiographic criteria)

Records from volunteer hospitals in AHRQ study were Records from volunteer hospitals in AHRQ study were sampled in sequential reverse order from 6/30/2010 back sampled in sequential reverse order from 6/30/2010 back to 10/1/2009, up to N=30to 10/1/2009, up to N=30

Page 25: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Findings: 7 volunteer hospitalsFindings: 7 volunteer hospitals

From a total of 171 audited charts, 15 From a total of 171 audited charts, 15 cases were excluded post hoc (because cases were excluded post hoc (because hospital did not properly apply POA)hospital did not properly apply POA)

30 cases were False Positive:30 cases were False Positive:– 15 cases were POA15 cases were POA– 8 cases with upper extremity VT8 cases with upper extremity VT– 1 case with SVC (central VT)1 case with SVC (central VT)– 3 cases with superficial VT3 cases with superficial VT– 3 cases were chronic3 cases were chronic

Overall PPV = 81%Overall PPV = 81%

Page 26: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Findings: AMCs with TKA patientsPSI-12 flagged cases

126 VTE flaggedby PSI 12

(+4 Readmission)

125 casesTrue Positive postop lower ext DVT or PE

1 case clinicalFalse Positive(superficial)

saphenous Vein

Positive Predictive Value

= TP / (TP + FP)

= 125 / (125 + 1)

= 0.992

Chart Abstraction

Page 27: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Findings: AMCs with TKA patientsPSI-12 unflagged controls

463Not flagged

as VTE by PSI 12

5 cases had VTE perUHC abstract

458 cases had no VTE

(TN)

3 casesFalse Negative

2 casessuperficial or

upper extremitythromboses

Negative Predictive Value

= TN / (FN + TN)

= 458 / (458+3) = 0.993Chart Abstraction

Page 28: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Estimating sensitivity: Estimating sensitivity: Looking for a needle in a haystackLooking for a needle in a haystack

Retrospective cross-sectional studyRetrospective cross-sectional study– 27 hospitals from 11 states27 hospitals from 11 states– 2006-2009 PSI-negative hospitalizations2006-2009 PSI-negative hospitalizations

Stratified sampleStratified sample– By hospital, risk of being falsely negativeBy hospital, risk of being falsely negative– Oversampled cases at riskOversampled cases at risk

Medical records abstracted locallyMedical records abstracted locally– Trained staff, standard instrumentTrained staff, standard instrument

Analysis with survey statistical methodsAnalysis with survey statistical methods– ““Verification-biased sampling” approach, used Verification-biased sampling” approach, used

model-based weights in analysismodel-based weights in analysis– Incorporated previous estimates of PPVIncorporated previous estimates of PPV

Page 29: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

IndicatorIndicator

Foreign Body Left InForeign Body Left In XX XX

Iatrogenic PneumothoraxIatrogenic Pneumothorax XX XX

Central Venous Catheter InfectionCentral Venous Catheter Infection XX XX

Postop Hemorrhage/HematomaPostop Hemorrhage/Hematoma XX XX

Postop Phys/Met DerangementPostop Phys/Met Derangement XX XX XX XX

Postop Wound DehiscencePostop Wound Dehiscence XX XX XX

Accidental Puncture or LacerationAccidental Puncture or Laceration XX XX XX

Suspicious records:Suspicious records:Could they be false negatives?Could they be false negatives?

Pot

entia

lly m

isco

ded

Pot

entia

lly m

isco

ded

diag

nosi

s

diag

nosi

s

Alte

rnat

ive

diag

nosi

s

Alte

rnat

ive

diag

nosi

s

C

orre

ctiv

e pr

oced

ure

Cor

rect

ive

proc

edur

e

P

art o

f com

posi

te d

ef

Par

t of c

ompo

site

def

Hig

h ri

sk c

ase

Hig

h ri

sk c

ase

Page 30: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Sampling scheme for Sampling scheme for estimating PSI sensitivityestimating PSI sensitivity

Indicator Risk level Sampling Abstracted frame recordsn % n %

Foreign Body Left In Low 664,956 99.91 295 0.04High 589 0.09 21 3.56

Iatrogenic Pneumothorax Low 535,648 99.92 269 0.01High 425 0.08 11 2.59

Central Venous Catheter Infection Low 453,138 99.29 197 0.04High 3,250 0.71 27 0.83

Postop Hemorrhage/Hematoma Low 155,398 97.52 167 0.11High 3,956 2.48 116 2.93

Postop Phys/Met Derangement Low 70,953 98.08 122 0.17High 1,389 1.92 109 7.85

Postop Wound Dehiscence Low 26,123 98.86 91 0.35High 302 1.14 18 5.96

Accidental Puncture or Laceration Low 553,234 98.36 227 0.04High 9,197 1.64 70 0.76

Page 31: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Estimated PSI sensitivityEstimated PSI sensitivityIndicator Abstracted False Sensitivity

records negativerecords

HighN All risk % (95% CI)

Foreign Body Left In 316 0 — 100 (0-100)

Iatrogenic Pneumothorax 279 9 7 25 (8-58)

Central Venous Catheter Infection 223 3 2 11 (1-60)

Postop Hemorrhage/Hematoma 281 32 30 49 (26-72)

Postop Phys/Met Derangement 231 6 6 60 (13-94)

Postop Wound Dehiscence 107 4 4 75 (17-98)

Accidental Puncture or Laceration 296 6 5 94 (75-99)

Page 32: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Postop Hemorrhage/HematomaPostop Hemorrhage/Hematoma

IndicatorIndicator SensitivitySensitivity% (95% CI)% (95% CI)

PPVPPV% (95% CI)% (95% CI)

Current PSI 9 definitionCurrent PSI 9 definition 49 (26-72)49 (26-72) 78 (59-90)78 (59-90)

PSI 9 definitionPSI 9 definition+ codes for treatment of PHH+ codes for treatment of PHH

71 (48-87)71 (48-87) 76 (58-88)76 (58-88)

PSI 9 definitionPSI 9 definition+ codes for treatment of PHH+ codes for treatment of PHH+ codes for eval of bleeding+ codes for eval of bleeding

87 (69-95) 87 (69-95) 77 (61-87) 77 (61-87)

PSI 9 definitionPSI 9 definition- requirement for procedure code- requirement for procedure code

94 (80-98)94 (80-98) 48 (30-66)48 (30-66)

Page 33: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Predictive validity: perspectivesPredictive validity: perspectives

Predicting what?Predicting what?– MortalityMortality– Length of stay, chargesLength of stay, charges– ReadmissionReadmission

Time window for prediction?Time window for prediction? Implied gold standardImplied gold standard

Page 34: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Predictive validity established: Impact of preventing a PSI on mortality, LOS, charges

NIS 2000 analysis by Zhan & Miller, JAMA 2003;290:1868-74

Indicator Δ Mort (%) Δ LOS (d) Δ Charge ($)

Postoperative septicemia 21.9 10.9 $57,700

Selected infections due to medical care 4.3 9.6 38,700

Postop abd/pelvic wound dehiscence 9.6 9.4 40,300

Postoperative respiratory failure 21.8 9.1 53,500

Postoperative physiologic or metabolic derangement

19.8 8.9 54,800

Postoperative thromboembolism 6.6 5.4 21,700

Postoperative hip fracture 4.5 5.2 13,400

Iatrogenic pneumothorax 7.0 4.4 17,300

Decubitus ulcer 7.2 4.0 10,800

Postoperative hemorrhage/hematoma 3.0 3.9 21,400

Accidental puncture or laceration 2.2 1.3 8,300

Excess mortality, LOS, and charges computed from mean values for PSI cases and matched controls.

Page 35: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Predictive validity established: Impact of preventing a PSI on mortality, LOS, charges

VA PTF 2001 analysis by Rivard et al., Med Care Res Rev; 65(1):67-87

Indicator Δ Mort (%) Δ LOS (d) Δ Cost ($)

Postoperative septicemia 30.2 18.8 $31,264

Selected infections due to medical care 2.7 9.5 13,816

Postop abd/pelvic wound dehiscence 11.7 11.7 18,905

Postoperative respiratory failure 24.2 8.6 39,745

Postoperative physiologic or metabolic derangement

Postoperative thromboembolism 6.1 5.5 7,205

Postoperative hip fracture

Iatrogenic pneumothorax 2.7 3.9 5,633

Decubitus ulcer 6.8 5.2 6,713

Postoperative hemorrhage/hematoma 5.1 3.9 7,863

Accidental puncture or laceration 3.2 1.4 3,359

Excess mortality, LOS, and charges computed from mean values for PSI cases and matched controls.

Page 36: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Predictive validity questionable based on NIS/VA Zhan & Miller, JAMA 2003;290:1868-74 Rosen et al., Med Care 2005;43:873-84

Indicator Δ Mort (%) Δ LOS (d) Δ Charge ($)

Birth trauma -0.1 (NS) -0.1 (NS) 300 (NS)

Obstetric trauma –cesarean -0.0 (NS) 0.4 2,700

Obstetric trauma - vaginal w/out instrumentation

0.0 (NS) 0.05 -100 (NS)

Obstetric trauma - vaginal w instrumentation

0.0 (NS) 0.07 220

Complications of anesthesia* 0.2 (NS) 0.2 (NS) 1,600

Transfusion reaction* -1.0 (NS) 3.4 (NS) 18,900 (NS)

Foreign body left during procedure† 2.1 2.1 13,300

* All differences NS for transfusion reaction and complications of anesthesia in VA/PTF.

† Mortality difference NS for foreign body in VA/PTF.

Page 37: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Comparison with EnglandComparison with England

Raleigh VS, Cooper J, Bremner SA, Scobie S, BMJ, 2008, 337; a1702

Zhan C, Miller MR. JAMA 2003;290:1868-74.

Rates/1000 admissionsMatched cases: excess mortality

Indicator England

(2005/6) US (2000)England

(2005/6) US (2000)

Death in low mortality HRGs 0.41 —

Decubitus ulcer 7.17 21.51 13.4 7.2

Iatrogenic pneumothorax 0.12 0.67 10.6 7.0

Infections due to medical care 1.06 1.99 5.7 4.3

Postoperative hip fracture 0.08 0.77 18.2 4.5

Postoperative sepsis 2.66 11.25 27.1 21.9

Obstetric trauma:

Vaginal with instrument 60.34 224.21 * 0.0

Vaginal without instrument 29.39 86.61 0.01 0.0

Caesarean 2.86 6.97 * -0.02

Page 38: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Construct validity: perspectivesConstruct validity: perspectives

Is the construct sound?Is the construct sound? Correlation with process measuresCorrelation with process measures

– Do we have the right process measures?Do we have the right process measures?

Correlation with structural measuresCorrelation with structural measures– Do we have the right structural measures?Do we have the right structural measures?

Correlation with other outcome domainsCorrelation with other outcome domains

Page 39: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Case control study of PSI 12Case control study of PSI 12

Cases (up to 20):Cases (up to 20):– Uni-TKA or Bilat-TKAUni-TKA or Bilat-TKA– Oct 2008 to Mar 2010Oct 2008 to Mar 2010– >40 yrs>40 yrs– PSI-12 code for VTE PSI-12 code for VTE

within 90 days within 90 days

Controls (up to 40):Controls (up to 40):– Uni-TKA or Bilat-TKAUni-TKA or Bilat-TKA– Oct 2008 to Mar 2010Oct 2008 to Mar 2010– >40 yrs>40 yrs– NO PSI-12 code for NO PSI-12 code for

VTE within 90 daysVTE within 90 days

•No TKA or THA within 90 days prior•No VTE as principal ICD-9-CM diagnosis•No VTE as POA•No pregnancy, childbirth, or puerperium

Page 40: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Analysis of case control dataAnalysis of case control data

Classified FDA-approved pharmacologic prophylaxis as Classified FDA-approved pharmacologic prophylaxis as receipt of the recommended dose at the recommended receipt of the recommended dose at the recommended starting time (per package insert) before or after surgery, starting time (per package insert) before or after surgery, and continued until at least the day of dischargeand continued until at least the day of discharge

Patients who were diagnosed with VTE on the day of Patients who were diagnosed with VTE on the day of surgery or the day after surgery were not included in the surgery or the day after surgery were not included in the case control analysiscase control analysis

Other risk factors assessed included age, obesity (BMI), Other risk factors assessed included age, obesity (BMI), type of TKA, race/ethnicity, date of ambulation, personal type of TKA, race/ethnicity, date of ambulation, personal or family history of VTE, and comorbid conditionsor family history of VTE, and comorbid conditions

Analysis adjusted for conditional stratified sampling of Analysis adjusted for conditional stratified sampling of controls without VTEcontrols without VTE

Page 41: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

41

Multivariable analyses of Multivariable analyses of process factorsprocess factors

Multivariate adjusted odds ratios and 95% confidence intervals Multivariate adjusted odds ratios and 95% confidence intervals Outcome: Any VTE event diagnosed Day 2 of surgery or laterOutcome: Any VTE event diagnosed Day 2 of surgery or laterExcluded one hospital that screened TKA patients routinely for VTE Excluded one hospital that screened TKA patients routinely for VTE

Predictive Factor Odds Ratio (95% CI) P value

Age 1.02 (0.99 – 1.05) 0.20

Gender (ref: male) 1.7 (0.9 – 2.9) 0.90

Ambulation (ref: no ambulation)Taking steps day 1 or 2Taking steps after day 2

0.3 (0.1 – 0.9)

0.7 (0.2 – 2.1)

0.005

0.56

Type of TKA (ref: unilateral TKA)Bilateral TKR 4.2 (1.9 – 9.1) 0.004

Recommended pharmacologic prophylaxis (ref: only mechanical)

0.5 (0.3 – 0.8) 0.01

BMI ≥ 35 (ref: BMI < 35) 0.9 (0.5 – 1.6) 0.66

Page 42: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Implications of recent Implications of recent validation studiesvalidation studies

Generalizability to nonparticipating hospitals Generalizability to nonparticipating hospitals remains unclearremains unclear

Limited supervision/oversight of local Limited supervision/oversight of local abstractors who may have COIabstractors who may have COI

ICD-9-CM changes and associated coder ICD-9-CM changes and associated coder training can substantially improve criterion training can substantially improve criterion validity of PSIs, in the right circumstancesvalidity of PSIs, in the right circumstances

Actionable opportunities to improve care may Actionable opportunities to improve care may persist despite 100% compliance with TJC persist despite 100% compliance with TJC process measures, but often we don’t know process measures, but often we don’t know what processes to use for construct validationwhat processes to use for construct validation

Estimating sensitivity or FN rate is still hardEstimating sensitivity or FN rate is still hard

Page 43: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Future directions in Future directions in validationvalidation

Encourage ongoing criterion validation work Encourage ongoing criterion validation work using previously developed tools or new using previously developed tools or new learning collaborativeslearning collaboratives

Continue to pursue opportunities to link Continue to pursue opportunities to link registry or EHR data, especially laboratory, registry or EHR data, especially laboratory, imaging, vital signs, etc.imaging, vital signs, etc.

Encourage case control and intervention Encourage case control and intervention studies when we know what processes to studies when we know what processes to measure or changemeasure or change

Page 44: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Improving indicator Improving indicator performanceperformance

Change indicator specifications to capture Change indicator specifications to capture false negatives or exclude false positivesfalse negatives or exclude false positives

Change ICD-9-CM codes, coding Change ICD-9-CM codes, coding guidelines, or Coding Clinic adviceguidelines, or Coding Clinic advice

Promote universal adoption of POA flagPromote universal adoption of POA flag Work with hospitals to improve and stan-Work with hospitals to improve and stan-

dardize clinical documentation and codingdardize clinical documentation and coding Everything will change in ICD-10-CM/PCS Everything will change in ICD-10-CM/PCS

(10/1/2014)(10/1/2014)

Page 45: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Survey of AMC coders about Survey of AMC coders about postop respiratory failurepostop respiratory failure

56 coders surved at 20 volunteer AMCs.56 coders surved at 20 volunteer AMCs. When a hospital’s coders reported being When a hospital’s coders reported being

more likely to “query physicians when a more likely to “query physicians when a patient appears to have experienced PRF, patient appears to have experienced PRF, but the physician did not document a but the physician did not document a specific diagnosis” (by 1 level on a 5-level specific diagnosis” (by 1 level on a 5-level Likert), the hospital:Likert), the hospital:– Had 1.8 (1.2-2.8) times higher use of 518.5.Had 1.8 (1.2-2.8) times higher use of 518.5.– Had 1.0 (0.6-1.4) times higher use of 518.8xHad 1.0 (0.6-1.4) times higher use of 518.8x

Page 46: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

http://qualityindicators.ahrq.gov/ValidationPilot.aspxhttp://qualityindicators.ahrq.gov/ValidationPilot.aspx

Page 47: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

Acknowledgments and referencesAcknowledgments and references

Participating hospitals and systems: Advocate Health Care, Barnes Participating hospitals and systems: Advocate Health Care, Barnes Jewish Hospital, Harborview Med Ctr, Nebraska Medical Ctr, Stanford Jewish Hospital, Harborview Med Ctr, Nebraska Medical Ctr, Stanford Hospital, UC Davis Medical Ctr, Vassar Brothers Med Ctr.Hospital, UC Davis Medical Ctr, Vassar Brothers Med Ctr.

AHRQ Quality Indicators project team: Mamatha Pancholi, John BottAHRQ Quality Indicators project team: Mamatha Pancholi, John Bott Battelle validation support: Jeffrey Geppert, Jaime LiesmannBattelle validation support: Jeffrey Geppert, Jaime Liesmann UC Davis: Pat Zrelak, Banafsheh Sadeghi, Richard White, Garth UtterUC Davis: Pat Zrelak, Banafsheh Sadeghi, Richard White, Garth Utter Sadeghi B, et al. Cases of iatrogenic pneumothorax can be identified Sadeghi B, et al. Cases of iatrogenic pneumothorax can be identified

from ICD-9-CM coded data. Am J Med Qual 2010; 25(3):211-7.from ICD-9-CM coded data. Am J Med Qual 2010; 25(3):211-7. White RH, et al. How valid is the ICD-9-CM based AHRQ Patient White RH, et al. How valid is the ICD-9-CM based AHRQ Patient

Safety Indicator for postoperative venous thromboembolism? Med Safety Indicator for postoperative venous thromboembolism? Med Care 2009; 47(12):1237-43. Care 2009; 47(12):1237-43.

White RH, et al. Evaluation of the predictive value of ICD-9-CM coded White RH, et al. Evaluation of the predictive value of ICD-9-CM coded adminstrative data for venous thromboembolism in the United States. adminstrative data for venous thromboembolism in the United States. Thromb Res 2010; 126(1):61-7.Thromb Res 2010; 126(1):61-7.

Sadeghi B, et al. Mechanical and suboptimal pharmacologic Sadeghi B, et al. Mechanical and suboptimal pharmacologic prophylaxis and delayed mobilization but not morbid obesity are prophylaxis and delayed mobilization but not morbid obesity are associated with venous thromboembolism after total knee arthroplasty: associated with venous thromboembolism after total knee arthroplasty: A case-control study. J Hosp Med 2012; in press.A case-control study. J Hosp Med 2012; in press.

Page 48: Current evidence and emerging methods regarding validation of the AHRQ QIs Patrick S. Romano, MD MPH UC Davis Center for Healthcare Policy and Research

QUESTIONS?QUESTIONS?