six sigma healthcare dea final paper
TRANSCRIPT
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
1/27
Int. J. Six Sigma and Competitive Advantage, Vol. 4, No. 3, 2008 305
Data envelopment analysis models for identifying andbenchmarking the best healthcare processes
James C. Benneyan*
Department of Mechanical and Industrial Engineering
Northeastern University
360 Huntington Avenue, Boston, MA 02115, USA
Fax: 6173732921
E-mail: [email protected]
*Corresponding author
Aysun Sunnetci and Mehmet Erkan Ceyhan
Northeastern University
334 Snell Engineering Center
Boston, MA 02115, USA
E-mail: [email protected]
E-mail: [email protected]
Abstract: We illustrate the use of Data Envelopment Analysis (DEA) modelswithin process improvement work for identifying and benchmarking the besthealthcare systems, in terms of most efficiently producing desirable outcomesfrom consumed resources. This approach is useful when comparing severalsystems that use multiple types of inputs (e.g., operating costs, clinicians, staff)to produce multiple outputs (e.g., outcomes, satisfaction, access), such as thosecommonly found in balanced scorecards and dashboard datasets, and providesthe analyst with relative scores and rankings for each system, targets for eachmeasure that would move inefficient systems to the best performance frontier,and a list of other systems to benchmark and emulate in order to improve.Modified DEA models are proposed to address four common issues thatfrequently arise in such contexts, including rationally constraining the weightsgiven to each measure and handling missing, estimated or proportional data(such as adverse event or mortality rates). These models can be used tocompare hospitals, departments, national healthcare systems, and regional orstate systems and are useful to help understand how to improve sub-optimalprocesses and set feasible targets. This approach is illustrated at department,hospital, state, and country levels, with overall results showing very littlecorrelation with less quantitative benchmarking studies.
Keywords: benchmarking; healthcare; data envelopment analysis; DEA;weight restrictions; proportional data; hyper-efficiency.
Reference to this paper should be made as follows: Benneyan, J.C.,Sunnetci, A. and Ceyhan, M.E. (2008) Data envelopment analysis models foridentifying and benchmarking the best healthcare processes, Int. J. Six Sigmaand Competitive Advantage, Vol. 4, No. 3, pp.305331.
Copyright 2008 Inderscience Enterprises Ltd.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
2/27
306 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
Biographical notes: James C. Benneyan, PhD, is an Associate Professor of
Industrial Engineering and Operations Research and the Director of the Qualityand Productivity Laboratory at Northeastern University, USA, a faculty forthe Institute for Healthcare Improvement and Advisor to several nationalhealthcare improvement projects. Previously, he was a Senior SystemsEngineer at Harvard Community Health Plan and is a past President andFellow of the Society for Health Systems. His research areas include statisticalmethods for quality improvement, healthcare systems engineering andoperations research in nanotechnology.
Aysun Sunnetci received her PhD in Industrial Engineering from NortheasternUniversity in Boston, Massachusetts, USA. Her research addresses two DEAproblems that frequently arise in practice: the handling of proportional data inconstant-returns-to-scale models and methods for constraining the weighting ofdecision variables within these models.
Mehmet Erkan Ceyhan is a PhD candidate in Industrial Engineering atNortheastern University in Boston, Massachusetts, USA. His research focuseson estimated proportions and ranked data in DEA and benchmarking analysisof national healthcare systems.
1 Introduction
In quality improvement and six sigma activities, benchmarking serves an important role
for identifying best practices, understanding deficiencies, and setting targets (Burstin
et al., 1999). First employed by Xerox in the 1970s, benchmarking has become
a common business practice for supporting continuous process improvement andmanagement decision making (McNair and Leibfried, 1992). In the classic Six Sigma
Define, Measure, Analyse, Improve, Control (DMAIC) approach, for example,
benchmarking can contribute to the measurement, analysis, and improvement activities.
This paper discusses and illustrates the use of Data Envelopment Analysis (DEA) for
benchmarking healthcare systems within these types of process improvement or Six
Sigma contexts. The intent is to illustrate how DEA can be used within these contexts
through a variety of examples, rather than provide a comprehensive review of DEA
theory or detailed results of each study; where appropriate references are provided to
such information and to each of the cited studies.
In general, benchmarking activities compare processes across organisations
(Stevenson, 1998), including efforts to identify potential comparison partners, understand
relative strengths and weaknesses, identify areas for improvement, determine gaps, and
set goals (Collins-Fulea et al., 2005). An experience of Westinghouses Electric Systems
Division is a successful example, becoming world class in part by adapting better
processes for material handling from Texas Instruments, subcontracting from Boeing, and
work team organisation from Rockwell. Within healthcare, a study by Solucient found
that annually 57 000 additional patients would survive, 18% fewer medical complications
would occur, average hospital lengths of stay would decrease significantly, and $9.5
billion would be saved if all hospitals in the USA performed as well as the best hospitals
(Chenoweth, 2003).
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
3/27
DEA models for identifying and benchmarking the best healthcare processes 307
Despite the clear value of identifying and transferring best practices, many
benchmarking approaches are fairly subjective in the manner by which they weight
performance metrics and determine top performers, often including qualitative
comparisons, questionnaires and surveys, expert assessments, and case study
comparisons. Often some type of score is computed for each organisation by applying
largely subjective weights or ranks to various measures, as described below.
Additionally, while most benchmarking tools identify an organisations relative strengths,
few provide additional information to help the underperforming organisations improve,
set goals, and identify peer organisations to emulate. In contrast, DEA is a quantitative
optimisation method for comparing entities (called Decision-making Units or DMUs)
in order to mathematically determine their relative efficiencies, assign weights to each
variable, set targets, and identify the best DMUs for further study.
2 Methodology
2.1 Efficiency frontier model
Originally developed within the operations research and econometrics communities, data
envelopment analysis is a mathematical method, based in linear programming models, for
comparing the relative efficiencies of multiple decision-making units at transforming the
multiple types of inputs each consumes into the types of multiple outputs each produces
(Charnes et al., 1981; Cooper et al., 2000). Inputs might include the number of clinical
staff, nurse-to-patient levels, and operating costs per patient day, whereas outputs might
include clinical outcomes, access, patient satisfaction, and safety. DEA mathematically
compares these measures across all DMUs in order to construct a production efficiency
or best-practice frontier consisting of those organisations that achieve the best weightedcombination of maximal outputs from minimal inputs (Medina-Borja et al., 2007).
Conceptually, the most efficient DMUs define an efficiency frontier that envelops
all the other DMUs, as illustrated graphically in Figure 1 for a simple one-input
one-output case, where the horizontal and vertical axes correspond to input and output
levels, respectively. In this example, the DMUs labelled A, B, and C comprise the
Variable Returns-to-Scale (VRS) frontier, whereas the inefficient DMUs D, E, and F can
reach this frontier by producing either the same amount of output with less input or more
output with same amount of input. Mathematically, a set of fractional optimisation
programmes based on total weighted output-over-input ratios (transformed into Linear
Programmes (LPs) for ease of solution) is solved to determine four results for each DMU:
1 an overall efficiency score between 0 and 1 relative to the other DMUs (where a
score of 1 indicates the DMU (e.g., facility) is on the frontier)
2 optimal weights for each input and output that maximise this score relative to those
of all other DMUs
3 target values for each input and output that would move this DMU onto the
efficiency frontier (if not currently best-in-class)
4 a subset list of the other DMUs that form a reference set for further study and
benchmarking (where becoming a weighted combination of these DMUs would
move a non-frontier entity to best in class).
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
4/27
308 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
Figure 1 Example of constant and variable returns-to-scale efficiency frontiers
A
B C
D
E
F
input
output CRS frontier
VRSfrontier
These LPs are solved iteratively, once for each DMU, along with first and second phase
dual models (Cooper et al., 2000) to produce the results described above. While several
different formulations exist, in general all DEA models seek to maximise the ratio of a
weighted sum of outputs over a weighted sum of inputs, as shown in Table 1, where Kis
the number of DMUs,Mis the number of outputs,Nis the number of inputs, and e is the
current DMU being measured. DEA models can assume either Constant Returns-to-Scale
(CRS) or VRS in the relationship between inputs and outputs, as illustrated in Figure 1,
and can be input- or output-oriented. An input-oriented model aims to minimise the level
of inputs while producing the same level of outputs, whereas an output-oriented model
aims to maximise the level of outputs while consuming the same level of inputs. These
two orientation formulations identify the same efficient and inefficient DMUs (in the
CRS case with the output-oriented efficiency score equal to the reciprocal of that of the
input-oriented model), but with different targets, weights, and reference sets.
Table 1 Constant Returns to Scale (CRS) input and output-oriented DEA models (in fractionalprogramme form)
Input oriented CRS model Output oriented CRS model
1
1
1
1
maximise
subject to 1 1
0 1
10
M
j je
j
e N
i ie
i
M
j jk
j
N
i ik
i
j
i
u O
z
v I
u O
k ,...,K
v I
u j ,...,M
i ,...,N v
=
=
=
=
=
=
=
=
1
1
1
1
minimise
subject to 1 1
0 1
10
N
j ie
ie M
j je
j
N
j ik
i
M
j jk
j
j
i
v I
z
u O
v I
k ,...,K
u O
u j ,...,M
i ,...,N v
=
=
=
=
=
=
=
=
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
5/27
DEA models for identifying and benchmarking the best healthcare processes 309
2.2 A simple example
Figure 2 illustrates the general framework of a typical DEA study, here comparing six
hypothetical hospitals each with three inputs (cost/charge ratio, FTE/bed ratio, and a
case-mix adjusted average length of stay index) and three outputs (adjusted mortality,
patient satisfaction, and access) that might be desirable to minimise and maximise,
respectively (since outputs are maximised in DEA, mortality rates were converted to
non-mortality rates). As shown, Hospitals 4 and 6 are top ranked and on the frontier (with
scores of 1.0) and appear in the peer benchmark sets of the other four hospitals, implying
that the others can improve by studying and emulating them. The next step in such an
analysis would be to conduct an in-depth study of these two hospitals to develop insights
as to how they are able to perform better. In order to illustrate this approach and the
breadth of uses of DEA within healthcare, several recent studies are summarised below,
at times using modified models as described in the following section.
Figure 2 Illustrative example of DEA analysis of six hypothetical hospitals
.02.03.012
days7.98H6
.25.30.2020
days30.35H5
.10.05.03
14
days24.47H4
.4.12.1010
days20.83H3
.25.075.0254
days14.85H2
.20.25.153
days8.95H1
Adj
LOS
index
FTE /
bed
ratio
Cost /
charge
ratio
Acce
ss
Pat.
Sat.
Adj
Mort
ality
InputsOutcomes
Hospi
tal
.02.03.012
days7.98H6
.25.30.2020
days30.35H5
.10.05.03
14
days24.47H4
.4.12.1010
days20.83H3
.25.075.0254
days14.85H2
.20.25.153
days8.95H1
Adj
LOS
index
FTE /
bed
ratio
Cost /
charge
ratio
Acce
ss
Pat.
Sat.
Adj
Mort
ality
InputsOutcomes
Hospi
tal
3611.000H6
0440.571H5
4411.000H4
04, 650.414H3
06, 430.734H2
04, 660.134H1
Freq
bench-
marked
PeersRankScore
DEA Results
Hospital
3611.000H6
0440.571H5
4411.000H4
04, 650.414H3
06, 430.734H2
04, 660.134H1
Freq
bench-
marked
PeersRankScore
DEA Results
Hospital
Note: Mortality is converted to non-mortality in order to be a larger is better output.
2.3 Model extensions
Two modelling issues that frequently arise in many healthcare applications include
proportional data (often estimated or missing) and irrational weights computed for
some measures. In the first case, many key healthcare data are proportions bound
between 0 and 1 (such as mortality, infection, adverse event, and appointment access
rates), violating the usual DEA assumption that all data can take any positive value.
Similar data also arise in other industries, such as defect, graduation, and customer
retention rates. Scalar data bound on a fixed interval present a similar problem, such as
patient satisfaction scores between 1 and 5 or life expectancies, as opposed to being
unbounded above.
Solving conventional CRS models in such cases theoretically can produce
nonsensical target values that exceed their upper possibilities (e.g., 130% survival or 420
years life expectancy). Borrowing an idea from logistic regression (Amemiya, 1985),
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
6/27
310 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
a simple Odds-Ratio (OR) transformation instead can be used to ensure all targets lie
within their logical bounds, converting each proportion p on the (0,1) interval to apositive real number odds ratiop/(1 p), offering the modeller an easy alternative when
VRS relationships are not appropriate; notationally:
,
,
,1
k jOR
k j
k j
II
I=
(1)
and
,
,
,
,1
k iOR
k i
k i
OO
O=
(2)
for the proportional j-th inputIk,j or i-th output Ok,i of DMU k, where now ,
and , Substituting these odds-ratios for all proportional inputs and outputs,DEA models can be solved in the usual manner, with the resultant odds-ratio targets
0 ORk i
O< <
.0 < < ORk jI*
,
OR
k jI
and then back-transformed to proportional targets*,
OR
k iO
*
,k jI and as:*
,k iO
*
, *
,
1
1k j OR
k j
II
=+
(3)
and
*
, *
,
1.
1k i OR
k i
OO
=+
(4)
The impact of this approach on efficiency scores, weights, reference sets, peer weights,
and targets is illustrated below and explored in greater detail by Benneyan and Sunnetci(2008). Approaches to the related modelling problems of non-proportional data bound on
an (a, b) interval (such as ratings between 1 and 10), estimated probabilities, and missing
data also are discussed by the above authors, Ceyhan and Benneyan (2008), Benneyan
et al. (2006), and Aksezer and Benneyan (2003), including multiple imputation,
bootstrapping, and Monte Carlo methods.
A second periodic problem that arises when using DEA to benchmark healthcare
systems is the production of irrational weights, such as placing greater weight on patient
satisfaction than on mortality (in the extreme case with zero weight essentially ignoring
important variables). Several possible modelling approaches to address this problem are
summarised in Table 2 and described below, the first two taken from the DEA literature,
along with their advantages and disadvantages.
The simplest approach is to rank order all weights via additional constraints that force
the desired relative ordering, e.g., u2 u4 or v1v2v3, although this typically produces
equal weights if the constraint would have been violated in the unbounded case. A second
frequent approach is to assign upper or lower bounds to weights, such as v1a or u2b
where a and b are some desired constants. Since weights mathematically are unbounded
above, however, these values are somewhat meaningless. An extension of this idea that
lends more meaning, however, is to specify or bound the percentages that each measure
can receive from the total weight given to all measures, e.g., ui = gi(u1 + u2 + uM)
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
7/27
DEA models for identifying and benchmarking the best healthcare processes 311
or vjhj(v1 + v2 + vN), where g1 + g2 + gM = h1 + h2 + hN = 1 (Sunnetci and
Benneyan, 2008). These Percent-of-Total (POT) constraints can be limited further to
desired ranges, i.e., aigibiand cjhjdj, or only specified for some of the weights.
Table 2 Possible weight-restricting approaches, advantages, and disadvantages
Approach Example Advantages Disadvantages
Simpleranking
u2u1
u2u3
Easily applied.
Prevents the problem ofallowing more weight onless important variables.
Does not prevent zero weights.
Typically produces equalweights (e.g., u2 = u1).
Lowerbounds
v1 0.43 u1 0.14
v2 0.33 u2 0.58
v3 0.18 u3 0.22
Prevents problems ofirrational ranking and
zero weights.
Lower bounds lack muchmeaning since weights are
unbounded above.Difficult to determine or agreeon arbitrary bounds.
Frequently a feasible solutioncannot be found (especially iflower bound >> zero).
Percentof Total(POT)
u1 = .25(u1 + u2 + u3)
u2 = .50(u1 + u2 + u3)
u3 = .25(u1 + u2 + u3)
Prevents bothabove problems.
Hard to determine specificpercentages, which are stillsomewhat subjective.
Notes: First two methods in literature, third method is proposed here.
Given that these POT gi and hj values also may be somewhat subjective, the fraction
of the entire possible (gi, hj) space can be identified for which each particular DMU is
efficient, referred to here as its hyper-efficiency score, with any DMUs on the frontier
for all possible values called hyper-efficient. These results can be identified or
estimated by iterative search, numerical methods, or a Monte Carlo scattering approach
that repeatedly solves the DEA models using random (gi, hj) values, somewhat measuring
a DMUs efficiency robustness using any set of weights. An alternate method to address
arbitrary weights, called cross-efficiency, computes the average efficiency score for each
DMU based only on the optimal weights of all other DMUs (Sexton et al., 1986; Doyle
and Green, 1994), in essence considering how efficient a DMU would be using (only) the
weights of the other DMUs.
In the below examples, all analyses were conducted using CRS output-oriented
models with all proportions and scalar data transformed to OR and all smaller-is-better
outputs (such as AE and mortality rates) subtracted from 1; weight-restrictions ormissing data imputation are noted when used and weighting robustness is measured via
hyper-efficiency.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
8/27
312 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
3 Applications
3.1 Hospital benchmarking
To illustrate a basic DEA study and the above modelling approaches, Table 3 summarises
an analysis of 17 hospitals, where the provided inputs were the costs of administration
and support, information systems, supplies, lab and imaging, nursing, and ancillary
services and rehabilitation. The outputs of interest were various clinical outcome and
patient safety measures (surgery quality, Cesarean related quality, failure to rescue rates,
surgery adverse event rates, delivery adverse event rates, and post operation adverse
event rates) that also serve as surrogates for the overall process quality. For each hospital
in Table 3, the first, second, and third rows contain their current data, targets, and
weights, respectively.
As shown in the second column, seven hospitals are on the best-practice frontier (with
scores of 1.0 and targets equal to their original values since they already are the topperformers). The challenge for inefficient hospitals is to benchmark those on the frontier
(or find other ways to reduce their inputs and increase their outputs to their computed
targets) in order to become as good as those with scores of 1. For example, Hospital 1
would become top-ranked if it could change its inputs and outputs to the target
levels shown in its second row (i.e., reduce its administration and support costs from
$3,619 to $1,578 and its surgery adverse event rate from .0473 to .0003, along with the
other targets).
Also note that, as described above, the DEA model set several weights irrationally in
order to maximise some DMUs scores. Hospital 2, for example, has been made to appear
efficient by setting the weights equal to zero for inputs 1, 2, 5, and 6 and for outputs 1, 2,
3, 4, and 5, placing little to no weight on measures for which this hospital performs
poorly. Additionally, Cesarean related quality has been weighted significantly lower(by more than 90%) than surgery quality.
In a similar analysis, Table 4 summarises unrestricted DEA results for the US News
and World Report (USNWR) annual published study of the best US hospitals, which in
2007 placed 17 hospitals on an honour roll. As above, the first and second rows for each
hospital contain the targets and weights, whereas the first and second (in parentheses)
values in the score column are the DEA and USNWR scores, respectively, where the
USNWR results were computed using a subjective weighting scheme where structure,
process, and outcome measures each received one-third of the weight (McFarlane et al.,
2007). Duke University Medical Center, for example, would become top-ranked if
it could achieve the target values shown in its first row. Note again, however, that
some measures receive zero or irrational weights (highlighted in italics and grey
shading, respectively).
As shown in Figure 3, furthermore, in contrast to the USNWR scores (R2 = 0.8535,
p = 0.00000005), the hospital-wide and average department DEA scores (unrestricted)
have little correlation to each other (R2 = 0.1288, p = 0.1436). The DEA results were
calculated by solving a separate model for the hospitals and for each type of department,
as described below, with low and variable correlations between departments suggesting
process and practice differences within hospitals. The correlations in Table 5 summarise
these differences for both the DEA and USNWR results.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
9/27
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
10/27
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
11/27
DEA models for identifying and benchmarking the best healthcare processes 315
Table 4 Unrestricted CRS model results for US News and World Report honour roll
(hospital-wide)
Inputs Outputs
Hospital
DEA
(USNWR)
scoreNursing
index
Advanced
services
Patient
services Reputation
Non-
mortality Discharges
T 1.9000 51.0000 85.0000 33.5750 1.6925 32 127.00Johns
Hopkins
1
(1) W 0.1152 0.0153 0.0000 0.0298 0.0000 0.0000
T 2.8000 50.5000 85.0000 36.8000 1.5748 95 317.00Mayo Clinic 1
(0.967) W 0.0000 0.0198 0.0000 0.0272 0.0000 0.0000
T 2.4000 57.0000 49.0000 15.8917 1.7167 25 504.00UCLA 1
(0.833) W 0.0000 0.0108 0.0079 0.0130 0.4619 0.0000
T 2.0000 57.0000 75.0000 25.7750 1.4797 60 769.00Cleveland 1
(0.833) W 0.2163 0.0018 0.0062 0.0052 0.4369 0.0000
T 2.0000 55.0000 75.0000 25.2516 1.4600 62 920.37Mass General 0.8983
(0.767) W 0.2416 0.0020 0.0070 0.0058 0.4881 0.0000
T 1.7000 58.0000 85.0000 14.1750 1.3986 86 964.00NY
Presbyterian
1
(0.700) W 0.1708 0.0122 0.0000 0.0000 0.0000 0.00001
T 1.6000 51.0874 74.0000 12.6984 1.4425 44 194.96Duke
University
0.9122
(0.600) W 0.3697 0.0000 0.0068 0.0031 0.6356 0.0000
T 2.2000 56.0000 61.0000 16.6399 1.7133 27 323.44UCSF 0.8439
(0.600) W 0.2812 0.0042 0.0054 0.0013 0.6257 0.0000
T 2.1000 56.5000 85.0000 15.9637 1.6534 70 264.03Barnes-J. 0.8361
(0.567) W 0.2723 0.00790 0.0021 0.0000 0.5887 0.0000
T 2.3000 58.0000 75.0000 11.8118 1.8404 39 213.05Brigham &
Womens
0.9006
(0.533) W 0.2445 0.0030 0.0050 0.0000 0.5402 0.0000
T 2.1000 40.5000 64.9492 16.9696 1.5740 31 918.15U WA 0.9232
(0.500) W 0.2676 0.0129 0.0000 0.0000 0.6882 0.0000
T 1.5000 49.0000 71.0000 8.1300 1.4815 22 703.00U Penn 1
(0.367) W 0.3000 0.0112 0.0000 0.0000 0.6750 0.0000
T 1.9000 49.0000 71.0000 10.5761 1.4471 61 973.02U Pitt 0.8552
(0.333) W 0.2961 0.0037 0.0060 0.0000 0.6543 0.0000
T 2.4000 54.0000 77.0000 9.8572 1.8367 47 372.89U Mich 0.7630
(0.300) W 0.2829 0.0035 0.0057 0.0000 0.6252 0.0000
T 1.8000 47.0000 52.0000 10.9319 1.4307 22 887.03Stanford 0.9394(0.267) W 0.3095 0.0039 0.0063 0.0000 0.6840 0.0000
T 2.5000 36.0000 55.0000 4.7000 1.6453 35 234.00Yale 1
(0.267) W 0.0000 0.0278 0.0000 0.0135 0.5692 0.0000
T 2.0000 47.0000 70.0000 10.0143 1.4842 59 521.52Cedars 0.9836
(0.233) W 0.2606 0.0076 0.0020 0.0000 0.5634 0.0000
T 2.3000 50.0000 63.0000 9.8120 1.7339 30 351.05U Chicago 0.8998
(0.233) W 0.2454 0.0048 0.0049 0.0000 0.6410 0.0000
Notes: T = Target, W = Weight.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
12/27
316 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
Figure 3 Comparison of hospital-wide and average department scores for US News and World
Report versus DEA results (see online version for colours)
0
0.25
0.5
0.75
1
0 0.25 0.5 0.75 1Hospital-Wide Score
AverageDepartmentScore DEA (R2 = 0.1288,p = 0.1436)
USNWR (R2 = 0.8535,p = 0.00000005)
Table 5 Cross-correlations of department DEA results
Department
Cancer
Digestive
Disorders
Ear-Nose-
Throat
Endo-
crinology
Geriatrics
Gynecology
Heart
Kidney
Disease
Neurology&
Neuro-
surgery
Ortho-
pedics
Respiratory
Disorders
Urology
1
(1)
-.0006 1
(.6746) (1)
.1702 .6122 1
(.6706) (.2063) (1)
.1603 .4651 .5467 1
(.5808) (.8146) (.1478) (1)
.1696 .4605 .5949 .5536 1
(.5370) (.4046) (.3652) (.2751) (1)
.0238 .3980 .4095 .3302 .2712 1
(.5261) (.5153) (.3390) (.3552) (.3446) (1)
.1765 .8165 .5244 .6013 .7360 .5159 1
(.3056) (.7670) (.1305) (.4790) (.0808) (.4369) (1)
-.0454 .6316 .4235 .6618 .2712 .6537 .7670 1
(.2399) (.6810) (.0218) (.6145) (.3945) (.5923) (.6937) (1)
.3067 .7383 .5667 .5279 .7086 .3486 .8860 .6145 1
(.4886) (.7321) (.2420) (.8259) (.3349) (.4283) (.5482) (.7138) (1)
.1885 .7623 .3357 .0245 .2752 .2552 .8345 .4590 .6260 1
(.4832) (.9198) -(.0365) (.8378) (.2692) (.2466) (.7511) (.6310) (.6808) (1)
.1353 .0874 .3291 .3770 .4613 .5782 .3691 .4175 .5010 .0612 1
(.6736) (.7389) (.4653) (.8333) (.3572) (.5441) (.6285) (.6993) (.8183) (.7823) (1)
-.0474 .7171 .4553 .3515 .3999 .1321 .6990 .4585 .4662 .5491 -.0496 1
(.6142) (.6839) (.5150) (.4057) (.5271) (.5200) (.6760) (.5987) (.6904) (.4656) (.6687) (1)
Geriatrics
Gynecology
Heart
Urology
KidneyDisease
Neurology &
Neurosurg
Orthopedics
Respiratory
Disorders
Cancer
Digestive
Disorders
Ear-Nose-
Throat
Endo-
crinology
Note: USNWR results shown in parentheses.
3.2 Department benchmarking
Since multiple departments contribute to the overall performance of a hospital, separate
benchmarking across each specialty also can be useful. For example, Sunnetci and
Benneyan (2008) applied conventional and weight-restricted DEA models to the above
12 specialty departments examined by USNWR. For the sake of illustration, results
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
13/27
DEA models for identifying and benchmarking the best healthcare processes 317
are presented here only for the Ear, Nose, and Throat (ENT) departments. Table 6 and
Table 7 summarise the best practice ENT departments found in that study (with DEA
scores equal to 1.0) and a subset of the full results for all ENT departments, respectively,
with the USNWR results shown in parentheses.
Table 6 All ENT departments with unrestricted DEA scores = 1
DEA best-practice ENT departments
Greater Baltimore Medical Center (.217)
Hospital of the University of Pennsylvania (.490)
Johns Hopkins Hospital (1.00)
Massachusetts Eye and Ear Infirmary (.601)
Mayo Clinic (.504)
Memorial Sloan-Kettering Cancer Center (.346)
Ochsner Clinic Foundation (.213)
Ohio State University Hospital (.308)
St. Johns Mercy Medical Center (.237)
University of Alabama Hospital at Birmingham (.287)
University of California San Francisco Medical Center (.403)
University of Kentucky Chandler Hospital (.223)
M.D. Anderson Cancer Center (.543)
Note: USNWR scores shown in parentheses.
Table 7 Subset of unrestricted DEA results for ENT departments
Inputs Outputs
Hospital
DEA
(USNWR)
score
Nursing
index
Advanced
services
Patient
services Reputation Non-mortality Discharges
Weights v1 v2 v3 u1 u2 u3
T 2.2000 3.0000 5.0000 9.9000 2.1277 189.0000UCSF 1
(0.403) W 0.4546 0.0000 0.0000 0.0957 0.0000 0.0003
T 2.0000 2.8194 6.0000 24.6032 2.5062 395.9797Cleveland 0.6869
(0.493) W 0.5502 0.0000 0.0592 0.0137 0.0000 0.0028
T 1.5000 1.5000 7.0000 2.3970 13.1487 277.1343Tampa 0.4114
(0.215) W 0.7423 0.7770 0.0217 0.0000 0.0000 0.0088
T 2.10000 2.50000 5.23694 25.20444 2.77828 382.06735Univ WA 0.4999
(0.428) W 0.530663 0.354384 0.000000 0.009028 0.000001 0.004640
T 1.90000 2.50000 7.00000 40.60000 1.88679 275.00000Johns
Hopkins
1
(1) W 0.22629 0.00000 0.08143 0.02463 0.00000 0.00000
T 1.50000 2.50000 6.00000 0.00000 1 000 000.00000 74.00000Ochsner 1
(0.213) W 0.29973 0.22016 0.00000 0.00497 0.00000 0.00276
Notes: T = Target, W = Weight.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
14/27
318 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
These results again illustrate poor agreement with the USNWR findings and the common
problem of zero or irrational weights in unrestricted models (cells with italic font andgrey shading in Table 7). The University of Washington Medical Center (UWMC), for
example, places less weight on non-mortality than on reputation, whereas the University
of California Hospital in San Francisco (UCSF) places no weight on non-mortality,
advanced services, and patient services. Table 8 illustrates how results change when the
weight restriction approaches described in Section 2.3 were applied, using the bounds
shown in Column 2, with different hospitals being efficient based on the particular
approach and bound values used.
Table 8 Comparison of best-practice ENT departments for each weight-restricted model
Model Bounds used Best-practice hospitals
Basicordering
u2u
1
u2u3
Greater Baltimore Medical Center
Hospital of the University of Pennsylvania
Massachusetts Eye and Ear Infirmary
Memorial Sloan-Kettering Cancer Center
Ochsner Clinic Foundation
St. Johns Mercy Medical Center, St. Louis
M.D. Anderson Cancer Center
University of California, San Francisco Medical Center
Lowerbounds
v1 0.43 u1 0.14
v2 0.33 u2 0.58
v3 0.18 u3 0.22
Greater Baltimore Medical Center
Hospital of the University of Pennsylvania
Massachusetts Eye and Ear Infirmary
Memorial Sloan-Kettering Cancer CenterOhio State University Hospital
St. Johns Mercy Medical Center, St. Louis
UCLA Medical Center
University of Alabama Hospital at Birmingham
M.D. Anderson Cancer Center
University of Kentucky Chandler Hospital
POT u1 = 0.25(u1 + u2 + u3)
u2 = 0.50(u1 + u2 + u3)
u3 = 0.25(u1 + u2 + u3)
Massachusetts Eye and Ear Infirmary
Memorial Sloan-Kettering Cancer Center
Ochsner Clinic Foundation
Ohio State University Hospital
M.D. Anderson Cancer Center
St. Johns Mercy Medical Center, St. Louis
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
15/27
DEA models for identifying and benchmarking the best healthcare processes 319
Table 9 Hyper-efficiency results for ENT departments (USNWR), based on 1000 replications
Standard
deviation
0.1778
0.1842
0.1090
0.0924
0.1352
0.3295
0.0860
0.1296
0.1187
0.1213
0.0991
0.0782
0.0784
0.04863
0.0878
0.0839
0.0930
0.2038
0.1135
0.1120
0.1063
0.1207
0.0509
0.1127
0.1896
Average
efficiency
0.53
0.56
0.03
0.30
0.13
0.35
0.08
0.13
0.06
0.06
0.04
0.02
0.22
0.15
0.03
0.02
0.03
0.58
0.63
0.04
0.39
0.75
0.39
0.03
0.50
Percen
t
efficient(
%)
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
Hospital
Mount-Sinai
Presbyterian
OhioState
Oregon
Rush
Shands
St.Francis
St.Josephs
Stanford
Tampa
UCLA
Alabama
California
SanDiego
Chicago
Iowa
Kentucky
JacksonMemorial
Michigan
Minnesota
NorthCarolina
Pittsburgh
Washington
Vanderbitt
Yale
Standard
deviation
4.5736E-12
0.3538
0.1624
0.1805
0.2415
0.2425
0.1932
0.2045
0.3262
0.1218
0.1065
0.1148
0.1526
0.4030
0.0734
0.0545
0.2038
0.0730
0.1161
0.1006
0.1413
0.2982
0.2275
0.4553
0.1169
Average
efficiency
1.00
0.69
0.55
0.80
0.84
0.53
0.73
0.21
0.36
0.31
0.29
0.28
0.38
0.44
0.56
0.34
0.21
0.60
0.50
0.44
0.35
0.32
0.23
0.40
0.11
Percent
efficient(%)
100.00
27.73
2.40
1.51
0.77
0.30
0.10
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
Hospital
Ochsner
Sloan-Kettering
SanFranMedicalCenter
M.D.Anderson
MassEyeandEar
St.JohnsMercy
Pennsylvania
Advocate
Barnes-Jewish
BethIsrael
Brigham
Charleston
Christiana
Clarian
Cleveland
Cullen
Duke
Emory
Baltimore
H.LeeMoffitt
St.Raphael
JohnsHopkins
MassGeneral
Mayo
Miami
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
16/27
320 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
Figure 4 Comparison of DEA and USNWR department scores (unrestricted models) (see online
version for colours)
Ca
ncer
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Di g
estiveDis
orders
0
0.
2
0.4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Hea
rt
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Kidn
eyDiseas
e
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Ho
spital
Score
Ear-Nose-Throa
t
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Endo
crinology
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Neurolo
gyandNe
urosurge
ry
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Or
thopedics
0
0.
2
0.
4
0.
6
0.8
1
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Geriatr
ics
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Gynecolo
gy
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Re
spiratoryD
isorders
0
0.
2
0.
4
0.
6
0.
81
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospital
Score
Urology
0.
0
0.
2
0.
4
0.
6
0.
8
1.
0
1
4
7
10
13
16
19
22
25
28
31
34
37
40
43
46
49
Hospita
l
Score
US
N
DEA
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
17/27
DEA models for identifying and benchmarking the best healthcare processes 321
Table 9 summarises the efficiency score means and standard deviations for each
department from 1000 replications using uniformly distributed POT weights, with the
Ochsner Clinic Foundation in New Orleans being the only hyper-efficient hospital,
meaning it will always be on the frontier for any possible POT bounds. In contrast,
Memorial Sloan-Kettering Cancer Center and University of CA San Francisco Medical
Center are on the frontier for only 28% and 2.4% of the (gi, hj) space respectively (based
on 1000 replications), and no other ENT departments ever were efficient including
those shown in shaded cells previously on the unrestricted frontier when at least one input
or output was ignored, presumably due to the small number of replications and small
region over which they would be efficient in a larger analysis. As shown in Figure 4,
moreover, the DEA scores for all departments usually are larger than and uncorrelated
with the USNWR scores.
3.3 Benchmarking of national healthcare systems
Similar analyses can be conducted to compare entire national healthcare systems.
For example, the World Health Organization (WHO) ranked the performance of 193
countries by assigning equal weights to several dozen measures of overall health,
responsiveness, resources expended, and distribution of services (Musgrove et al., 2000),
although their study received a fair amount of criticism due to data, analysis
methodology, weighting, and fairness issues (Alan, 2001; Jamison and Sandbu, 2001;
Starfield, 2000).
Table 10 Data elements used in DEA study of national healthcare systems
Dimension Data element or surrogate measure
Care and outcomes (output) Healthy life expectancy
Adult non-mortality rate
Infant non-mortality
Morbidity surrogate measure (non-TB rate)
Equity (output) Weighted combination of urban-to-rural under five year mortalityrate, upper-to-lower wealth quartile, and none-to-high educationmother ratios (equity)
Safety (output) Non adverse event rate
Cost and resources (input) Per capita total expenditure
Doctors and nurses per 1000 capita (trained medical people)
Hospital beds per 1000Prevention (input) Surrogate measure (immunisation rate)
Demographics (input) Median age
Notes: All mortality, morbidity, and adverse event outputs converted to non-mortality,non-morbidity, and non-AE rates.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
18/27
322 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
Table 11 Sample of results for unrestricted CRS output-oriented model using all 180 countries
TB
prevalence
0.00003
0.00001
0.00060
0.00073
0
0.00009
0.00003
0.00039
00.00090
0.00015
00.00030
0.00003
0.00001
0.00051
0.00002
Infant
mortality
rate
0.00412
0.0008
0.01895
0.00001
0.03357
00.017
0.00026
0.003
0.00218
0.04858
0
0.00527
0.00624
0.01862
0
0.00509
0.00071
0.0156
0.00188
Adult
mortality
rate
0.0058
0.0021
0.00770 0
0.0096
0.0116
0.0076
0.0055
0.0015
0.0122
0.0085
0.008100.007700.0070
0.0042
0.0091
0.0018O
utputs
Healthylife
expectancy
atbirth
77.78
0.12
71.17
0.559
60.94
0.283
65.067075.063056.240
0.238
78.046
0.368
70.667
0.607
73.178
0.00001
64.788
0.374
Median
age38.9
1.26
32.7
1.46
24.9
4.35232.75
42.9
0.28
19.8
4.39
38.4
1.77
28.1
2.92
36.5
1.32261.18
Im
munisation
rate
0.07
0.34
0.1767
0.8149
0.397600.2200.01
15.068
0.27200.03
39.535
0.19
0.3903
0.0633
0.5504
0.1733
1.1486
Trained
medical
people
11.9 0 2
.1
0.1134
1.3
0.0551
2.4998
0.0713
9.7704
0.0012
0.97010.3 0
3.0424
0.085
11.3 0 2
.7 0
Inputs
Percap.
spending
$2,6690$61
0.002
$27
0.001
$1640$2,662
0.0002
$13
0.003
$167
0.001
$110$2,1630$146
0.001
Beds360.011
23.11
0.007
5.9820 18 0
129.3702.368050.0020 26 0 3
30.01290.034
T W
T W
T W
T W
T W
T W
T W
T W
T W
T W
Score
0.818
0.624
0.667
1 1 0.902
0.422
0.645
0.847
0.938
Country
Canada
China
India
Jamaica
Japan
Pakistan
Russian
Federation
Turkey
USA
Venezuela
Notes:
T=Target(1strow),
W=Weight(2ndrow).
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
19/27
DEA models for identifying and benchmarking the best healthcare processes 323
As an alternate approach, Benneyan et al. (2007) and Ceyhan and Benneyan (2008)
applied DEA to a subset of these data across six healthcare system dimensions,
summarised in Table 10. In some cases, surrogate measures were used for a general
dimension (e.g., immunisation rates as a marker for preventive care), with a total of five
inputs and six outputs. All data were gathered from the WHO website1 with the exception
of the safety adverse event data, compiled from wrongdiagnosis.com. Although a
small amount of missing data were imputed via multiple regression, thirteen of the 193
countries still were eliminated because most of their data were missing, with equity and
safety measures both available for only 39 countries. Two separate analyses therefore
were conducted, first only on these 39 counties with all measures and then on again all
180 countries, in the latter case both combined and partitioned into each of the WHOs
four economic categories (based in gross national income per capita). The average
healthy life expectancy measure was treated as bounded data (using an arbitrary upper
bound of 80) and transformed via the OR approach.Table 11 summarises a sample (given space limitations) of the unrestricted DEA
results for the larger data set (i.e., without safety and equity), where the first and second
rows for each country again contain the target values and weights, respectively. One
hundred and fifteen of the 180 countries were not on the best-in-class frontier, regardless
of whether they have abundant inputs; for example, Jamaica and Japan both are efficient,
whereas the USA and Turkey both are inefficient. Table 12 summarises the reference sets
for those countries in Table 11, with the percentage weights normalised to sum to 100%
(representing the contributions of the reference countries for each particular healthcare
system to become efficient). Again note that efficient healthcare systems do not have any
others (other than themselves) in their reference sets.
Table 12 Reference sets for unrestricted CRS output-oriented model, listed in decreasing orderof weights
Country Reference set
Canada Jordan (30.8%), Sweden (24.8%), Mexico (18.3%), Oman (10.8%), Iceland(7.8%), Guatemala (7.6%)
China Syrian Arab Rep. (14.7%), Bhutan (11.0%), Eritrea (9.4%), Comoros(5.0%), Vietnam (3.6%)
India Comoros (86.4%), Cape Verde (9.8%), Uganda (2.9%), Guatemala (0.9%)
Jamaica Jamaica (100%)
Japan Japan (100%)
Pakistan Comoros (96.7%), Zambia (2.5%), Guatemala (0.7%)
Russian Federation Syrian Arab Rep. (59.9%), Oman (21.8%), Seychelles (20.5%),Singapore (2.7%)
Turkey Nicaragua (48%), Belize (43.6%), Jamaica (5.0%), Oman (3.5%)
USA Jordan (65.5%), Sweden (22.7%), Iceland (6.2%), Guatemala (4.5%),Mexico (1.2%)
Venezuela El Salvador (40.5%), Comoros (33.3%), Morocco (9.1%), Syrian ArabRep. (8.3%), Singapore (3.5%), Mexico (4.2%), Jordan (1.1%)
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
20/27
324 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
Table 13 lists all countries that were efficient only in the economic group analyses
(left-hand columns) and those that were efficient in both sets of analyses (right-handcolumns), in the second case indicating a sense of strong or robust efficiency and
significant potential value in studying these national systems to gain valuable insights. In
contrast, the USA healthcare system interestingly never exhibits efficiency, presumably
because it does not transform the much higher levels of resources it consumes
into proportionally higher levels of outputs (even under VRS assumptions). Figure 5
illustrates the small correlation between rankings produced by the WHO and DEA studies
(for the CRS output-oriented unrestricted overall model). While almost statistically
significant (p = 0.5192), the agreement is fairly weak with a correlation ofR2 = 0.048.
Thirteen of the WHOs best performing countries are inefficient overall and in their
respective economic groups, with the exceptions of only Japan and Switzerland, whereas
some countries with the fewest healthcare resources and ranked poorly by the WHO are
efficient in the DEA analysis.
Table 13 Summary of efficient national healthcare systems, overall and withineconomic groups
Efficient only within economic group Efficient both within and between economic groups
Andorra
Bahrain
Brunei Darussalam
Canada
Colombia
Cuba
Democratic Republicof the Congo
Djibouti
Equatorial Guinea
Grenada
Hungary
Indonesia
Iran
Kuwait
Libyan Arab
JamahiriyaMaldives
Mauritius
Namibia
Philippines
Qatar
Republic ofKorea
Saudi Arabia
Slovakia
Uzbekistan
Venezuela
Zimbabwe
Antigua &Barbuda
Bangladesh
Belarus
Belize
Benin
Bhutan
Burundi
Cape Verde
Chile
Comoros
Costa Rica
Cyprus
CzechRepublic
Dominica
EcuadorEl Salvador
Eritrea
Ethiopia
Finland
Gambia
Guatemala
Haiti
Honduras
Iceland
Israel
Jamaica
Japan
Jordan
KyrgyzstanMalaysia
Mexico
Morocco
Mozambique
Nepal
Nicaragua
Niger
Oman
Panama
Paraguay
Rwanda
Seychelles
Sierra Leone
Singapore
Slovenia
Somalia
Spain
Sri Lanka
Swaziland
Sweden
Switzerland
SyrianArab Republic
Tajikistan
Tonga
Uganda
UnitedRepublicof Tanzania
Vietnam
Zambia
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
21/27
DEA models for identifying and benchmarking the best healthcare processes 325
Figure 5 Low correlation between DEA and WHO rankings of national healthcare systems
(see online version for colours)
Comparison of WHO vs. DEA rankings
0
50
100
150
200
0 50 100 150 200
WHO Ranking
DEA
Ranking
r2 = .0196
Comparison of WHO versus DEA rankings
Notes: R2 = 0.048,p = 0.5192.
3.4 Benchmarking at the regional state level
The same type of analysis also can be used to benchmark state and regional healthcaresystems. In 2007, the Commonwealth Fund published a comparison of the relative
performances of the healthcare systems of all US states based on a (subjectively)
weighted score card analysis of 32 measures in five dimensions of care: outcomes,
quality, access, efficiency, and equity (Cantor et al., 2007). Their general methodology
consensus-ranked the states for each measure separately, then rank ordered the systems
within each dimension based on the average of their measure ranks within that
dimension, and finally rank ordered the overall state healthcare systems based on their
average dimension ranks. As an alternative, using a subset of these data, shown in
Table 14, Table 15 summarises the results of a DEA comparison of the state healthcare
systems (Benneyan et al., 2007). Again, note that the unbounded model assigned
zero weights (italic cells) to some performance measures in order for many states to
appear efficient.
Conducting the same analysis with the weight restriction constraints listed below
results in a 12.1% average decrease in efficiency scores, with only Hawaii, Maine,
Massachusetts, Minnesota, Utah, and Vermont remaining efficient:
v4 > v1, v4 > v2, v4 > v3 (5)
v5 > v1, v5 > v2, v5 > v3 (6)
u4 > u7, u4 > u8 (7)
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
22/27
326 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
u5 > u7, u5 > u8 (8)
u9 > u11 > u6 > u5, u6 > u4 (9)
u10 > u1. (10)
Figure 6 compares the frequency that each state is in anothers reference set in the
restricted and unrestricted cases, with Hawaii and Utah being the most frequent
benchmarks in the weight restricted model, followed by Massachusetts, Minnesota,
Michigan, and Vermont. As shown in Figure 7 and in contrast to the WHO results, fairly
strong correlation exists between the CWF and weight-restricted DEA ranks (R2 = 0.687,
p = 0.000000037). In general, those states in the top quartile of the CWF study also were
efficient in the DEA analysis, with the exception of New Hampshire, Rhode Island,
Connecticut, Nebraska, and North Dakota; conversely, Utah was ranked 24th by the
CWF but was still on the DEA efficiency frontier.
Table 14 Data elements used in DEA analysis of state healthcare systems (usingCommonwealth Fund data)
Dimension Weight Element
Access(outputs)
u1
u2
u3
Adults under age 65 insured (O1)
Children insured (O2)
Adults visited a doctor in past two years (O3)
Quality(outputs)
u4
u5
u6
u7
u8
Percent of adults age 50 and older received recommended screeningand preventive care (O4)
Percent of children ages 1935 months received all recommended dosesof five key vaccines (O5)
Percent of hospitalised patients received recommended care for acutemyocardial infarction, congestive heart failure, and pneumonia (O6)
Adults with a usual source of care (O7)
Children with a medical home (O8)
Healthylives(outputs)
u9
u10
u11
Non-Mortality amenable to healthcare, deaths per 100 000population (O9)
Infant non-mortality, deaths per 1000 live births (O10)
Percent of adults under age 65 unlimited in any activitiesbecause of physical, mental, or emotional problems (O11)
Cost ofcare
(inputs)
v1
v2
v3
v4
v5
Medicare hospital admissions for ambulatory care sensitive conditionsper 100 000 beneficiaries (I1)
Medicare 30-day hospital readmissions as a percent of admissions (I2)
Percent of home health patients with a hospital admission (I3)
Total single premium per enrolled employee at private-sectorestablishments that offer health insurance (I4)
Total Medicare (Parts A and B) reimbursements per enrollee (I5)
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
23/27
DEA models for identifying and benchmarking the best healthcare processes 327
Table 15 Results of DEA analysis of state healthcare systems without weight restrictions
Ref
sets
CA
DC
AZ,HI,MD,
MA,UT
HI
MA
HI,MD,
RI,VT
CT,HI,IA,ME,
MA,
NH,RI
CI,DE,DC,HI,
MD,MN,RI
CA,IA,UT
WY
O11
10.90
0.06
10.8 0
15.17
014.23
013.70
014.18
013.47
0 14 0.03
11.38
0.064
14.5 0
O10
994
0.0021
989 0
993.2
0992 0
995.2
0994.6
0.0016
994.4
0992 0
995.2
0.0034
993.3
0
O9
99907
099840
099907
0.0004
99913
0.0004
99914
099912
099913
099894
099930
099923
0
O8 37.5
0 45.2
0 47.1
0 45.3
0 60.3
0 57.2
0 56.1
0.22
8
52.0
0 50.9
0 40.5
0.8416
O7
71.1 0
77.7 0
80.1 0
81.8 0
87.1 0
85.8 0
86.0
0.030
83.9
0.0124
82.2 0
74.9 0
O6
79.4 0
83.9 0
82.6 0
79.9 0
85.8 0
84.8 0
86.8
0.033
85.0 0
87.1 0
80.3 0
O5
77.9
0.0357
73.5
0.0251
80
0.0082
80.1
0.0654
93.5
0.0695
91.7 0
86.0
0.0012
81.5 0
82.7
0.0224
78.6 0
O4
37.4 0
45.6 0
42.019
0.1069
36.6 0
46.7 0
44.374
0.4186
44.961
045.379
0.7712
42.915
037.3 0
O3
76.7 0
91.5
0.086
86.546
0.078
88.9
0.03
90.3 0
89.8
0.056
87
0.010
88.5
0.034
82.8 0
73.9 0
O2
87 0
92.8 0
90.84
094.7 0
94.8 0
94.6 0
93.67
091.98
0.001
93.3 0
89.3 0
Outputs
O1
75.5 0
83.3 0
81.86
087.2 0
85.4 0
85.66
086.32
0.062
84.5 0
86.3 0 81 0
I57424
06312
0.00009
5937
04530
0.00022
7804
0.00013
6835.5
06014.6
05975
0.00004
6009
05323
0
I43534
04218
03328
03119
04141
03858
0.00014
3782
0.00019
3773
0.0002
3781
0.00035
3761
0.00032
I3 21.9
0.0456
27.3
0.01481
21.2
0.04402
24.7 0 29 0
27.425
029.3
0.01497
24.86
028.169
025.6 0
I2 18.2 0
20.4 0
17.4
0.0065
14.5
0.000
19.8
0.000
17.9
0.0326
15.777
0.000
16
0.0119
17.24
0.000
13.3 0
Inputs
I16383
08101
05794
04069
07830
06831
06480
06683
06156
06016
0
T W
T W
T W
T W
T W
T W
T W
T W
T W
T W
Score
1 1 0.955
1 1 0.90
0.85
0.86
0.75 1
Country
CA
DC
FL
HI
MA
NY
OH
SC
TX
WY
Notes:
T=Target,W=Weight.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
24/27
328 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
Figure 6 Frequency of state healthcare system being in a benchmark reference set (see online
version for colours)
0
4
8
12
16
20
24
28
32
AZ
CA
CO
CT
DE
DC H
IIAM
EMD
MA M
I
MN
NE
NH
NJ
NC
ND
OR
PA R
I
SD
UT
VT
WA
WV
WY
Frequency(%)
Restricted weights Unrestricted weights
Figure 7 Comparison of Commonwealth Fund and weight-restricted DEA rankings of statehealthcare systems (see online version for colours)
Comparison of State Results
0
10
20
30
40
50
60
0 10 20 30 40 50 6
The Commonwealth Fund Ranking
DEARanking
0
R = 0.687
= 0.0000000037
4 Conclusion
DEA is an effective benchmarking tool that can help identify systems and processes on
the best practice frontier, provide actionable targets to transform non-frontier systems to
best-in-class, and identify comparators that each system should study and emulate. As
such, DEA is a useful complement to other benchmarking methods and often produces
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
25/27
DEA models for identifying and benchmarking the best healthcare processes 329
different conclusions or additional insight, underscoring both its value and the value of
more quantitatively considering the amounts of input resources consumed relative to
outputs produced.
DEA adds particular value when there are multiple inputs and outputs to consider and
when the relationships and best weighting structure among them are not immediately
transparent, with the additional advantages of determining empirically achievable targets,
identifying non-frontier DMUs that never can be called best under any weighting
scheme, and discovering possibly otherwise unidentified processes of excellence that
other methods may miss. Good examples of this are the identification of Jamaica,
Pakistan, Hawaii, and Utah as having very efficient healthcare systems, while most
comparison and reform discussions tend to focus on a handful of more developed
countries or industrialised states. Many of the DEA-best hospitals identified in
Section 3.1 also typically are under-examined by Solucient, USNWR, and other popular
benchmarking studies.In contrast to these and other typical analyses that assign subjective or consensus
weighting schemes to each of several criteria, DEA determines each systems optimal
weights that maximise its score relative to the others. Since no other combination of
weights can produce a higher relative score, results can be thought of as optimistic in the
sense that DEA computes the best possible case for each DMU; conversely, any system
not on the DEA frontier can never be efficient for any other set of weights, however
chosen. An additional interpretation of the computed weights is that they somewhat
reflect each systems intrinsic tradeoff values, lending insight to management styles
and dispositions.
It also is important to understand the meaning of being on the DEA frontier and to not
misinterpret results, namely that such DMUs are the most efficient among the particular
set of DMUs being considered at transforming inputs into outputs, whereas inefficientcountries and hospitals (such as the US healthcare system and Tampa General Hospitals
ENT department) still may produce very good outcomes, just at disproportionate costs.
Since it is a relative rather than absolute measurement method, inefficient DMUs also
might perform very efficiently but just be outshone by others; conversely, the most
efficient DMUs may not exhibit much excellence but simply be the best among a bad lot.
A sufficient number of DMUs also should be used to obtain useful differentiation
between them, with a common rule-of-thumb being that it should exceed twice the total
number of input and output categories. Too few DMUs or too many inputs and outputs
can allow almost any system to appear efficient by placing most weight on a few
variables in which it might excel, greatly limiting the practical value of the analysis
(although this is less true as more weight restrictions are imposed). It also is important to
structure the data such that all inputs and outputs are independent of one another for
theoretical reasons (total operating costs and average physician salary being a possiblecounter-example). Finally, if the analyst has additional modelling insight about how
inputs are transformed into outputs, related methods such as stochastic frontier analysis
also can be appropriate.
With roughly 5700 hospitals in the USA alone, the potential to improve healthcare
systems via benchmarking is significant. Basic DEA models usually will be sufficient for
this purpose, although in some cases modelling issues such as those discussed above
necessitate alternate methods. As demonstrated, the weight restriction and OR models
offer the analyst simple solutions in such cases. Software and spreadsheet macros to
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
26/27
330 J.C. Benneyan, A. Sunnetci and M.E. Ceyhan
perform all conventional and modified analyses illustrated in this paper are available
from the lead author. Although treating the above examples with VRS or input-orientedmodels may produce different results, the primary intent here was to demonstrate the
types of analyses possible and how they can be useful to improvement activities at
department, hospital, or national levels. While not explored here, in a similar manner
DEA also can be used to benchmark the performance of individual providers, such
as cardiac surgeons (Chilingerian, 1995). Taking a different viewpoint, Feng and Antony
(2008) described using the DMAIC process to execute a DEA study, with each activity in
the analysis mapping to one of the lettered steps (Define inputs and outputs, Measure
their values, Analyse DEA results, Improve by benchmarking reference DMUs, and
Control by measuring efficiency over time).
ReferencesAksezer, C. and Benneyan, J.C. (2003) Handling missing values in data envelopment analysis,
QPL-2003-02 Research Paper, Northeastern University.
Alan, W. (2001) Science or marketing at WHO, a commentary on World Health 2000, HealthEconomics, Vol. 10, pp.93100.
Amemiya, T. (1985)Advanced Econometrics, Cambridge: Harvard University Press.
Benneyan, J.C., Ceyhan, M.E. and Sunnetci, A. (2007) Data envelopment analysis of nationalhealthcare systems and their relative efficiencies, The 37th International Conference onComputers and Industrial Engineering, pp.251261.
Benneyan, J.C. and Sunnetci, A. (2008) Handling Proportional and Bounded Values in DataEnvelopment Analysis, in review.
Benneyan, J.C., Sunnetci, A. and Aksezer, C. (2006) Identifying healthcares Toyotas: productionfrontiers, hospital quality indices, and modeling issues, International Computers in Industrial
Engineering Conference, p.3596.Burstin, H.R., Conn, A., Setnik, G., Rucker, D.W., Cleary, P.D., ONeil, A.C., Orav, E.J.,
Sox, C.M. and Brennan, T.A. (1999) Benchmarking and quality improvement: the Harvardemergency department quality study,American Journal of Medicine, Vol. 107, pp.437449.
Cantor, J.C., Schoen, C., Belloff, D., How, S.K.H. and McCarthy, D. (2007) Aiming Higher:Results from a State Scorecard on Health System Performance, The Commonwealth FundCommission on a High Performance Health System.
Ceyhan, M.E. and Benneyan, J.C. (2008) Data envelopment estimates for the most efficientnational healthcare systems given uncertain proportional rate inputs, IIE IndustrialEngineering Research Conference, pp.17601765.
Charnes, A., Cooper, W.W. and Rhodes, A. (1981) Evaluating program and managerial efficiency:an application of data envelopment analysis to program follow through, ManagementScience, Vol. 27, pp.668697.
Chenoweth, J. (2003) Benchmarking could save hospitals billions: mortality, complications couldbe reduced,Healthcare Benchmarks and Quality Improvement, p.10.
Chilingerian, J.A. (1995) Evaluating physician efficiency in hospitals: a multivariate analysis,European Journal of Operational Research, Vol. 80, No. 3, pp.548574.
Collins-Fulea, C., Mohr, J.J. and Tillett, J. (2005) Improving midwifery practice: the Americancollege of nurse-midwives benchmarking project, Journal of Midwifery & Womens Health,Vol. 50, pp.461471.
Cooper, W., Seiford, L. and Tone, K. (2000) Data Envelopment Analysis: A Comprehensive Textwith Models, Applications, References, Kluwer.
-
7/28/2019 Six Sigma Healthcare DEA Final Paper
27/27
DEA models for identifying and benchmarking the best healthcare processes 331
Doyle, J. and Green, R. (1994) Efficiency and cross-efficiency in DEA: derivations, meanings and
uses,Journal of the Operational Research Society, Vol. 45, No. 5, pp.567578.
Feng, Q. and Antony, J. (2008) Integrating data envelopment analysis into Six Sigma methodologyfor measuring physician productivity,IIE Industrial Engineering Research Conference.
Jamison, D.T. and Sandbu, M.E. (2001) Global health: WHO ranking of health systemperformance, Science, Vol. 293, pp.15951596.
McFarlane, E., Murphy, J., Olmsted, M.G., Drozd, E.M. and Hill, C. (2007) Americas besthospitals 2007 methodology, U.S. News & World Report.
McNair, C.J. and Leibfried, K.H.J. (1992) Benchmarking: A Tool for Continuous Improvement,John Wiley and Sons, Inc.
Medina-Borja, A., Pasupathy, K.S. and Triantis, K. (2007) Large-scale Data EnvelopmentAnalysis (DEA) implementation: a strategic performance management approach, Journal ofthe Operational Research Society, Vol. 58, pp.10841098.
Musgrove, P., Creese, A., Preker, A., Baeza, C., Anell, A. and Prentice, T. (2000) The World
Health Report 2000 Health Systems: Improving Performance, World Health Organization.
Sexton, T., Silkman, R. and Hogan, A. (1986) Data envelopment analysis: critique andextensions, in R. Silkman (Ed.) Measuring Efficiency: An Assessment of Data EnvelopmentAnalysis, San Francisco: Jossey Bass.
Starfield, B. (2000) Is US health really the best in the world?, Journal of American MedicalAssociation, Vol. 284, pp.483500.
Stevenson, W.J. (1998) Production and Operations Management, McGraw-Hill.
Sunnetci, A. and Benneyan, J.C. (2008) Weight restricted DEA models to identify the best U.S.hospitals,IIE Industrial Engineering Research Conference, pp.17481753.
Note
1 www.who.int/en/