modelling correlated clinical outcomes in health technology appraisal

7
ECONOMIC EVALUATION Modelling Correlated Clinical Outcomes in Health Technology Appraisal David Epstein, PhD 1,2, *, Alex Sutton, PhD 3 1 Centre for Health Economics, University of York, York, United Kingdom; 2 University of Granada, Granada, Spain; 3 University of Leicester, Leicester, United Kingdom ABSTRACT Objectives: Many clinical treatments have multiple effects that can only be effectively captured on multiple outcome scales. It might be important to understand how these outcomes are correlated to evalu- ate the effectiveness and cost-effectiveness of treatments in decision models. Methods: The probabilities are estimated that both, one, or neither outcome occurs, given estimates of the marginal probability for each outcome and information about the correlation between them. Methods are shown for different measures of association. Lower and upper bounds for the correlation coefficient are calculated for given values of the marginal probabilities. The approach is illustrated using a simplified decision model based on a recent evaluation of adalimumab, a biologic drug for psoriatic arthritis. Results: Assuming the outcomes are positively correlated, the probability of both a skin and arthritis response after adalimumab was estimated to be 0.387 (95% confidence interval 0.210 – 0.570). The incremental cost-effectiveness ratio (ICER) of adalimumab versus no biologic is £18,500 per quality-adjusted life-year (QALY). The ICER increases to £19,500 per QALY if the responses are independent. Conclusion: Estimates of ICERs can be sensitive to as- sumptions about how multiple outcomes are correlated. These as- sumptions should be explored in univariate and probabilistic sensitiv- ity analyses. Keywords: correlation, decision model, meta-analysis, outcome. Copyright © 2011, International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. Introduction Many clinical treatments have multiple effects that can only be effectively captured on multiple outcome scales. For example, bi- ologic treatments for psoriatic arthritis aim both to alleviate symptoms of psoriasis and to slow progression of joint disease. Adjuvant chemotherapy may aim to prevent recurrence of cancer but have side effects. Varicose vein surgery aims to occlude the great saphenous vein and improve the appearance or symptoms of surface varicosities. It might be important to understand how these outcomes are correlated to evaluate the overall effective- ness and cost-effectiveness of these treatments. Further clinical interventions, costs, and health-related quality of life may depend on whether patients have neither, one, or both outcomes. This article is concerned with estimating the contingent probabilities (the probability of both, one, or neither outcome occurring) for two outcomes, where each is measured on a dichotomous scale. These data can be summarized by a 2 2 table of outcomes for each treatment, as in Table 1. It is anticipated that these data will be used as parameter inputs in a decision model. Although two bi- nary outcomes are a special case, it is one that is encountered quite frequently in health technology appraisal (HTA). For exam- ple, the responses to biologic treatment for psoriatic arthritis are commonly measured using the Psoriatic Arthritis Response Crite- ria (PsARC) for joint disease and the Psoriasis Area Severity Index (PASI) 75 for skin disease [1]. Syntheses of clinical effectiveness are usually performed as a univariate meta-analysis on each outcome. Statistical methods for the meta-analysis of multiple outcomes do exist and may give improved precision in terms of univariate or marginal inferences when complete data on both outcomes are available, including the correlation between the treatment effects, which is rarely re- ported in clinical studies [2– 4]. But the key interest from meta- analyses of multiple outcomes is usually marginal treatment ef- fects and their associated uncertainty (perhaps with a joint confidence interval [CI]), rather than making inferences about the contingent probabilities [2]. Perhaps for lack of suitable data, lack of appropriate analytical methods, or, simply for convenience, some decision models omit potentially important outcomes or correlations, for example, ignoring response to the skin condition in psoriatic arthritis [5] or assuming multiple outcomes are inde- pendent [6]. The aim of this article is to demonstrate a relatively simple and pragmatic method for estimating the expected contingent proba- bilities of outcomes (the internal cell probabilities of a 2 2 table) when only the marginal probabilities for each outcome in each treatment are available, together with some measure of the asso- ciation between the outcomes. It might be noted that these con- tingent probabilities could be estimated relatively straightfor- wardly if individual patient data (IPD) were available from every study [7]. In the absence of IPD, the expected marginal probabili- ties for each outcome in each treatment (together with estimates * Address correspondence to: David Epstein, Centre for Health Economics, University of York, Heslington, York YO10 5DD, United Kingdom. E-mail: [email protected]. 1098-3015/$36.00 – see front matter Copyright © 2011, International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. doi:10.1016/j.jval.2011.04.007 VALUE IN HEALTH 14 (2011) 793–799 Available online at www.sciencedirect.com journal homepage: www.elsevier.com/locate/jval

Upload: david-epstein

Post on 29-Nov-2016

213 views

Category:

Documents


1 download

TRANSCRIPT

tunqpcr(

V A L U E I N H E A L T H 1 4 ( 2 0 1 1 ) 7 9 3 – 7 9 9

Avai lable onl ine at www.sc iencedirect .com

journal homepage: www.elsevier .com/ locate / jva l

ECONOMIC EVALUATION

Modelling Correlated Clinical Outcomes in Health Technology AppraisalDavid Epstein, PhD1,2,*, Alex Sutton, PhD3

1Centre for Health Economics, University of York, York, United Kingdom; 2University of Granada, Granada, Spain; 3University of Leicester, Leicester, United Kingdom

A B S T R A C T

aria(issiK

C

Objectives: Many clinical treatments have multiple effects that canonly be effectively captured on multiple outcome scales. It might beimportant to understand how these outcomes are correlated to evalu-ate the effectiveness and cost-effectiveness of treatments in decisionmodels. Methods: The probabilities are estimated that both, one, orneither outcome occurs, given estimates of the marginal probability foreach outcome and information about the correlation between them.Methods are shown for different measures of association. Lower andupper bounds for the correlation coefficient are calculated for givenvalues of the marginal probabilities. The approach is illustrated using asimplified decision model based on a recent evaluation of adalimumab,

a biologic drug for psoriatic arthritis. Results: Assuming the outcomes O

ufiwcpafccoscip

pbwtctwst

Econ

al So

doi:10.1016/j.jval.2011.04.007

re positively correlated, the probability of both a skin and arthritisesponse after adalimumab was estimated to be 0.387 (95% confidencenterval 0.210–0.570). The incremental cost-effectiveness ratio (ICER) ofdalimumab versus no biologic is £18,500 per quality-adjusted life-yearQALY). The ICER increases to £19,500 per QALY if the responses arendependent. Conclusion: Estimates of ICERs can be sensitive to as-umptions about how multiple outcomes are correlated. These as-umptions should be explored in univariate and probabilistic sensitiv-ty analyses.eywords: correlation, decision model, meta-analysis, outcome.

opyright © 2011, International Society for Pharmacoeconomics and

utcomes Research (ISPOR). Published by Elsevier Inc.

Introduction

Many clinical treatments have multiple effects that can only beeffectively captured on multiple outcome scales. For example, bi-ologic treatments for psoriatic arthritis aim both to alleviatesymptoms of psoriasis and to slow progression of joint disease.Adjuvant chemotherapy may aim to prevent recurrence of cancerbut have side effects. Varicose vein surgery aims to occlude thegreat saphenous vein and improve the appearance or symptomsof surface varicosities. It might be important to understand howthese outcomes are correlated to evaluate the overall effective-ness and cost-effectiveness of these treatments. Further clinicalinterventions, costs, and health-related quality of life may dependon whether patients have neither, one, or both outcomes. Thisarticle is concerned with estimating the contingent probabilities(the probability of both, one, or neither outcome occurring) for twooutcomes, where each is measured on a dichotomous scale. Thesedata can be summarized by a 2 � 2 table of outcomes for eachreatment, as in Table 1. It is anticipated that these data will besed as parameter inputs in a decision model. Although two bi-ary outcomes are a special case, it is one that is encountereduite frequently in health technology appraisal (HTA). For exam-le, the responses to biologic treatment for psoriatic arthritis areommonly measured using the Psoriatic Arthritis Response Crite-ia (PsARC) for joint disease and the Psoriasis Area Severity IndexPASI) 75 for skin disease [1].

* Address correspondence to: David Epstein, Centre for HealthKingdom.

E-mail: [email protected]/$36.00 – see front matter Copyright © 2011, Internation

Published by Elsevier Inc.

Syntheses of clinical effectiveness are usually performed as anivariate meta-analysis on each outcome. Statistical methodsor the meta-analysis of multiple outcomes do exist and may givemproved precision in terms of univariate or marginal inferenceshen complete data on both outcomes are available, including the

orrelation between the treatment effects, which is rarely re-orted in clinical studies [2–4]. But the key interest from meta-nalyses of multiple outcomes is usually marginal treatment ef-ects and their associated uncertainty (perhaps with a jointonfidence interval [CI]), rather than making inferences about theontingent probabilities [2]. Perhaps for lack of suitable data, lackf appropriate analytical methods, or, simply for convenience,ome decision models omit potentially important outcomes ororrelations, for example, ignoring response to the skin conditionn psoriatic arthritis [5] or assuming multiple outcomes are inde-endent [6].

The aim of this article is to demonstrate a relatively simple andragmatic method for estimating the expected contingent proba-ilities of outcomes (the internal cell probabilities of a 2 � 2 table)hen only the marginal probabilities for each outcome in each

reatment are available, together with some measure of the asso-iation between the outcomes. It might be noted that these con-ingent probabilities could be estimated relatively straightfor-ardly if individual patient data (IPD) were available from every

tudy [7]. In the absence of IPD, the expected marginal probabili-ies for each outcome in each treatment (together with estimates

omics, University of York, Heslington, York YO10 5DD, United

ciety for Pharmacoeconomics and Outcomes Research (ISPOR).

tYcru

eloodmispwpat

Ys1wow

ctqo([v

0sP

uvtoFpvgtvtcm

fit

w

794 V A L U E I N H E A L T H 1 4 ( 2 0 1 1 ) 7 9 3 – 7 9 9

of uncertainty) might be obtained from any appropriate source, forexample, a single study or estimated from univariate meta-anal-yses from several randomized, controlled trials (RCTs). An esti-mate of the correlation between these two outcomes in each treat-ment is also required. Ideally, these correlation coefficients wouldbe reported by randomized clinical studies, but this is rarely thecase, and the estimate might be obtained from other sources orexpert opinion.

There are several alternative measures of association for 2 � 2ables, for example, the tetrachoric, the phi coefficient, Yule’s Q, andule’s Y, among others. This article discusses the properties of theseoefficients and discusses how each might be used to deal with cor-elated outcomes in a decision model. The approaches are illustratedsing an example from a recent health technology appraisal [8].

The method used in this article is a simple application from anxtensive literature known as ecological inference. The aim of eco-ogical inference is to recover the internal cell counts or fractionsf 2 � 2 (or, in general, r � c) tables when the primary data reportnly the marginal frequencies, with at most only a few previousata to inform the correlation coefficient. This problem is akin toaking individual level inferences from ecological data. Ecological

nference methods have been applied to many problems in theocial sciences and epidemiology; Wakefield [9] provides a com-rehensive review. This literature, however, appears not to beell-known among practitioners of HTA, perhaps because its ap-lication is at the interface between meta-analysis and decisionnalysis. As these functions are often undertaken by differentechnical specialists, possibly neither has recognized its potential.

Methods

Measures of association

The purpose of this article is to establish a method for recoveringthe internal cell probabilities of a 2 � 2 table given aggregate dataon the marginal probabilities and some estimate of the associationbetween the variables. If there are two binary outcomes (X and Y),

Table 1 – Bivariate proportions table for binaryvariables.

Variable Y

Value 1 Value 2 Total

Value 1 a b PxVariable X Value 2 c d Qx

Total Py Qy 1

Table 2 – Frequencies (and fraction) of participants with arcontrolled trial at 12 weeks for patients in the adalimumableast 3% body skin area affected by psoriasis at baseline [1

Adalimumab

Skin responseY � 1

No skin respoY � 0

Arthritis response, X � 1 29 (0.45) 14 (0.21)No arthritis response, X � 0 5 (0.08) 18 (0.27)Total 34 (0.53) 32 (0.47)

Arthritis response measured using the Psoriatic Arthritis ResponseSeverity Index 75. The estimate of the odds ratio �x comparing the mthis trial is (0.66/0.34)/(0.25/0.75) � 5.8. The estimate of the odds ratio

adalimumab with placebo in this trial is (0.53/0.47)/(0.04/0.96) � 27.1.

then individuals can be in one of four states: X only, Y only, X and, or neither. The proportion of individuals in each of the fourtates can be summarized in a 2 � 2 table for each treatment (Table). Variable a represents the expected probability that both X and Yill occur, d represents the probability that neither X nor Y willccur, and b and c represent the probabilities that only X or only Yill occur.

For example, Table 2 shows the observed outcomes of an RCTomparing adalimumab versus placebo for psoriatic arthritis. Inhis case, full information was available on the internal cell fre-uencies as well as the total (or marginal) frequencies of eachutcome [10]. Outcome X is a binary measure of arthritis response

PsARC) and Y is a binary measure of psoriasis response (PASI 75)1].The marginal probabilities of success at 12 weeks in the inter-ention (adalimumab) group in the trial are represented as Pr1(x �

1) � Px for PsARC and Pr1(y � 1) � Py for PASI 75. In this case, theestimate of Pr1(x � 1) � 0.21 � 0.45 � 0.66 and Pr1(y � 1) � 0.08 �

.45 � 0.53. Similarly, the estimates of the marginal probabilities ofuccess at 12 weeks in the placebo group are Pr0(x � 1) � 0.25 forsARC and Pr0(y � 1) � 0.04 for PASI 75.

There are many possible measures of association that could besed to measure the strength of a relationship between two binaryariables. Warrens [11] provides an overview. For the purposes ofhis article, a suitable coefficient of association for handling binaryutcomes in a decision model should satisfy two essential criteria.irst, the measure should be defined using the four internal cellrobabilities a, b, c, and d presented in Table 1. This ensures thatalues of a, b, c, and d can be recovered by back-transformationiven estimates of the expected marginal probabilities and an es-imate of the association. Second, the coefficient should have zeroalue under statistical independence [12,13]. The following sec-ions discuss the properties of the phi coefficient, the tetrachoricoefficient, Yule’s Q, and Yule’s Y and their potential usefulness toodel correlated outcomes.

The phi coefficient

The Pearson correlation coefficient for outcomes X and Y is de-ned as the covariance of the variables divided by the product ofheir standard deviations:

� covx,y ⁄ �x�y (1)

If X and Y are binary, the measure is known as the phi coefficientand can be expressed as (see Appendix 1, found at doi:10.1016.j.val.2011.04.007) [14]:

� � �a � PxPy� ⁄ ��PxPyQxQy� (2)

here Px � a � b, Qx � 1 � Px, Py � a � c, and Qy � 1 � Py.

is and skin responses in the ADEPT randomizedup (n = 66) and placebo group (n = 69) for patients with at

Placebo

Total Skin responseY � 1

No skin responseY � 0

Total

43 (0.66) 1 (0.01) 17 (0.24) 18 (0.25)23 (0.34) 2 (0.03) 49 (0.72) 51 (0.75)n � 66 3 (0.04) 66 (0.96) n � 69

eria instrument; skin response measured using the Psoriasis Areal probability of arthritis response after adalimumab with placebo in

r the comparing the marginal probability of psoriasis response after

thritgro

0].

nse

Critargina

�y fo

g

0b

tP

(sbP

a

tattft

Reec

ce

uc

tiauja

tmttu

795V A L U E I N H E A L T H 1 4 ( 2 0 1 1 ) 7 9 3 – 7 9 9

The phi coefficient has zero value under statistical indepen-dence [11–13,15]. From the data in Table 2 in the adalimumabroup (treatment j � 1), the sample estimate of � is r1 � 0.436. The

standard error is SE�r1� � ���1�r12�⁄�N�2�� � 0.112 and the 95% CI is

0.211 to 0.661, P � 0.0015 [16]. In the placebo group (treatment j �

), r0 � 0.035 (95% CI �0.224 to 0.264, P � 0.87). The differenceetween r1 and r0 is significant (P � 0.01) [16]. There appears to be

a strong association between the responses in the treatmentgroup and a lack of association in the placebo group.

Estimating contingent probabilities from the marginalprobabilities and the phi coefficient

The previous section reviewed how the phi coefficient is calcu-lated for two binary outcomes. This section shows how to re-cover the internal cell values of a 2 � 2 table (the contingentprobabilities a, b, c, and d) if estimates of the expected marginalprobabilities of each outcome (Px and Py) and the phi correlationcoefficient were available. This is possible because, given themarginal probabilities, there is only 1 degree of freedom re-maining, and the value of the correlation coefficient then servesto determine the 2 � 2 table of probabilities in full. For example,Pr1(x � 1) might be calculated from an estimate of the odds ofhe outcome X in the population with treatment 0 [Pr0(x � 1)/r0(x � 0)] combined with an estimate log(�x) of the log odds

ratio for outcome X of treatment 1 versus treatment 0. The logodds ratio represents the incremental effect of treatment 1 ver-sus treatment 0 on the overall risk of outcome X in the popula-tion and is assumed to be independent of the correlation �j

between outcome X and outcome Y for treatment j � 0 or j � 1.It is assumed for now that an estimate rj of the phi coefficient�j) between outcomes X and Y for treatment j is available fromome source. Given these data, the expected probability thatoth outcomes will occur in the population after treatment j, a �

rj(x � 1, y � 1), can be calculated, conditional on the marginals.Rearranging Equation 2, given fixed r, Px, and Py:

� r��PxPyQxQy� � PxPy (3)

The expected probabilities b, c, and d of the other contingentoutcomes can then easily be determined.

Although the phi coefficient has the two essential propertiesoutlined previously, it has the inconvenient property that itsmaximum and minimum values are not independent of themarginal probabilities, that is, the bounds are attenuated [11].Furthermore, even if one outcome always occurs with the otherthen the value of � can be less than 1, and it can be greater than�1 even if the outcomes never occur together (see Appendix 1,found at doi:10.1016.j.val.2011.04.007).

The phi coefficient is a widely known and used measure ofassociation in the medical literature. If the phi coefficient has beenreported by one or more clinical studies and is to be used in adecision model, then these attenuated bounds must also be incor-porated in the decision model and corresponding probabilisticsensitivity analysis. The article demonstrates how to calculatethese logical limits on the magnitude of the phi coefficient, giventhe expected marginal probabilities.

Bounds on the phi coefficient

If the row and column totals of a 2 � 2 table are known, there arelogical bounds on the four cell values within the table [17]. Giventhat Px � a � b, and probabilities take values between 0 and 1, thenhe probability of any internal cell can be no greater than the prob-bility of the row or column margin. These inequalities imply thathere are also bounds on the values that the sample estimate r ofhe phi coefficient can take that are more restrictive than �1 to �1,or given values of the marginal probabilities for each outcome for

reatment j. The lower limit lj and upper limit uj for r are (see a

Appendix 2, found at doi:10.1016.j.val.2011.04.007):

Max����odds�X�odds�Y��; � 1 ⁄ ��odds�X�odds�Y���r

Min���odds�X� ⁄ odds�Y��;��odds�Y� ⁄ odds�X���where odds�X� � Px ⁄ Qx and odds�Y� � Py ⁄ Qy.

(4)

Equation 4 does not tell us how likely it is that the mean sampleestimate rj of the phi coefficient lies at the extremes or near thecenter of this range. This uncertainty is represented by the CI of rj.

ather, it represents the logical bounds within which the phi co-fficient must lie if its value is to be consistent with previouslystimated mean values of the marginal probabilities of each out-ome. Equation 4 shows that the logical range of �j will always

cross zero (independence [13]) for all values of the marginal prob-abilities for any j. This is an example of the ecological fallacy: it isnever possible to rule out independence, negative or positive cor-relation between two variables from aggregate (marginal) dataalone [9].

The maximum and minimum logical values for a � Prj(x � 1,y � 1) and the other internal cell probabilities for treatment j can bealculated and used in sensitivity analyses. Given values for thexpected marginal probabilities Prj(x � 1) and Prj(y � 1), the lower

limit lj and upper limit uj on the correlation coefficient rj can becalculated from Equation 4 for treatment j. These values can thenbe substituted for rj in Equation 3 to calculate the corresponding

pper and lower limits of a � Prj(x � 1, y � 1) for treatment j,onditional on the marginal probabilities.

Probabilistic sensitivity analysisWhere the phi coefficient is used to model the dependency be-tween the outcomes, the bounds for phi depend on the expectedmarginal probabilities. These parameters, however, are sampleestimates, not known values, and this uncertainty is reflected inthe model through Monte Carlo (MC) simulations. The MC algo-rithm calculates the bounds for phi in each simulation and checksthat the sampled value for the phi coefficient is within thesebounds. In each MC iteration, a sample value rj of the phi coeffi-cient (assuming a normal distribution) and sample values of themarginal probabilities of each outcome in treatment j are selected.The bounds [lj, uj ] for the phi coefficient are then calculated, givenhe marginal probabilities. If the sampled rj is outside [lj,uj ], thents value is replaced by the bound. This approach to model a vari-ble whose distribution has finite bounds is analogous to onesed by Warn et al. [18]. Appendix 3, found at doi:10.1016/

.val.2011.04.007, provides a more formal description of the MClgorithm.

Other measures of association

The phi coefficient is well-known but is only one possible measure ofassociation that could be used. Table 3 compares the properties ofsome alternative measures. This section discusses the tetrachoric,Yule’s Q, and Yule’s Y. The tetrachoric correlation assumes the ob-served 2 � 2 table is a dichotomization of an underlying bivariatenormal distribution [11,15]. However, even if this assumption wereconsidered valid in a given data set, the tetrachoric coefficient is es-timated by maximum likelihood methods [15]. Its expected valuecannot be expressed as a function of probabilities a, b, c, and d (as canhe other measures discussed in this article). Therefore, given esti-

ates of the expected marginal probabilities and an estimate of theetrachoric correlation, it is not possible to back-transform to recoverhese internal cell probabilities. Consequently, this measure is notseful for the purposes of constructing a decision model using the

pproach described in this article and is not discussed further.

t

w

t

able

3–

Su

mm

ary

ofth

ep

rop

erti

esof

sele

cted

mea

sure

sof

asso

ciat

ion

for

2ta

bles

.

easu

reof

asso

ciat

ion

Cal

cula

tion

ofm

ean

valu

eV

alu

eu

nd

erst

atis

tica

lin

dep

end

ence

Min

imu

mva

lue

Max

imu

mva

lue

Esti

mat

edm

ean

(95%

CI)

inth

ead

alim

imab

grou

pfr

omth

ed

ata

inT

able

2

Esti

mat

edm

ean

(95%

CI)

inth

ep

lace

bogr

oup

from

the

dat

ain

Tab

le2

dd

sra

tio

ad/b

c1

0In

fin

ity

7.45

(2.3

0–24

.23)

1.44

(0.1

2–16

.92)

hi

��

(a–P

xP y

)/✓

(PxP y

QxQ

y)

0D

epen

den

ton

mar

gin

alp

roba

bili

ties

Dep

end

ent

onm

argi

nal

pro

babi

liti

es0.

436

(0.2

11–0

.661

)0.

035

(�0.

224

to0.

264)

etra

chor

ic*

Max

imu

mli

keli

hoo

des

tim

atio

n0

�1

10.

652

(0.3

82–0

.923

)0.

100

(�0.

599

to0.

797)

ule

’sQ

†(a

d�

bc)/

(ad

�bc

)0

�1

10.

745

(0.2

40–0

.943

)0.

181

(�0.

724

to0.

857)

ule

’sY

†(a

d1/2

�bc

1/2

)/(a

d1/2

�bc

1/2

)0

�1

10.

458

(0.1

22–0

.708

)0.

091

(�0.

428

to0.

565)

I,co

nfi

den

cein

terv

al.

Cal

cula

ted

usi

ng

the

pro

gram

Tco

rr[1

4].

Th

eco

nfi

den

cein

terv

als

for

Qan

dY

wer

ees

tim

ated

usi

ng

the

met

hod

inB

onet

tan

dPr

ice

[26]

.

796 V A L U E I N H E A L T H 1 4 ( 2 0 1 1 ) 7 9 3 – 7 9 9

Yule’s Q is defined as:

Q � �ad � bc� ⁄ �ad � bc� (5)

Like the phi coefficient, its value is zero if X and Y are statisticallyindependent. Unlike the phi coefficient, the maximum and mini-mum values of Yule’s Q are independent of the marginal probabili-ties, that is, they are always unity and negative unity, respectively.Given fixed marginals, however, Yule’s Q is a quadratic function of a.This can be seen by substituting b � Px � a, c � Py � a, and d � 1 � Px

� Py � a in Equation 5. This implies that there may be two solutionshat solve for probability a, given fixed values of Px, Py and an estimate

r of Yule’s Q. The two solutions for a are

a � ��B � ��B2 � 4AC� � ⁄ �2A�or a � ��B � ��B2 � 4AC� ⁄ �2A��(6)

here A � 2r; B � �1 � r � 2rPx � 2rPy; C � PxPy � rPxPy.Only one of these solutions is correct (i.e., gives estimates of prob-

abilities a, b, c, and d between 0 and 1); the correct solution is the onehat satisfies the inequalities 0 a Px and 0 a Py. Yule’s Y (see

Table 3 for definition) also gives a tractable, if even more compli-cated, solution for a. Therefore, Yule’s Q and Yule’s Y appear to beappropriate measures of association between clinical outcomes andeither could in principle be incorporated in a decision model. Con-ceptually, Yule’s Q and Yule’s Y are considered approximations tothe tetrachoric coefficient, whereas the phi coefficient is what thePearson coefficient becomes when the variables are truly binary [11].The fact that the bounds of Yule’s Q and Yule’s Y are independentof the marginal probabilities might be considered an advantage overthe phi coefficient; however, Yule’s Q and Yule’s Y seem to be rarelyused in the clinical trials literature.

Illustrative example of a decision model

A simple decision model was constructed to illustrate the impactthat different assumptions about correlation between outcomescould have on estimates of costs and quality-adjusted life-years(QALYs), and the potential bias associated with assuming the out-comes are independent. The model compares the cost-effective-ness of adalimumab (a biologic therapy) versus no biologic therapyfor psoriatic arthritis [8]. The model structure is shown in Figure 1.The patient is assessed after 12 weeks of biologic therapy tomeasure the response to skin disease (using the PASI 75) andarthritis (using the PsARC). Therefore, at 12 weeks, the patientcan have responded to both, one, or neither of these compo-nents. Current guidelines recommend that biologic therapy isdiscontinued in patients whose arthritis fails to respond by thistime [1]. The model can consider alternative rules, such as dis-continuing biologic therapy only in patients who fail to respondto both PsARC and PASI 75 [19]. Illustrative assumptions aremade about the lifetime discounted costs (relative to no biologictherapy) and QALYs (relative to no biologic therapy) associatedwith each outcome at 12 weeks (Table 4). This simplified modelis not intended to support any real-life decision concerningthese therapies. A complete version of the model is presented inRodgers et al. [8].

Two RCTs compared adalimumab to placebo for skin re-sponse and arthritis response [10,20]. Univariate meta-analyseswere used to estimate the relative treatment effects and mar-ginal probabilities of each outcome for the two treatments [8](Table 5). Contingent data (Table 2) were only available from oneof these two trials [10], and it was assumed in the base-casemodel that the phi correlation coefficient was 0.436 (95% CI0.211– 0.661) for adalimumab and that outcomes were indepen-

dent for no biologic therapy.

T M O p T Y Y C * †

797V A L U E I N H E A L T H 1 4 ( 2 0 1 1 ) 7 9 3 – 7 9 9

Results

Deterministic analysesGiven the estimates from the literature of the marginal probabili-ties for each outcome and a positive association between the out-comes with adalimumab (a phi coefficient of 0.436) (Table 5), thepredicted probability (calculated by Equation 3) of a response toboth skin and arthritis is 0.387 with adalimumab and 0.011 for nobiologic therapy (Table 6). The corresponding incremental lifetimemean total cost per patient with adalimumab versus no biologic is£25,236, the incremental QALYs are 1.368, and the incrementalcost-effectiveness ratio (ICER) is £18,453 per QALY (Table 6). Iden-tical results would be obtained in the decision model if the mea-

Fig. 1 – Structure of the illustrative model. Costs andquality-adjusted life-years of each outcome are shownrelative to those of no biologic therapy. Cost, lifetimehealth service costs arising from each outcome at 12weeks; PASI, Psoriasis Area Severity Index score,measured at 12 weeks; PsARC, Psoriatic Arthritis ResponseCriteria, measured at 12 weeks; QALY, Lifetime quality-adjusted life-years arising from each outcome at 12 weeks.

Table 4 – Incremental lifetime cost and QALY associated wcompared to never starting biologic therapy.

State at 12 wk Incremental cost, £

Patient responds to both skin andarthritis and is maintained on biologictherapy

40,000

Patient responds to arthritis only and ismaintained on biologic therapy

44,000

Patient responds to skin only and iswithdrawn from biologic therapy

2317

Patient does not respond to eitheroutcome and is withdrawn frombiologic therapy

2317

(Sensitivity analysis) Patient responds toskin only and is maintained onbiologic therapy

50,000

The values are illustrative only.

sure of association between outcomes for adalimumab wereYule’s Q with a mean value of 0.749.

Univariate sensitivity analysesIf it were assumed that the responses were independent foradalimumab and the marginal probabilities of each outcomewere unchanged, then the predicted probability of a responsewith adalimumab to both skin and arthritis is 0.280 (Table 6).The incremental cost increases to £25,665 and the incrementalQALYs decrease to 1.310, and the ICER increases to £19,532 perQALY (Table 6).

Given the marginal probabilities, the lower and upper boundson the phi correlation between the outcomes (calculated by Equa-tion 4) are �0.878 r1 0.801 for adalimumab and �0.124 r0

0.373 for placebo.Given the expected values of the marginal probabilities, the

lower limit of the phi coefficient would be �0.878 and the cor-responding probability of both a skin and arthritis response at12 weeks with adalimumab would be 0.064 (Table 6). In thiscase, the ICER would be £21,997 per QALY (Table 6). The upperlimit of the phi coefficient would be 0.801, and the correspond-ing probability of both responses at 12 weeks with adalimumabwould be 0.477. The ICER in this case would be £17,612 per QALY.

The model structure allows alternative continuation rules forbiologic therapy to be examined. Patients could be continued onbiologic therapy if they respond to either skin or arthritis at 12weeks [19]. In this case, biologic therapy would be more effectiveon average but more expensive, and the ICER (assuming a corre-lation of 0.436 between the outcomes after 12 weeks of adali-mumab) is £20,857 per QALY (Table 6). Varying the correlationcoefficient in this scenario has a greater impact on the ICER thanthe base-case continuation rule. If the outcomes were indepen-dent in this scenario, then the ICER would increase to £24,820 perQALY.

Probabilistic sensitivity analysisProbabilistic sensitivity analysis using the base-case parameters(Tables 2 and 3) estimated the mean (95% CI) incremental cost was£25,148 (15002–33342) and incremental QALY was 1.358 (0.801–1.810). The probability that adalimumab was cost-effective versusno biologic was 0.95 at a threshold of £20,000 per QALY and 1.00 ata threshold of £30,000 per QALY.

ach model state after 12 weeks with adalimumab

Incremental QALY Comment

2.5 Mean lifetime cost of biologic therapy(£54,000) less health service costsavings of £14,000

2.0 Mean lifetime cost of biologic therapy(£54,000) less health service costsavings of £10,000

0 Cost of biologic therapy for 12 weeks

0 Cost of biologic therapy for 12 weeks

0.5 Mean lifetime cost of biologic therapy(£54,000) less health service costsavings of £4000

ith e

km

798 V A L U E I N H E A L T H 1 4 ( 2 0 1 1 ) 7 9 3 – 7 9 9

Conclusion

This article has shown a method for predicting the contingentprobabilities of both, one, or neither outcome occurring given thevalues of the marginal probabilities of each outcome and someestimate of the correlation between the outcomes. The method islimited to 2 � 2 tables, but there are many decision problems inhealth technology assessment that fall into this class.

The examples demonstrate that multivariate predictions ofboth, one, or neither outcome occurring can be sensitive to as-sumptions about the correlation between outcomes. In the exam-ple for adalimumab, the probability of achieving both a skin andan arthritis response is 0.280 if these outcomes are assumed to beindependent, but 0.387 if the outcomes are assumed to be corre-lated as observed from data from one of the RCTs.

The extent to which correlation between outcomes is importantfor predictions of long-term effectiveness and cost-effectiveness in adecision model depends on many other factors, such as whetherfurther treatment is offered or withdrawn depending on the initialoutcomes and the impact of each outcome on health-related quality

Table 5 – Data used for the illustrative decision problem: p

Parameter Valu

Odds ratio for skin response (adalimumab vs. placebo) 21Odds ratio for arthritis response (adalimumab vs. placebo) 4

Probability of skin response with placebo 0.Probability of arthritis response with placebo 0.

Probability of skin response with adalimumab 0.

Probability of arthritis response with adalimumab 0.

Phi correlation coefficient between arthritis and skinresponses with adalimumab

0.

Table 6 – Predicted contingent probabilities of skin and arteffectiveness of biologic therapy under different estimates

Expected probabilities of outcomes 12 weeks after adalimumabResponse to bothArthritis response onlySkin response onlyNeither responseCost-effectiveness of adalimumab versus placebo, assuming

patients who do not respond for arthritis are withdrawnfrom biologic therapy at 12 weeks

Incremental costsIncremental QALYICERCost-effectiveness of adalimumab versus placebo, assuming

patients who do not respond for either skin or arthritis arewithdrawn from biologic therapy at 12 weeks

Incremental costsIncremental QALYICER

In the adalimumab group, the expected marginal probability of an ara skin response is fixed at 0.477. In the no biologic therapy group, thethe expected marginal probability of a skin response is fixed at 0.044.in the base-case model are shown in bold.

ICER, incremental cost-effectiveness ratio; QALY, quality-adjusted life-yea

of life and costs. The long-term implications of alternative assump-tions about the correlation can be explored in sensitivity analysis byvarying the parameter within its lower and upper bounds and usingthe corresponding contingent probabilities within a decision analyticmodel. An illustrative example showed that the ICER of adalimumabversus no biologic was about £18,500 per QALY if the responses werestrongly positively correlated. The ICER increases to about £19,500per QALY if the responses are independent. In this case, although thetotal proportion of patients achieving an arthritis response (andtherefore continuing on adalimumab) is the same as the base case,fewer patients will be expected obtain both a skin and arthritis re-sponse if the responses are assumed to be independent. If the re-sponses were strongly negatively correlated, the ICER could be morethan £20,000 per QALY. Although negative correlation of outcomes at12 weeks is logically possible given the estimated marginal probabil-ities of each outcome, it is probably clinically implausible in this case.

There are several measures of association available for outcomesin 2 � 2 tables. The phi coefficient is simple to calculate and widely

nown. However, the lower and upper bounds for the phi coefficientay be more restricted than [�1,1] given the values of the marginal

bilities of response at 12 weeks.

ean (95% CI) Source of data

.13–74.56) Mease et al., 2005 [10]

.67–6.15) Univariate meta-analysis: Mease et al., 2005 [10];Genovese et al., 2007 [20]

.028–0.065) Mean of placebo arm in Mease et al., 2005 [10]

.178–0.317) Weighted mean of placebo arms in Mease et al.,2005 [10], Genovese et al., 2007 [20]

.275–0.693) Derived from probability of placebo responseand odds ratio (Rodgers et al., 2011 [8])

.444–0.713) Derived from probability of placebo responseand pooled odds ratio (Rodgers et al., 2011 [8])

.211–0.661) Rodgers et al., 2011 [8]

s responses to a biologic drug and estimates of cost-e association between the responses.

Estimate of phi coefficient

�0.878 0 0.436 0.801

0.064 0.280 0.387 0.4770.523 0.307 0.200 0.1100.413 0.197 0.090 0.0000.000 0.216 0.323 0.413

£26,529 £25,665 £25,236 £24,8771.206 1.314 1.368 1.413

£21,997 £19,532 £18,453 £17,612

£46,222 £34,059 £29,517 £24,8771.413 1.413 1.413 1.413

£32,724 £24,820 £20,897 £17,612

s response is fixed at 0.587 and the expected marginal probability ofted marginal probability of an arthritis response is fixed at 0.249 andssumed responses are independent with no biologic therapy. Values

roba

e m

.37 (6

.06 (2

044 (0249 (0

477 (0

587 (0

436 (0

hritiof th

thritiexpecIt is a

r.

actameem

sctdtg

mcwf

[

799V A L U E I N H E A L T H 1 4 ( 2 0 1 1 ) 7 9 3 – 7 9 9

probabilities. Therefore, if the measure of association available to theanalyst is the phi coefficient, then these bounds must be taken intoaccount when conducting univariate and probabilistic sensitivityanalyses. The maximum and minimum values of other measures ofassociation such as Yule’s Q and Yule’s Y are independent of themarginal probabilities. These alternative measures, however, do notseem to be in common use in the medical literature.

It may be difficult to obtain any data to inform the correlationbetween outcomes for each treatment. In this case, the analyst mightneed to consider nonexperimental data sources and/or elicit expertopinion. Data on the correlation between outcomes may be availablefor some but not all treatments. This was the case in a recent evalu-ation of three biologic drugs, adalimumab, etanercept, and inflix-imab [8], where the phi coefficient was reported by one (adalimumab)trial [10]. These data were not in the original trial publication andwere specially made available by the manufacturers of the drug [8]. Inthis case, it was assumed that the correlation coefficient betweenskin and arthritis responses in the treatment arm of the adalimumabtrial applied to all three biologic drugs under evaluation. This as-sumption is difficult to validate without further data, and sensitivityanalyses were performed for different values. Alternatively, the cor-relation for each treatment might be obtained from expert opinion,although methods to elicit these parameters would need to be care-fully designed. One must be careful to distinguish between two dis-tinct types of correlation. Statistical methods for the meta-analysis ofmultiple outcomes require data (or assumptions) on the correlationsor covariances between the treatment effects (as opposed to univar-iate meta-analysis, which only requires an estimate of the mean andvariance of each treatment effect) [2,21–23]. In this article, rj refers adifferent concept: the estimate of the correlation between two out-comes or responses after a single treatment j.

The approach taken here is convenient and pragmatic. It has annalogy with solving a number puzzle like Sudoku. If the row andolumn totals of a 2 � 2 table are known, there are logical bounds onhe four cell values within the table. As the marginal probabilities aressumed fixed when calculating the internal cell probabilities, theethod is rather deterministic and should be considered a useful

xploratory measure [9,17]. Some test statistics such as the Fisherxact test use a similar device, by computing the P value as if theargins of the table were known exactly [24].

Because the marginals are estimated rather than known, thathould introduce more uncertainty into the estimates of the internalell probabilities and the bounds. Ideally, all the parameters relevanto the decision model would be estimated together to capture theependency among posterior correlations, variances, and meanshat are induced by their joint estimation from the data and propa-ate these uncertainties forward into the decision model [9,22,25].

Despite its limitations, the simple and pragmatic method per-its alternative assumptions about the correlation between out-

omes to be explored. Assuming that outcomes are independenthen in fact they are correlated may bias the estimate of the ef-

ectiveness and cost-effectiveness of a therapy.

Acknowledgments

We thank the following for comments, assistance, and data:Huquin Yang, Dawn Craig, Tiago Fonseca, Lindsey Myers, LauraBojke, Mark Rodgers, Nerys Woolacott, Marta Soares, and MarkSculpher of the University of York; Ian Bruce and Robert Chalmersof University of Manchester; Sylwia Bujkiewicz of the University ofLeicester and Abbott Laboratories, United Kingdom.

Source of financial support: This work has been supported byfunding from the NIHR HTA Programme on behalf of the NationalInstitute for Clinical Excellence as project number HTA 08/42/01

and from the Medical Research Council as MRC Award G0900770.

[

Supplemental Materials

Supplemental material accompanying this article can be found inthe online version as a hyperlink at doi:10.1016/j.val.2011.04.007,or if hard copy of article, at www.valueinhealthjournal.com/issues(select volume, issue, and article).

R E F E R E N C E S

[1] Kyle S, Chandler D, Griffiths CE, et al. Guideline for anti-TNF-alphatherapy in psoriatic arthritis. Rheumatology 2005;44:390–7.

[2] Riley R. Multivariate meta-analysis: the effect of ignoring within-studycorrelation. J R Stat Soc Series A 2009,172:789–812.

[3] Ishak K, Platt R, Joseph L, Hanley J. Impact of approximating orignoring within-study covariances in multivariate meta-analyses. StatMed 2008;27:670–86.

[4] Riley R, Thompson JR, Abrams KR. An alternative model for bivariaterandom-effects meta-analysis when the within-study correlations areunknown. Biostatistics 2008;9:172–86.

[5] Bravo-Vergel Y, Hawkins N, Claxton K, et al. The cost-effectiveness ofetanercept and infliximab for the treatment of patients with psoriaticarthritis. Rheumatology 2007;46:983–8.

[6] Burch J, Epstein D, Baba-Akbari A, et al. Stapled haemorrhoidectomy(haemorrhoidopexy) for the treatment of haemorrhoids: a systematicreview and economic evaluation. Health Technol Assess 2008;12:1–214.

[7] Manca A, Lambert PC, Sculpher M, Rice N. Cost-effectiveness analysisusing data from multinational trials: the use of bivariate hierarchicalmodelling. Med Decis Making 2007;27:471–90.

[8] Rodgers M, Epstein D, Bojke L, et al. Etanercept, infliximab andadalimumab for the treatment of psoriatic arthritis: a systematic reviewand economic evaluation. Health Technol Assess 2011;15:i–xii, 1–329.

[9] Wakefield J. Ecological inference for 2x2 tables. J R Statist Soc A 2004;167:385–445.

[10] Mease PJ, Gladman DD, Ritchlin CT, et al. Adalimumab for thetreatment of patients with moderately to severely active psoriaticarthritis: results of a double-blind, randomized, placebo-controlledtrial. Arthritis Rheum 2005;52:3279–89.

[11] Warrens MJ. On association coefficients for 2�2 tables and propertiesthat do not depend on the marginal distributions. Psychometrika2008;73:777–89.

[12] Krus DJ. Visual statistics with Multimedia. Cruise Scientific, 2003.Available from: http://www.visualstatistics.net/visual%20statistics%20multimedia/crosstabulation.htm. [Accessed April 2, 2011].

[13] Mari D, Kotz S. Correlation and Dependence. London: Imperial CollegePress, 2001.

[14] Udny Yule G. On the methods of measuring association between twoattributes, J R Stat Soc 1912;579–642.

[15] Uebersax JS. The tetrachoric and polychoric correlation coefficients.Statistical Methods for Rater Agreement web site. 2006. Available from:http://john-uebersax.com/stat/tetra.htm. [Accessed April 2, 2011].

[16] Fisher RA. Frequency distribution of the values of the correlationcoefficient in samples of an indefinitely large population. Biometrika(Biometrika Trust) 1915;10:507–21.

[17] Duncan O, Davis B. An alternative to ecological correlation. Am SociolRev 1953;18:665–6.

[18] Warn DE, Thompson SG, Spiegelhalter DJ. Bayesian random effects meta-analysis of trials with binary outcomes: methods for the absolute riskdifference and relative risk scales. Stat Med 2002;21:1601–23.

[19] Smith C, Anstey A, Barker J, et al. British Association ofDermatologists’ Guidelines for Biologic Interventions for Psoriasis. Br JDermatol 2009;161:987–1019.

[20] Genovese MC, Mease PJ, Thomson GT, et al. Safety and efficacy ofadalimumab in treatment of patients with psoriatic arthritis who hadfailed disease modifying antirheumatic drug therapy. J Rheumatol2007;34:1040–50.

[21] Abrams KR, Gillies CL, Lambert PC. Meta-analysis of heterogenouslyreported trials assessing change in baseline. Stat Med 2005;24:3823–44.

[22] Ades A, Lu G. Correlations between parameters in risk models:estimation and propagation of uncertainty by Markov Chain MonteCarlo. Risk Analysis 2003;23:1165–72.

[23] Berrington A, Cox D. Generalized least squares for the synthesis ofcorrelated information. Biostatistics 2003;4:423–31.

[24] Fisher RA. On the interpretation of 2 from contingency tables, andthe calculation of P. J R Stat Soc 1922;85:87–94.

25] Sutton AJ, Kendrick D, Coupland CAC. Meta-analysis of individual-and aggregate-level data. Stat Med 2008;27:651–69.

26] Bonett DG, Price RM. Statistical inference for generalized Yulecoefficients in 2x2 contingency tables. Soc Meth Res 2007;35:429–46.