onanalytical#similarityassessment# in#biosimilar …...potentialapproach could be a similar approach...

1
ANALYTICAL SIMILARITY ASSESSMENT For analytical similarity assessment of CQAs, FDA suggests that CQAs that are relevant to clinical outcomes be identified and classified into three tiers according to their criticality or risking ranking (e.g., most, mild to moderate, and least) relevant to clinical outcomes. FDA recommends equivalence test for CQAs from Tier 1, quality range approach for CQAs from Tier 2, and descriptive raw data and graphical presentation for CQAs from Tier 3 (Christl, 2015; Tsong,2015). Equivalence Test for Tier 1 For Tier 1, FDA recommends an equivalency testing for assessment of analytical similarity be performed. As indicated by the FDA, a potential approach could be a similar approach for bioequivalence testing for generic drug products (FDA, 2003; Chow and Liu, 2008). In other words, for a given critical attribute, we may test for equivalence of the following interval (null) hypothesis: " : % < − or % > where δ > 0 is the equivalence limit (or similarity margin), and % and are the mean responses of the test (proposed biosimilar) product and the reference product lots, respectively. Analytical equivalence (similarity) is concluded if the null hypothesis of not- equivalence (notsimilarity) is rejected. Note that Yu (2004) defined in-equivalence as confidence interval falls entirely outside the equivalence limits. Similar to the confidence interval approach for bioequivalence testing under the raw data model, analytical similarity would be accepted for the quality attribute if the (1- 2α)100% two-sided confidence interval of the mean difference is within (– δ, δ). Quality Range Approach for Tier 2 For Tier 2, FDA suggests that analytical similarity be performed based on the concept of quality ranges, i.e., ± , where is standard deviation of reference product and x should be appropriately justified. Thus, the quality range of the reference product for a specific quality attribute is defined as 0 , ̂ 0 ) . Analytical similarity would be accepted for the quality attribute if a sufficient percentage of test lot values (e.g. 90%) fall within the quality range. Raw Data and Graphical Comparison for Tier 3 For CQAs in Tier 3 with lowest risk ranking, FDA recommends an approach that uses raw data/graphical comparisons. The examination of similarity for CQAs in Tier 3 by no means is less stringent, which is acceptable because they have least impact on clinical outcomes in the sense that a notable dis-similarity will not affect clinical outcomes . Estimators of Define ’56 is the log transformed measured value of the 89 item in 89 lot and it follows normal distribution with mean 5 and variance 5 ; , where 5 and 5 ; are also random variables. The expectations of 5 and 5 ; are and ; and the variances are < ; and = ; . Then the variance of reference product ; = ’56 = ’56 5 , 5 ; + ’56 5 , 5 ; = ( 5 ) + ( 5 ; )= < ; + ; Define ’5E = F G H ’56 G H 6JF , ’EE = F K ’5E K 5JF , then FDA’s estimation of ; is ; L = 1 O −1 QQ ( ’56 ’EE ) ; G H 6JF K 5JF ; L = 1 O −1 QQ ’56 ’EE G H 6JF K 5JF = < ; + ; O −1 O −1 < ; It is unbiased when O =1 . But when O >1, the FDA approach tends to underestimate ; leading to a conservative test which may not be a fair and reliable assessment of analytical similarity for a given quality attribute. Following is an alternative approach to adjust the estimate of ; to unbiased. ; L O = 1 QQ ’56 ’5E ; G S 6JF K 5JF + 1 −1 Q ’EE ’5E ; K 5JF ; L = 1 QQ ’56 ’5E ; ′ − 1 G S 6JF K 5JF < ; L = 1 −1 Q ’EE ’5E ; K 5JF 1 QQ ’56 ’5E ; ′ − 1 G S 6JF K 5JF ESTABLISHMENT OF EAC [1] FDA (2012).Scientific considerations in demonstrating biosimilarity to a reference product.The United States Food and Drug Administration, Silver Spring, Maryland, USA. [2] Christl, L. (2015). Overview of the regulatory pathway and FDA’s guidance for the development and approval of biosimilar products in the US. Presented at the Oncologic Drugs Advisory Committee meeting, January 7, 2015, Silver Spring, Maryland. [3] Chow, S.C. and Liu, J.P. (2008). Design and Analysis of Bioavailability and Bioequivalence Studies .3 rd edition, Chapman Hall/CRC Press, Taylor& Francis, New York, New York, USA. [4] Chow, S.C. (2015). Challenging Issues in Assessing Analytical Similarity in Biosimilar Studies. Biosimilars, 5, 1-7. [5] FDA (2003). Guidance on Bioavailability and Bioequivalence Studies for Orally Administrated Drug Products – General Considerations, Center for Drug Evaluation and Research, the US Food and DrugAdministration, Rockville, Maryland, USA. [6]. Haidar, S.H., Davit, B., Chen, M.L., Conner, D., Lee, L., Li, Q.H., Lionberger, R., Makhlouf, F., Patel, D., Schuirmann, D.J., and Yu, L.X. (2008). Bioequivalence approaches for highly variable drugs and drug products. Pharmaceutical Research, 25: 237-241. SIMULATION RESULTS The empirical distribution of O L and T under the same assumption is shown below. Every estimate is obtained using 10 references lots with sample size 3. As an example, suppose that there are 20 RP lots and 6 TP lots. We first randomly select 6 out of the 20 RP lots to match the 6 TP lots. Also, suppose that the true difference between the biosimilar product and the reference product is proportional to σ R . From result above, the tests using estimated are more likely to conclude that the test product is similar to reference product than the test using the true population , which leads to an inflated alpha. For instance, when the difference is 2 σ R , i.e . the null hypothesis is true, the type 1 error of test using population is 10.09%. While, the type 1 error for the other two tests is respectively 30.5% and 29.8%. REFERENCES For analytical similarity assessment of CQAs in Tier 1, FDA recommends equivalence test under the null hypothesis. For testing the null hypothesis, FDA made the following assumptions. First, FDA assumes that the difference in mean responses between the reference product and the proposed biosimilar product is proportional to the variability of the reference product. In other words, = % . Based on this assumption, FDA indicates that the equivalence limit is likely to be proportional the reference product variability as well, which means which means =c* . FDA chose c=1.5. However when establishing equivalency acceptance criterion (EAC= 1.5* ), FDA consider the estimate of reference product variability obtained from the reference sample as a fixed constant, which may compromise the accuracy of the bioequivalence test. On Analytical Similarity Assessment in Biosimilar Studies Tongrong Wang, MS 1 and Shein-Chung Chow PhD 2 1 Master Candidate, Department of Biostatistics and Bioinformatics, Duke University School of Medicine 2 Professor, Department of Biostatistics and Bioinformatics, Duke University School of Medicine ABSTRACT Abstract For assessment of biosimilarity of biosimilar products, the United States (US) Food and Drug Administration (FDA) proposed a stepwise approach for providing totality-of-the-evidence of similarity between a proposed biosimilar product and a US-licensed (reference) product. The stepwise approach starts with assessment of critical quality attributes that are relevant to clinical outcomes in structural and functional characterization in manufacturing process of the proposed biosimilar product. FDA suggests that these critical quality relevant attributes be identified and classify into three tiers depending their criticality or risking ranking. To assist the sponsors, FDA also suggests some statistical approaches for assessment of analytical similarity for critical quality attributes (CQAs) from different tiers, namely equivalence test for Tier 1, quality range approach for Tier 2, and descriptive raw data and graphical comparison for Tier 3. Analytical similarity assessment for CQAs in Tier 1 is performed based on the equivalence acceptance criterion (EAC) which depends upon the estimate of variability of the reference product. The FDA’s recommended approach often underestimates the variability of the reference product because it does not take the worst possible lots into consideration. In this poster, the statistical properties of the FDA’s recommended approach is examined and alternative methods will be proposed in establishing a more accurate and reliable EAC for analytical similarity assessment. Keywords: Stepwise approach; Critical quality attribute (CQA); Equivalence test; Equivalence acceptance criterion (EAC). CONTACT Tongrong Wang Email: [email protected] Number of TP lots % Test size (Confidence Interval) Statistical Power using FDA estimated Statistical Power using modified estimated Statistical Power using population 6 0 5%(90%) 98.68% 98.62% 97.63% 6 1/8 5%(90%) 90.77% 90.53% 89.22% 6 1/4 5%(90%) 89.87% 90.22% 87.96% 6 1/2 5%(90%) 86.51% 86.23% 82.21% 6 1 5%(90%) 71.76% 71.28% 61.48%

Upload: others

Post on 07-Apr-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: OnAnalytical#SimilarityAssessment# in#Biosimilar …...potentialapproach could be a similar approach for bioequivalence testing for generic drug products (FDA, 2003; Chow and Liu,

ANALYTICAL  SIMILARITY  ASSESSMENT

For analytical similarity assessment of CQAs, FDA suggests thatCQAs that are relevant to clinical outcomes be identified andclassified into three tiers according to their criticality or riskingranking (e.g., most, mild to moderate, and least) relevant to clinicaloutcomes. FDA recommends equivalence test for CQAs from Tier 1,quality range approach for CQAs from Tier 2, and descriptive rawdata and graphical presentation for CQAs from Tier 3 (Christl, 2015;Tsong, 2015).

Equivalence Test for Tier 1For Tier 1, FDA recommends an equivalency testing for assessmentof analytical similarity be performed. As indicated by the FDA, apotential approach could be a similar approach for bioequivalencetesting for generic drug products (FDA, 2003; Chow and Liu,2008). In other words, for a given critical attribute, we may test forequivalence of the following interval (null) hypothesis:

𝐻": 𝜇% − 𝜇' < −𝛿 or 𝜇% − 𝜇' > 𝛿where δ > 0 is the equivalence limit (or similarity margin), and 𝜇%

and 𝜇' are the mean responses of the test (proposed biosimilar)product and the reference product lots, respectively. Analyticalequivalence (similarity) is concluded if the null hypothesis of not-equivalence (notsimilarity) is rejected. Note that Yu (2004) definedin-equivalence as confidence interval falls entirely outside theequivalence limits. Similar to the confidence interval approach forbioequivalence testing under the raw data model, analyticalsimilarity would be accepted for the quality attribute if the (1-2α)100% two-sided confidence interval of the mean difference iswithin (– δ, δ).

Quality RangeApproach for Tier 2For Tier 2, FDA suggests that analytical similarity be performed

based on the concept of quality ranges, i.e., 𝜇' ± 𝑥𝜎, where 𝜎 isstandard deviation of reference product and x should beappropriately justified. Thus, the quality range of the referenceproduct for a specific quality attribute is defined as (�̂�' −𝑥𝜎0', �̂�' − 𝑥𝜎0'). Analytical similarity would be accepted for thequality attribute if a sufficient percentage of test lot values (e.g.90%) fall within the quality range.

Raw Data andGraphical Comparison for Tier 3For CQAs in Tier 3 with lowest risk ranking, FDA recommends anapproach that uses raw data/graphical comparisons. Theexamination of similarity for CQAs in Tier 3 by no means is lessstringent, which is acceptable because they have least impact onclinical outcomes in the sense that a notable dis-similarity will notaffect clinical outcomes.

Estimators  of  𝝈𝑹

Define 𝑥'56 is the log transformed measured value of the 𝑗89 item in 𝑖89lot and it follows normal distribution with mean 𝜇5 and variance 𝜎5;, where 𝜇5 and 𝜎5; are also random variables. The expectations of 𝜇5 and 𝜎5; are 𝜇 and 𝜎; and the variances are 𝜎<; and 𝜎=;. Then the variance of reference product𝜎'; = 𝑉𝑎𝑟 𝑥'56 = 𝑉𝑎𝑟 𝐸 𝑥'56 𝜇5, 𝜎5; + 𝐸 𝑉𝑎𝑟 𝑥'56 𝜇5,𝜎5;

= 𝑉𝑎𝑟(𝜇5) + 𝐸(𝜎5;) =   𝜎<; + 𝜎;

Define 𝑥'5E =FGH∑ 𝑥'56GH6JF , 𝑥'EE =

FK∑ 𝑥'5EK5JF ,

then FDA’s estimation of 𝜎'; is

𝜎';L =1

𝑛O𝑘 − 1QQ(𝑥'56 − 𝑥'EE);

GH

6JF

K

5JF

𝐸 𝜎';L =1

𝑛O𝑘 − 1QQ 𝑉𝑎𝑟 𝑥'56 − 𝑉𝑎𝑟 𝑥'EE

GH

6JF

K

5JF

= 𝜎<; + 𝜎; −𝑛O − 1𝑛O𝑘 − 1𝜎<

;

It is unbiased when 𝑛O = 1. But when 𝑛O > 1, the FDA approach tends to underestimate 𝜎'; leading to a conservative test which may not be a fair and reliable assessment of analytical similarity for a given quality attribute. Following is an alternative approach to adjust the estimate of 𝜎'; to unbiased.

𝜎';LO=

1𝑛′𝑘

QQ 𝑥'56 − 𝑥'5E;

GS

6JF

K

5JF

+1

𝑘 − 1Q 𝑥'EE − 𝑥'5E ;K

5JF

𝜎;L =1𝑘QQ

𝑥'56 − 𝑥'5E;

𝑛′ − 1

GS

6JF

K

5JF

𝜎<;L =1

𝑘 − 1Q 𝑥'EE − 𝑥'5E ;K

5JF

−1𝑘𝑛′

QQ𝑥'56 − 𝑥'5E

;

𝑛′ − 1

GS

6JF

K

5JF

ESTABLISHMENT  OF  EAC

[1] FDA (2012).Scientific considerations in demonstrating biosimilarity to a referenceproduct.The United States Food and Drug Administration, Silver Spring, Maryland, USA.[2] Christl, L. (2015). Overview of the regulatory pathway and FDA’s guidance for thedevelopment and approval of biosimilar products in the US. Presented at the OncologicDrugs Advisory Committee meeting, January 7, 2015, Silver Spring, Maryland.[3] Chow, S.C. and Liu, J.P. (2008).Design and Analysis of Bioavailability andBioequivalence Studies.3rd edition, Chapman Hall/CRC Press, Taylor& Francis, New York,New York, USA.[4] Chow, S.C. (2015). Challenging Issues in Assessing Analytical Similarity in BiosimilarStudies. Biosimilars, 5, 1-7.[5] FDA (2003). Guidance on Bioavailability and Bioequivalence Studies for OrallyAdministrated Drug Products – General Considerations, Center for Drug Evaluation andResearch, the US Food and Drug Administration, Rockville, Maryland, USA.[6]. Haidar, S.H., Davit, B., Chen, M.L., Conner, D., Lee, L., Li, Q.H., Lionberger, R.,Makhlouf, F., Patel, D., Schuirmann, D.J., and Yu, L.X. (2008). Bioequivalence approachesfor highly variable drugs and drug products. Pharmaceutical Research, 25: 237-241.

SIMULATION RESULTS

The empirical distribution of 𝝈𝑹OL and 𝝈𝑹T under the same assumptionis shown below. Every estimate is obtained using 10 references lotswith sample size 3.

As an example, suppose that there are 20 RP lots and 6 TP lots. Wefirst randomly select 6 out of the 20 RP lots to match the 6 TP lots.Also, suppose that the true difference between the biosimilarproduct and the reference product is proportional to σR.

From result above, the tests using estimated 𝝈𝑹 are more likely toconclude that the test product is similar to reference product than thetest using the true population 𝝈𝑹, which leads to an inflated alpha. Forinstance, when the difference is 2 σR , i.e . the null hypothesis is true,the type 1 error of test using population𝝈𝑹 is 10.09%. While, the type 1error for the other two tests is respectively 30.5% and 29.8%.

REFERENCES

For analytical similarity assessment of CQAs in Tier 1, FDArecommends equivalence test under the null hypothesis. For testingthe null hypothesis, FDA made the following assumptions. First,FDA assumes that the difference in mean responses between thereference product and the proposed biosimilar product isproportional to the variability of the reference product. In otherwords, ∆ = 𝜇% − 𝜇'∝𝜎'. Based on this assumption, FDA indicatesthat the equivalence limit is likely to be proportional the referenceproduct variability as well, which means which means 𝛿=c*𝜎'.FDA chose c=1.5. However when establishing equivalencyacceptance criterion (EAC= 1.5*𝜎'), FDA consider the estimate ofreference product variability obtained from the reference sample asa fixed constant, which may compromise the accuracy of thebioequivalence test.

On  Analytical  Similarity  Assessment  in  Biosimilar StudiesTongrong Wang, MS1 and Shein-Chung Chow PhD21 Master Candidate, Department of Biostatistics and Bioinformatics, Duke University School of Medicine 2 Professor, Department of Biostatistics and Bioinformatics, Duke University School of Medicine

ABSTRACT

AbstractFor assessment of biosimilarity of biosimilar products,the United States (US) Food and Drug Administration(FDA) proposed a stepwise approach for providingtotality-of-the-evidence of similarity between aproposed biosimilar product and a US-licensed(reference) product. The stepwise approach startswith assessment of critical quality attributes that arerelevant to clinical outcomes in structural andfunctional characterization in manufacturing processof the proposed biosimilar product. FDA suggeststhat these critical quality relevant attributes beidentified and classify into three tiers depending theircriticality or risking ranking. To assist the sponsors,FDA also suggests some statistical approaches forassessment of analytical similarity for critical qualityattributes (CQAs) from different tiers, namelyequivalence test for Tier 1, quality range approach forTier 2, and descriptive raw data and graphicalcomparison for Tier 3. Analytical similarityassessment for CQAs in Tier 1 is performed based onthe equivalence acceptance criterion (EAC) whichdepends upon the estimate of variability of thereference product. The FDA’s recommendedapproach often underestimates the variability of thereference product because it does not take the worstpossible lots into consideration. In this poster, thestatistical properties of the FDA’s recommendedapproach is examined and alternative methods will beproposed in establishing a more accurate and reliableEAC for analytical similarity assessment.

Keywords:Stepwise approach; Critical quality attribute (CQA);Equivalence test; Equivalence acceptance criterion(EAC).

CONTACT

Tongrong WangEmail: [email protected]

Number of TP lots

𝜇% − 𝜇'𝜎'

Test size (Confidence

Interval)

Statistical Power using

FDA estimated 𝜎'

Statistical Power using

modified estimated 𝜎'

Statistical Power using

population 𝜎'

6 0 5%(90%) 98.68% 98.62% 97.63%6 1/8 5%(90%) 90.77% 90.53% 89.22%6 1/4 5%(90%) 89.87% 90.22% 87.96%6 1/2 5%(90%) 86.51% 86.23% 82.21%6 1 5%(90%) 71.76% 71.28% 61.48%