STATISTICS Year : 2022  Volume : 13  Issue : 2  Page : 114117 Equivalence trials Priya Ranganathan^{1}, CS Pramesh^{2}, Rakesh Aggarwal^{3}, ^{1} Department of Anaesthesiology, Tata Memorial Centre, Homi Bhabha National Institute, Mumbai, Maharashtra, India ^{2} Division of Thoracic Surgery, Tata Memorial Centre, Homi Bhabha National Institute, Mumbai, Maharashtra, India ^{3} Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India Correspondence Address: In an equivalence trial, researchers aim to show that a new intervention is more or less similar to an existing standard of care, i.e., the two are “equivalent”. In this article, we discuss some aspects of the design, analysis, interpretation and reporting of equivalence trials.
Examples of Equivalence Trials in the Published Literature McCann hypothesized that neurodevelopment at 5 years of age would be equivalent in children who received either general anesthesia or regional anesthesia for surgical procedures during their infancy.[2] Equivalence was defined as a difference of no more than 5 points on an intelligence quotient score.Marzocchi compared tirofiban with abciximab for facilitated angioplasty in patients with STelevation myocardial infarction.[3] The primary hypothesis was that tirofiban would be equivalent to abciximab in achieving complete STsegment resolution. The margin of clinical equivalence between the two drugs was fixed at 10% in the proportion of patients achieving complete STsegment resolution.Cunningham evaluated the equivalence of starting oxygen therapy in infants with bronchiolitis when oxygen saturation had reached below 90% versus doing so when it had reached below 94%.[4] The primary outcome for equivalence was time to resolution of cough, with limits of equivalence defined as ± 2 days. Equivalence trials are used to show that two interventions are not unacceptably different and that one can be substituted for another with nearly similar efficacy while offering other advantages, for example, lower cost, reduced toxicity, or greater ease of administration. Hypothesis Testing and Statistical Errors in Equivalence Trials In equivalence trials, we start with the null hypothesis that the difference between the experimental arm and the comparator arm will be greater than the predefined margin of equivalence (d). We use a twosided approach, which means that we allow for either of the arms to be superior (and the difference between experimental and comparator arms to be greater than +d or −d). The alternate hypothesis is that the difference between the arms will be less than d (difference ranging between +d and −d). [Figure 1] illustrates the null and the alternate hypothesis in an equivalence trial.{Figure 1} The type 1 error in an equivalence trial is the risk of falsely rejecting the null hypothesis; this means that we accept equivalence when the treatments are, in fact, not equivalent. This is usually set at 5%, which means that we need to be 95% confident that the difference between the treatments does not exceed d in either direction. The type 2 error in an equivalence trial is the risk of falsely rejecting the alternate hypothesis; this means that we fail to detect equivalence where it exists. The Confidence Interval Approach We use the confidence interval (CI) approach to express the results of an equivalence study. To prove equivalence, the CI for the difference between the interventions must lie within the boundaries of − d to + d. The determination of CI may be done in one of two ways:[5] The two onesided test (TOST) approach uses TOSTs with a 5% significance level each (corresponding to a twosided 90% CI)One could use a single twosided test with a 5% significance level (corresponding to a twosided 95% CI). [Figure 2] shows various possible results for an equivalence study.{Figure 2} Choosing the Margin of Equivalence The margin of equivalence “d” is set as the largest difference that would still allow the two interventions to be considered “nearly” equivalent. It is to be noted that “equivalence does not mean 100% absolute equivalence but that despite some small difference (<d), the two interventions are clinically indistinguishable.”[6] Superiority trials differ from equivalence trials in that they aim to prove that an intervention is substantially different from an existing standard. This is in contrast to an equivalence trial where the aim is to show that there is no substantial difference between interventions. The margin of difference in a superiority trial is, therefore, chosen based on the least difference that would be considered clinically meaningful and is, therefore, often larger than the margin that would be chosen in an equivalence trial. Difference Between Noninferiority and Equivalence Trials Noninferiority (NI) trials set out with the hypothesis that the experimental arm is slightly inferior to the comparator (by no more than the predefined margin of inferiority) but with some ancillary benefits. These trials use a onesided approach for testing and do not investigate whether the experimental arm may be actually superior to the comparator. In contrast, equivalence trials have the objective of showing that the experimental arm is no different, i.e. only slightly different (superior or inferior) to the comparator (by no more than the predefined margin of equivalence). These studies use a twosided approach for testing. Choice of Control in Equivalence Trials For both NI and equivalence trials, the choice of control is a crucial aspect of the validity of the study. Superiority trials establish the superiority of a new intervention over the comparator; therefore, they may use a placebo as a control (if there is no active standard of care). By contrast, equivalence and NI trials aim to prove that the new intervention is equivalent or noninferior to a comparator and are undertaken only where there is a welldefined standard of care for a particular condition. For such comparisons, it is essential that the control that is chosen is already proven to be superior to placebo under similar conditions. If the comparator has not been validated in a placebocontrolled study, then the results of the NI or equivalence study are uninterpretable since equivalence means that the new intervention may be just as good as a placebo. Negative Results in Superiority Trials Do Not Imply Equivalence Let us take the hypothetical example of a randomized trial to compare two antiemetics, A and B, for the primary outcome of the proportion of participants having postoperative vomiting. [Figure 3] shows two possible results of this study. In scenario 1, the difference between treatments is 10% with 95% CIs of 5% to 15%. This suggests that we are 95% confident that treatment A is superior to treatment B, with the treatment benefit ranging from at least 5% to as much as 15%. In scenario 2, the difference between treatments is 10%, with 95% CIs ranging from −5 to 25%; this would be interpreted as the true treatment effect ranging anywhere from a 5% detriment to 25% benefit. This would not be a statistically significant result since the CIs for the difference include zero, suggesting that there may be no difference between the interventions, and in fact, the possibility of treatment A being worse than treatment B cannot be ruled out. It would be incorrect to conclude that since A is not superior to B, the two treatments are equivalent since A could be (a) worse than B by 5% or (b) better than B by 25%. We have not predefined the margin that we would consider equivalent and, therefore, cannot assume that absence of evidence of superiority is evidence of equivalence.{Figure 3} Sample Size For Equivalence Trials Similar to comparative superiority trials, the sample size for an equivalence study depends on the margin of equivalence and the type 1 and 2 errors. Since the margin “d” is generally kept fairly small, the sample size for an equivalence study tends to be much larger than for a similar comparison in a superiority study. Analysis and Reporting of Results of Equivalence Studies Since intentiontotreat analysis tends to minimize differences between interventions (which could lead to an erroneous conclusion of equivalence), researchers should present the results of both the intentiontotreat and the perprotocol analysis of such trials.[7] The CONSORT statement for the reporting of clinical trials has an extension for the reporting of NI and equivalence trials.[8] Bioequivalence Trials Bioequivalence studies are a special type of equivalence studies whose aim is to show that two drugs (often two formulations of the same drug) are comparable with respect to one or more pharmacokinetic properties, for example, peak or trough plasma levels, or plasma level at a specified time point. These parameters are important when one drug is to be substituted for another, as in the case of biosimilars. Bioequivalence studies, though similar in principle to equivalence studies, are usually carried out as crossover studies and thus use statistical procedures for paired data. Financial support and sponsorship Nil. Conflicts of interest There are no conflicts of interest. References


