14  Randomization and Blinding

Randomization and blinding are often taken for granted, yet the methodological history of medicine contains numerous examples of treatments that seemed effective in uncontrolled studies but proved worthless—or even harmful—when properly tested.

Consider a simple scenario. A physician enthusiastic about a new treatment prescribes it to patients who seem likely to benefit and withholds it from those who seem too sick to respond. When the treated patients do better, is it because the treatment works, or because healthier patients were selected for treatment?

This is not a hypothetical concern. For decades, hormone replacement therapy was thought to prevent heart disease based on observational studies showing that women who took hormones had lower rates of heart attacks. When randomized trials were finally conducted, they showed the opposite—hormone therapy slightly increased cardiovascular risk. The observational studies had been confounded: women who chose to take hormones were healthier to begin with.

Randomization eliminates this confounding (Cox 2009). When a coin flip (or its statistical equivalent) determines who receives treatment and who receives control, the groups are comparable at baseline. Any differences that emerge can be attributed to the treatment itself. Cox identifies three broad purposes for randomization: to avoid selection and other biases in a publicly convincing way; to provide a unified approach to estimating error in standard designs; and to provide a basis for exact tests of significance. Of these, bias avoidance is the most compelling in clinical trials—failure to randomize appropriately may fatally compromise an investigation. Moreover, randomization convinces those not directly involved in the trial that the comparison is fair.

14.1 Types of Randomization

Table 14.1 compares the major randomization strategies and their appropriate use cases.

Table 14.1: Randomization Methods Comparison
Method Description Advantages Disadvantages Best For
Simple Pure random allocation (coin flip) Simple; Unpredictable Possible group imbalance Large trials (n > 200)
Block Random within fixed-size blocks Equal group sizes; Predictable Potential unblinding at block ends Most trials
Stratified Separate randomization by strata Balance on key factors Complexity increases with factors Trials with strong prognostic factors
Minimization Dynamic allocation based on current imbalance Excellent balance; Multiple factors Slightly predictable; Software needed Multi-center trials; Many covariates
Cluster Randomize groups, not individuals Practical for interventions affecting groups Reduced power; Contamination concerns Community/system interventions

The simplest approach is simple randomization—essentially flipping a coin for each participant. This works well for large trials but can produce unequal group sizes in smaller studies. By chance, a trial randomizing 20 patients might end up with 13 in one group and 7 in the other. Bradley Efron addressed this problem in 1971 with the biased coin design, which increases the probability of assigning to the underrepresented group when imbalance develops (Efron 1971).

Block Randomization

Block randomization addresses the imbalance problem by randomizing within blocks of fixed size, as illustrated in Figure 14.1.

flowchart LR
    subgraph Block1["Block 1"]
        P1["Patient 1"] --> A1["Treatment A"]
        P2["Patient 2"] --> B1["Treatment B"]
        P3["Patient 3"] --> A2["Treatment A"]
        P4["Patient 4"] --> B2["Treatment B"]
    end
    
    subgraph Block2["Block 2"]
        P5["Patient 5"] --> B3["Treatment B"]
        P6["Patient 6"] --> A3["Treatment A"]
        P7["Patient 7"] --> B4["Treatment B"]
        P8["Patient 8"] --> A4["Treatment A"]
    end
    
    Block1 --> R1["After Block 1:<br/>A=2, B=2"]
    Block2 --> R2["After Block 2:<br/>A=4, B=4"]
Figure 14.1: Block randomization ensures equal group sizes after each complete block. This example shows a block size of 4.

With blocks of 4, for example, each block might have the pattern ABAB, ABBA, BABA, or another of the six possible arrangements with two A’s and two B’s. After every complete block, the groups are guaranteed to be equal in size.

Stratified randomization goes further by randomizing separately within subgroups defined by important prognostic factors. If disease severity is a major predictor of outcome, patients might be stratified as mild, moderate, or severe, with randomization occurring independently within each stratum. This ensures that treatment groups are balanced on severity, even if overall enrollment is skewed toward one level.

Minimization (or dynamic allocation) takes a different approach (Taves 1974; Pocock and Simon 1975). Rather than randomizing each patient independently, minimization algorithms assign each new patient based on the current imbalance among enrolled patients. If the treatment group currently has more older patients, the next older patient may be somewhat more likely to be assigned to control. This approach can achieve excellent balance on multiple factors simultaneously—Taves’ original simulations showed a four- to fivefold reduction in the probability of severe imbalance compared to standard randomization (Scott et al. 2002). A comprehensive 2023 tutorial demonstrated that minimization is particularly beneficial when numerous major prognostic factors are known, when many centers of varying sizes recruit patients, or when the trial’s sample size is moderate (Coart et al. 2023).

14.2 Maintaining Randomization Integrity

The value of randomization depends on its integrity. If investigators can predict or influence which treatment a patient will receive, the protection against bias is lost.

Allocation concealment is the mechanism that prevents this. In older trials, numbered envelopes were used—sometimes inadequately, leading to investigators holding envelopes up to light to see the assignment inside. Modern trials use Interactive Response Technology (IRT) systems—computer or telephone systems that reveal the assignment only after the investigator has committed to enrolling a specific patient.

Once randomization occurs, the assignment should be followed. Treatment crossover—where patients switch from their assigned group to another—can blur the distinction between groups and complicate analysis. When crossover is unavoidable (as when patients on placebo deteriorate and ethical obligations require access to active treatment), appropriate analytical methods must be used.

14.3 The Power of Blinding

If randomization protects against bias in allocation, blinding protects against bias in everything that follows.

Consider a trial where investigators know which patients are receiving the experimental drug. They might, consciously or unconsciously, give those patients extra attention, monitor them more closely, interpret symptoms more charitably, or encourage them to continue despite side effects. Patients who know they are receiving active treatment might report more improvement due to expectations.

When outcomes are subjective—pain, mood, quality of life—these biases can be substantial. Even supposedly objective outcomes can be affected: a radiologist who knows a patient is on active treatment might interpret a borderline scan as showing improvement.

Blinding addresses these concerns by ensuring that treatment assignment is unknown to those who might be influenced by it.

14.4 Levels of Blinding

Trials utilize varying levels of blinding depending on the research question and practical constraints. Open-label trials make no attempt at concealment and are often used when comparing radically different interventions, such as surgery versus medication, though they are highly susceptible to observer bias. Single-blind trials reduce participant expectations by concealing the assignment from patients, while double-blind trials—the industry standard—conceal assignments from both patients and investigators to protect against subjective interpretation of results. In the most rigorous triple-blind designs, the assignment is also withheld from those performing the statistical analysis until the study is formally concluded.

14.5 Technical Aspects of Blinding

Achieving and maintaining the blind requires careful attention to multiple details.

Treatment and placebo must be indistinguishable. This means matching appearance (color, size, shape), taste, smell, and texture. If the active drug comes as a capsule and the comparator is a tablet, both may need to be over-encapsulated to create matching appearances.

Unblinding procedures must be in place for emergencies. If a participant has a medical crisis and the treating physician needs to know the treatment assignment, there must be a mechanism to break the blind for that individual while maintaining it for everyone else.

Side effects can compromise blinding. If the experimental drug causes a distinctive adverse effect—dry mouth, dizziness, skin discoloration—participants and investigators may guess the assignment. Trials sometimes include perception of assignment assessments to evaluate whether blinding was maintained.

When true blinding is not possible, blinded assessment provides an alternative. An assessor who has no contact with patients other than performing specific evaluations may remain blind even when other study staff cannot. Imaging studies can be read by radiologists who have no knowledge of treatment assignment.

14.6 The Challenge of Device and Behavioral Trials

Blinding is particularly challenging in trials of devices, surgical procedures, and behavioral interventions. How do you blind a patient to whether they received surgery? How do you blind a therapist to whether they are delivering cognitive behavioral therapy or a control intervention?

Sham procedures provide one approach. In studies of surgical interventions, patients in the control group may receive anesthesia and incisions but not the actual procedure. This is ethically controversial—some argue that exposing patients to surgical risk without potential for direct benefit is unacceptable—but it may be the only way to separate the effects of the procedure from placebo effects and expectation.

Active comparator designs offer an alternative. Rather than trying to blind the comparison to nothing, both groups receive an intervention—the question becomes which intervention is better.

14.7 Impact on Trial Design

Decisions regarding randomization and blinding shape the overall trial architecture. Increasing the number of treatment arms complicates stratification and balance, while the planned duration of blinding must account for cross-over periods and long-term follow-up requirements. Furthermore, sponsors must decide how to handle treatment discontinuations—ensuring that even if a patient stops the study medication, their follow-up and data collection continue in a way that preserves the integrity of the randomized comparison.

14.8 Advanced Topics

For trials where treatment allocation can adapt based on patient responses, response-adaptive randomization methods assign more patients to treatments showing better outcomes (Wei and Durham 1978; Hu and Rosenberger 2006). These approaches raise the ethical appeal of treating more patients with effective therapies during the trial itself, though they introduce statistical complexities and have historically been controversial following early implementation challenges.

Covariate-adaptive randomization extends minimization by incorporating patient characteristics into allocation decisions in more sophisticated ways (Atkinson 2002; Lin, Zhu, and Su 2015). When randomization is implemented sequentially (one patient at a time), a key tension emerges: knowing previous allocations may allow prediction of the next assignment, potentially enabling selection bias. Atkinson carefully compared procedures that balance this tension between achieving covariate balance and maintaining unpredictability (Cox 2009). The FDA’s 2023 guidance on covariate adjustment recommends that sponsors prospectively specify how baseline covariates will be used in both randomization and analysis (U.S. Food and Drug Administration 2023).

For comprehensive treatment of randomization theory and practice, see Rosenberger and Lachin (Rosenberger and Lachin 2016), which covers the mathematical foundations, and Hu and Rosenberger (Hu and Rosenberger 2006) for response-adaptive methods specifically. For the theoretical foundations of adaptive randomization, including the multi-armed bandit problem and Bayesian updating, see Chapter 11.