Introduction to R Package CoRpower

The CoRpower package performs power calculations for correlates of risk, as described in Gilbert, Janes, and Huang (2016). Power/Sample Size Calculations for Assessing Correlates of Risk in Clinical Efficacy Trials. The power calculations accommodate three types of biomarkers:

trichotomous
dichotomous
continuous,

as well as two types of sampling design:

without replacement sampling for retrospective case-control designs
Bernoulli sampling for prospective case-cohort designs.

The vignette aims to illustrate distinct features of the functions in the package (with some mathematical background) by walking through a number of power calculation scenarios for different biomarker types, sampling designs, and input parameters.

The functions included in this package are:

computeN()
computePower()
plotPowerTri()
plotPowerCont()
plotRRgradVE()
plotVElatCont()

Set-up and notation | Without replacement sampling

Assume a randomized vaccine vs. placebo/control vaccine efficacy trial
Participants are followed for the first occurrence of the primary clinical study endpoint, with follow-up through time τ_max
T is the time from randomization (or enrollment) to the primary endpoint
Y = I(T ≤ τ_max) is the binary outcome of interest
Δ is the indicator that Y is observed; i.e., Δ = 0 if the participants drops out before τ_max and before experiencing the primary endpoint and Δ = 1 otherwise
N₁ is the number of vaccine recipients observed or expected to be at risk at τ (typically, τ is the biomarker sampling timepoint)
n_cases, 1 (n_{controls, 1}) is the number of observed or expected cases (controls) in vaccine recipients between τ and τ_max, where cases have ΔY = 1 and controls have Δ(1 − Y) = 1
- Note that both cases and controls have Δ = 1
- n_cases, 1 + n_{controls, 1} ≤ N₁
n_cases, 1^S (n_{controls, 1}^S) is the number of observed cases (controls) in vaccine recipients between τ and τ_max with measured biomarker response S(1) (or S^*(1))
If calculations done at design stage, then N₁, n_cases, 1, n_{controls, 1}, n_cases, 1^S, and n_{controls, 1}^S are expected counts

Algorithm for trichotomous biomarker S(1) | Without replacement sampling

Specify true overall VE between τ and τ_max
- Protocol-specified design alternative or $\widehat{VE}$
Specify risk₀ = P(Y = 1 ∣ Z = 0, Y^τ = 0) where Y^τ = I(T ≤ τ)
- Protocol-specified placebo-group endpoint rate or $\widehat{risk}_0$
Select a grid of VE₀^lat values
- E.g., ranging from VE (H₀) to 0 (maximal H₁ not allowing harm by vaccination)
Select a grid of VE₁^lat ≥ VE₀^lat values
- E.g., VE₁^lat = VE
Specify P₀^lat and P₂^lat, then P₁^lat = 1 − P₀^lat − P₂^lat
- Assuming risk₀^lat(x) = risk₀, VE = VE₀^latP₀^lat + VE₁^latP₁^lat + VE₂^latP₂^lat yields VE₂^lat
- If VE₀^lat varies from VE to 0, then VE₂^lat varies from VE to VE (P₀^lat + P₂^lat)/P₂^lat
Specify P₀ and P₂, then P₁ = 1 − P₀ − P₂
- E.g., P₀ = P₀^lat and P₂ = P₂^lat
Approach 1: Specify two of the three (Sens, FP⁰, FP¹) and two of the three (Spec, FN², FN¹)
- E.g., specify Sens and Spec and FP⁰ = FN² = 0
Approach 2: Specify σ_obs² and ρ = σ_tr²/σ_obs² and determine (Sens, Spec, FP⁰, FP¹, FN¹, FN²) (see below)
- Typically, σ_obs² = 1
- Calculation of (Sens, Spec, FP⁰, FP¹, FN¹, FN²)
  1. Assuming the classical measurement error model, where X^* ∼ N(0, ρσ_obs²), solve P₀^lat = P(X^* ≤ θ₀) and P₂^lat = P(X^* > θ₂) for θ₀ and θ₂
  2. Generate B realizations of X^* and S^* = X^* + e, where e ∼ N(0, (1 − ρ)σ_obs²), and X^* independent of e
    - B = 20, 000 by default
  3. Using θ₀ and θ₂ from the first step, define $$ \begin{align} Spec(\phi_0) &= P(S^{\ast} \leq \phi_0 \mid X^{\ast} \leq \theta_0)\\ FN^1(\phi_0) &= P(S^{\ast} \leq \phi_0 \mid X^{\ast} \in (\theta_0,\theta_2])\\ FN^2(\phi_0) &= P(S^{\ast} \leq \phi_0 \mid X^{\ast} > \theta_2)\\ Sens(\phi_2) &= P(S^{\ast} > \phi_2 \mid X^{\ast} > \theta_2)\\ FP^1(\phi_2) &= P(S^{\ast} > \phi_2 \mid X^{\ast} \in (\theta_0,\theta_2])\\ FP^0(\phi_2) &= P(S^{\ast} > \phi_2 \mid X^{\ast} \leq \theta_0) \end{align} $$
    
    Estimate Spec(ϕ₀) by $$\widehat{Spec}(\phi_0) = \frac{\#\{S^{\ast}_b \leq \phi_0, X^{\ast}_b \leq \theta_0\}}{\#\{X^{\ast}_b \leq \theta_0\}}\,$$ etc.
  4. Find ϕ₀ = ϕ₀^* and ϕ₂ = ϕ₂^* that numerically solve $$ \begin{align} P_0 &= \widehat{Spec}(\phi_0)P^{lat}_0 + \widehat{FN}^1(\phi_0)P^{lat}_1 + \widehat{FN}^2(\phi_0)P^{lat}_2\\ P_2 &= \widehat{Sens}(\phi_2)P^{lat}_2 + \widehat{FP}^1(\phi_2)P^{lat}_1 + \widehat{FP}^0(\phi_2)P^{lat}_0 \end{align} $$ and compute $$ Spec = \widehat{Spec}(\phi^{\ast}_0),\; Sens = \widehat{Sens}(\phi^{\ast}_2),\; \textrm{etc.} $$
- In Approach 2, plot $$ RR_t \quad \textrm{vs.} \quad \frac{RR^{lat}_2}{RR^{lat}_0}, $$ where RR_t is the CoR effect size defined as $$ RR_t := \frac{risk_1(2)}{risk_1(0)} = \frac{\sum_{x=0}^2 RR^{lat}_x P(X=x|S(1)=2)}{\sum_{x=0}^2 RR^{lat}_x P(X=x|S(1)=0)} $$
- If ρ < 1, then RR_t is closer to 1 than RR₂^lat/RR₀^lat
  - Note that, under the assumption of homogeneous risk in the placebo group, i.e., risk₀^lat(x) = risk₀, x = 0, 1, 2, the relative risk ratio RR₂^lat/RR₀^lat = risk₁^lat(2)/risk₁^lat(0)
  - Consequently, risk₁(2)/risk₁(0) > risk₁^lat(2)/risk₁^lat(0) because, if ρ < 1, then risk₁(2) > risk₁^lat(2) and risk₁(0) < risk₁^lat(0)
Simulate M data sets under the true parameter values:
1. N_x = (n_{controls, 1} + n_cases, 1)P_x^lat
2. (n_cases, 1(0), n_cases, 1(1), n_cases, 1(2)) ∼ Mult(n_cases, 1, (p₀, p₁, p₂)), where p_k = P(X = k|Y = 1, Y^τ = 0, Z = 1)
3. For each of the N_x subjects, generate S_i(1) by a draw from Mult(1, (p₀, p₁, p₂)), where p_k = P(S(1) = k|X = x)
4. Sample without replacement n_cases, 1^S and n_{controls, 1}^S = Kn_cases, 1^S controls with measured S(1) (R = 1), i.e., the control:case ratio is not fixed within subgroup X = x
For each observed data set, compute the 1-sided one-degree-of-freedom Wald test statistic for H₀ ⇔ {H̃₀ : β_S(1) ≥ 0} from IPW logistic regression model that accounts for biomarker sampling design (function tps in R package osDesign)
- Alternatively, use the generalized two-degree-of-freedom Wald test
Compute power as proportion of data sets with 1-sided Wald test p ≤ α/2 for specified α
Repeat power calculation varying control:case ratio, n_cases, 1, n_cases, 1^S, (P₀^lat, P₂^lat, P₀, P₂), (Sens, Spec), ρ

Illustration: hypothetical randomized placebo-controlled VE trial

Trial design

N_rand = 4, 100 participants randomized to each of the vaccine and placebo group and followed for τ_max= 24 months
Samples for measurement of S(1) at τ= 3.5 months stored in all vaccine recipients
Cumulative endpoint rate between τ and τ_max in placebo group = 3.4% ( = risk₀)
VE_{τ − τ_max} = 75%, VE_0 − τ = VE_{τ − τ_max}/2
Cumulative dropout rate between 0 and τ_max in both groups = 10%

Illustration: calculation of input parameters with `computeN()`

Assumptions

Failure time T and censoring time C are independent
T ∣ Z = 0 ∼ Exp(λ_T) and C ∣ Z = 0 ∼ Exp(λ_C)
$RR_{\tau-\tau_{\mathrm{max}}} := \frac{P(T \leq \tau_{\mathrm{max}} \mid T>\tau, Z=1)}{P(T \leq \tau_{\mathrm{max}} \mid T>\tau, Z=0)} = \frac{P(T \leq t\mid T>\tau, Z=1)}{P(T \leq t\mid T>\tau, Z=0)}$ for all t ∈ (τ, τ_max]

Number of vaccine recipients observed to be at risk at τ

$$ \begin{align} N_1 &= N_{rand}\, P(T>\tau, C>\tau \mid Z=1)\\ &= N_{rand}\, P(T>\tau \mid Z=1)\, P(C>\tau \mid Z=1)\\ &= N_{rand}\, \{1 - RR_{0-\tau}\, P(T \leq \tau \mid Z=0)\}\, P(C>\tau \mid Z=1)\\ &\approx 4,023 \end{align} $$

Number of observed cases in vaccine recipients between τ and τ_max

$$ \begin{align} n_{cases,1} &= N_1\, P(T\leq \tau_{\mathrm{max}}, T\leq C \mid T>\tau, C>\tau, Z=1)\\ &= N_1\, P(T\leq \min(C,\tau_{\mathrm{max}}) \mid T>\tau, C>\tau, Z=1)\\ &= N_1\, \frac{\int_{\tau}^{\infty}P(\tau < T \leq \min(c,\tau_{\mathrm{max}})\mid Z=1)f_C(c)\mathop{\mathrm{d}}\!c}{P(T>\tau, C>\tau \mid Z=1)}\\ &= N_1\, \frac{\bigg\{\int_{\tau}^{\tau_{\mathrm{max}}}P(\tau < T \leq c\mid Z=1)f_C(c)\mathop{\mathrm{d}}\!c + P(\tau < T \leq \tau_{\mathrm{max}}\mid Z=1) P(C>\tau_{\mathrm{max}})\bigg\}}{P(T>\tau, C>\tau \mid Z=1)}\\ &\approx 32 \end{align} $$

Number of observed controls in vaccine recipients between τ and τ_max

$$ \begin{align} n_{controls,1} &= N_1 \, P(T > \tau_{\mathrm{max}}, C > \tau_{\mathrm{max}} \mid T>\tau, C>\tau, Z=1)\\ &\approx 3,654 \end{align} $$

Number of observed cases (controls) in vaccine recipients between τ and τ_max with measured S(1)

$$ \begin{align} n^S_{cases,1} &= n_{cases,1}\\ n^S_{controls,1} &= K n^S_{cases,1} \end{align} $$

Compute N₁, n_cases, 1, n_{controls, 1}, and n_cases, 1^S with `computeN()`

library(CoRpower)
computeN(Nrand = 4100,          # participants randomized to vaccine arm
         tau = 3.5,             # biomarker sampling timepoint
         taumax = 24,           # end of follow-up
         VEtauToTaumax = 0.75,  # VE between 'tau' and 'taumax'
         VE0toTau = 0.75/2,     # VE between 0 and 'tau'
         risk0 = 0.034,         # placebo-group endpoint risk between 'tau' and 'taumax'
         dropoutRisk = 0.1,     # dropout risk between 0 and 'taumax'
         propCasesWithS = 1)    # proportion of observed cases with measured S(1)

## $N
## [1] 4023
## 
## $nCases
## [1] 33
## 
## $nControls
## [1] 3653
## 
## $nCasesWithS
## [1] 33

Illustration: `CoRpower` for trichotomous S(1) | Without replacement sampling

Approach 1 (Sens, Spec, FP⁰, FN² specified)

Scenario 1: vary control:case ratio
Scenario 2: vary Sens, Spec
Scenario 3: vary P₀^lat, P₂^lat, P₀, P₂

Approach 2 (σ_obs² and ρ specified)

Scenario 4: vary ρ
Scenario 5: vary P₀^lat, P₂^lat, P₀, P₂
Scenario 6: vary n_cases, 1

Scenario 1: vary control:case ratio (Approach 1) | Trichotomous S(1), without replacement sampling

Run simulations and compute power with `computePower()`

pwr <- computePower(nCasesTx = 32,                 
                    nControlsTx = 3654,            
                    nCasesTxWithS = 32,           
                    controlCaseRatio = c(5, 3, 1), # n^S_controls : n^S_cases ratio         
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk from tau - taumax
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = 0.2,                  # prevalence of lower protected
                    Plat2 = 0.6,                  # prevalence of higher protected
                    P0 = 0.2,                     # probability of low biomarker response
                    P2 = 0.6,                     # probability of high biomarker response
                    sens = 0.8, spec = 0.8, FP0 = 0, FN2 = 0,
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "trichotomous")    # "continuous" by default

Plot power curves with `plotPowerTri()`

Basic plotting functions are included in the package to aid with visualizing results. plotPowerTri plots the power curve against the CoR relative risk, RR_t, for trichotomous or binary biomarkers.

Output from computePower() can be saved as an object and assigned to the outComputePower input parameter.

plotPowerTri(outComputePower = pwr,  # 'computePower' output list of lists
             legendText = paste0("Control:Case = ", c("5:1", "3:1", "1:1")))

Alternatively, output from computePower() can be saved in RData files. In this case, the outComputePower input parameter should be the name(s) of the output file(s), and the outDir input parameter should be the name(s) of the file location(s). For more information, visit the plotPowerTri() help page.

computePower(..., saveDir = "myDir", saveFile = c("myFile1.RData", "myFile2.RData", "myFile3.RData"))
plotPowerTri(outComputePower = c("myFile1.RData", "myFile2.RData", "myFile3.RData"), # 'computePower' output files
             outDir = rep("~/myDir", 3),                           # path to each myFilex.RData
             legendText = paste0("Control:Case = ", c("5:1", "3:1", "1:1")))

Scenario 2: vary Sens and Spec (Approach 1) | Trichotomous S(1), without replacement sampling

pwr <- computePower(nCasesTx = 32,                 
                    nControlsTx = 3654,            
                    nCasesTxWithS = 32,           
                    controlCaseRatio = 5,         # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk from tau - taumax
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = 0.2,                  # prevalence of lower protected
                    Plat2 = 0.6,                  # prevalence of higher protected
                    P0 = 0.2,                     # probability of low biomarker response
                    P2 = 0.6,                     # probability of high biomarker response
                    sens = c(1, 0.9, 0.8, 0.7), spec = c(1, 0.9, 0.8, 0.7), 
                    FP0 = c(0, 0, 0, 0), FN2 = c(0, 0, 0, 0),
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "trichotomous")    # "continuous" by default

plotPowerTri(outComputePower = pwr,    
             legendText = paste0("Sens = Spec = ", c(1, 0.9, 0.8, 0.7)))

Scenario 3: vary P₀^lat, P₂^lat, P₀, P₂ (Approach 1) | Trichotomous S(1), without replacement sampling

pwr <- computePower(nCasesTx = 32,                 
                    nControlsTx = 3654,            
                    nCasesTxWithS = 32,           
                    controlCaseRatio = 5,         # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk from tau - taumax
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = c(0.05, 0.1, 0.15, 0.2),          
                    Plat2 = c(0.15, 0.3, 0.45, 0.6),          
                    P0 = c(0.05, 0.1, 0.15, 0.2),            
                    P2 = c(0.15, 0.3, 0.45, 0.6),          
                    sens = 0.8, spec = 0.8, FP0 = 0, FN2 = 0,
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "trichotomous")    # "continuous" by default

plotPowerTri(outComputePower = pwr, 
             legendText = c("Plat0=0.05, Plat2=0.15", 
                            "Plat0=0.1, Plat2=0.3", 
                            "Plat0=0.15, Plat2=0.45", 
                            "Plat0=0.2, Plat2=0.6"))

Scenario 4: vary ρ (Approach 2) | Trichotomous S(1), without replacement sampling

Run simulations and compute power with `computePower()`

pwr <- computePower(nCasesTx = 32,                 
                    nControlsTx = 3654,            
                    nCasesTxWithS = 32,           
                    controlCaseRatio = 5,         # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk from tau - taumax 
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = 0.2,                  # prevalence of lower protected
                    Plat2 = 0.6,                  # prevalence of higher protected
                    P0 = 0.2,                     # probability of low biomarker response
                    P2 = 0.6,                     # probability of high biomarker response
                    sigma2obs = 1,                # variance of observed biomarker S(1)
                    rho = c(1, 0.9, 0.7, 0.5),    # protection-relevant fraction of variance of S(1)
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "trichotomous")    # "continuous" by default

Plot power curves with `plotPowerTri()`

plotPowerTri(outComputePower = pwr,   
             legendText = paste0("rho = ", c(1, 0.9, 0.7, 0.5)))

Plot RR_t vs. RR₂^lat/RR₀^lat with `plotRRgradVE()`

plotRRgradVE() plots the ratio of relative risks for the higher and lower latent subgroups RR₂^lat/RR₀^lat against the CoR relative risk effect size RR_t = risk₁(2)/risk₁(0).

Output from computePower() can be saved as an object and assigned to the outComputePower input parameter.

plotRRgradVE(outComputePower = pwr,  # 'computePower' output list of lists
             legendText = paste0("rho = ", c(1, 0.9, 0.7, 0.5)))

computePower(..., saveDir = "myDir", saveFile = "myFile.RData")
plotRRgradVE(outComputePower = paste0("myFile_rho_", c(1, 0.9, 0.7, 0.5), ".RData"),    # files with 'computePower' output
             outDir = "~/myDir",            # path to myFile.RData
             legendText = paste0("rho = ", c(1, 0.9, 0.7, 0.5)))

Plot ROC curves with `plotROCcurveTri()`

plotROCcurveTri() plots the receiver operating characteristic (ROC) curve displaying sensitivity and specificity for a range of values for P2, P0, Plat2, and rho. For more information, visit the plotROCcurveTri() help page.

plotROCcurveTri(Plat0 = 0.2, 
                Plat2 = c(0.2, 0.3, 0.4, 0.5), 
                P0 = seq(0.90, 0.10, len=25), 
                P2 = seq(0.10, 0.90, len=25), 
                rho = c(1, 0.9, 0.7, 0.5))

Scenario 5: vary P₀^lat, P₀, P₂^lat, P₂ (Approach 2) | Trichotomous S(1), without replacement sampling

pwr <- computePower(nCasesTx = 32,                 
                    nControlsTx = 3654,            
                    nCasesTxWithS = 32,           
                    controlCaseRatio = 5,         # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk from tau - taumax 
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = c(0.05, 0.1, 0.15, 0.2),          
                    Plat2 = c(0.15, 0.3, 0.45, 0.6),          
                    P0 = c(0.05, 0.1, 0.15, 0.2),            
                    P2 = c(0.15, 0.3, 0.45, 0.6), 
                    sigma2obs = 1,                # variance of observed biomarker S(1)
                    rho = 0.9,                    # protection-relevant fraction of variance of S(1)
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "trichotomous")    # "continuous" by default

plotPowerTri(outComputePower = pwr,  
             legendText = c("Plat0=0.05, Plat2=0.15", 
                            "Plat0=0.1, Plat2=0.3", 
                            "Plat0=0.15, Plat2=0.45", 
                            "Plat0=0.2, Plat2=0.6"))

Scenario 6: vary n_cases, 1 (Approach 2) | Trichotomous S(1), without replacement sampling

pwr <- computePower(nCasesTx = c(25, 32, 35, 40),             
                    nControlsTx = c(3661, 3654, 3651, 3646),  
                    nCasesTxWithS = c(25, 32, 35, 40),       
                    controlCaseRatio = 5,         # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk fom tau - taumax
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = 0.2,                  # prevalence of lower protected
                    Plat2 = 0.6,                  # prevalence of higher protected
                    P0 = 0.2,                     # probability of low biomarker response
                    P2 = 0.6,                     # probability of high biomarker response
                    sigma2obs = 1,                # variance of observed biomarker S(1)
                    rho = 0.9,                    # protection-relevant fraction of variance of S(1)
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "trichotomous")    # "continuous" by default

plotPowerTri(outComputePower = pwr,   
             legendText = paste0("nCasesTx = ", c(25, 32, 35, 40)))

Illustration: `CoRpower` for binary S(1) | Without replacement sampling

Achieved by selecting P₀^lat, P₂^lat, P₀, P₂ such that $$ \begin{align} P_0^{lat} + P_2^{lat} &= 1\\ P_0 + P_2 &= 1 \end{align} $$

Approach 2 (σ_obs² and ρ specified)

Scenario 7: vary n_cases, 1

Scenario 7: vary n_cases, 1 (Approach 2) | Binary S(1), without replacement sampling

Run simulations and compute power with `computePower()`

pwr <- computePower(nCasesTx = c(25, 32, 35, 40),             
                    nControlsTx = c(3661, 3654, 3651, 3646),  
                    nCasesTxWithS = c(25, 32, 35, 40),       
                    controlCaseRatio = 5,         # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk from tau - taumax
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = 0.2,                  # prevalence of lower protected
                    Plat2 = 0.8,                  # prevalence of higher protected
                    P0 = 0.2,                     # probability of low biomarker response
                    P2 = 0.8,                     # probability of high biomarker response
                    sigma2obs = 1,                # variance of observed biomarker S(1)
                    rho = 0.9,                    # protection-relevant fraction of variance of S(1)
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "binary")          # "continuous" by default

Plot power curves with `plotPowerTri()`

plotPowerTri(outComputePower = pwr,   
             legendText = paste0("nCasesTx = ", c(25, 32, 35, 40)))

Algorithm for continuous biomarker S^*(1) | Without replacement sampling

Specify overall VE between τ and τ_max
- Protocol-specified design alternative or $\widehat{VE}$
Specify risk₀
- Protocol-specified placebo-group endpoint rate or $\widehat{risk}_0$
Specify P_lowestVE^lat, ρ, and a grid of VE_lowest values (e.g., ranging from VE to 0)
- Fixed (VE, risk₀, P_lowestVE^lat, VE_lowest, ρ) and $$ \begin{align} risk^{lat}_1(x^{\ast}) &= (1 - VE_{lowest}) risk_0,\quad x^{\ast} \leq \nu\\ \mathop{\mathrm{logit}}\{risk^{lat}_1(x^{\ast})\} &= \alpha^{lat}+\beta^{lat}x^{\ast},\quad x^{\ast} \geq \nu\\ VE &= 1 - \frac{\int risk^{lat}_1(x^{\ast})\phi(x^{\ast}/\sqrt{\rho} \sigma_{obs})\mathop{\mathrm{d}}\!x^{\ast}}{risk_0} \end{align} $$ yield α^lat and β^lat
- Plot VE_x^*^lat vs. x^* and calculate the pertaining CoR effect size exp (β^lat) for each level of VE_lowest
Simulate M data sets under the true parameter values:
1. Sample X^* for n_cases, 1 cases from f_X^*(x^*|Y = 1, Y^τ = 0, Z = 1) using Bayes rule
2. Sample X^* for n_{controls, 1} controls from f_X^*(x^*|Y = 0, Y^τ = 0, Z = 1) using Bayes rule
  - How? Use a fine grid of x̃^* values and then
  sample(x̃^*, prob=f_X^*(x̃^*|Y = ⋅, Y^τ = 0, Z = 1), replace=TRUE)
3. Sample S^*(1) following S^*(1) = X^* + e
4. Sample without replacement n_cases, 1^S and n_{controls, 1}^S = Kn_cases, 1^S controls with measured S^*(1) (R = 1)
For each observed data set, compute the 1-sided one-degree-of-freedom Wald test statistic for H₀ ⇔ {H̃₀ : β_S^*(1) ≥ 0} from IPW logistic regression model that accounts for biomarker sampling design (function tps in R package osDesign)
Compute power as proportion of data sets with 1-sided Wald test p ≤ α/2 for specified α
Repeat power calculation varying control:case ratio, n_cases, 1, n_cases, 1^S, P_lowestVE^lat, ρ

Illustration: `CoRpower` for continuous S^*(1) | Without replacement sampling

Scenario 8: vary ρ
Scenario 9: vary P_lowestVE^lat

Scenario 8: vary ρ | Continuous S^*(1), without replacement sampling

Run simulations and compute power with `computePower()`

pwr <- computePower(nCasesTx = 32,             
                    nControlsTx = 3654,         
                    nCasesTxWithS = 32,         
                    controlCaseRatio = 5,        # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,            # overall VE
                    risk0 = 0.034,               # placebo-group endpoint risk from tau - taumax
                    PlatVElowest = 0.2,          # prevalence of VE_lowest
                    VElowest = seq(0, VEoverall, len=100), # lowest VE for true biomarker X*<=nu
                    sigma2obs = 1,               # variance of observed biomarker S
                    rho = c(1, 0.9, 0.7, 0.5)    # protection-relevant fraction of variance of S
                    M = 1000,                    # number of simulated clinical trials
                    alpha = 0.05,                # two-sided Wald test Type 1 error rate
                    biomType = "continuous")     # "continuous" by default

Plot power curves with `plotPowerCont()`

plotPowerCont() plots the power curve against the CoR relative risk, RR_c, for continuous biomarkers.

Output from computePower() can be saved as an object and assigned to the outComputePower input parameter. In this scenario, since computePower() was run multiple times to vary the controls:cases ratio, these multiple output objects can be read into the function as a list.

plotPowerCont(outComputePower = pwr,          # output list of lists from 'computePower'
              legendText = paste0("rho = ", c(1, 0.9, 0.7, 0.5)))

computePower(..., saveDir = "myDir", saveFile = "myFile.RData")
plotPowerCont(outComputePower = paste0("myFile_rho_", c(1, 0.9, 0.7, 0.5), ".RData"),     # files with 'computePower' output
              outDir = "~/myDir",             # path to myFile.RData
              legendText = paste0("rho = ", c(1, 0.9, 0.7, 0.5)))

Plot VE_x^*^lat curves with `plotVElatCont()`

plotVElatCont() plots the vaccine efficacy (VE) curve for the true biomarker X=x for eight different values of the true CoR relative risk, , in vaccine recipients and the lowest vaccine efficacy level for the true biomarker, .

outComputePower contains output from a single run of computePower() with no varying argument (i.e., no vectorized input parameters other than VE₀^lat, VE₁^lat, and ). This output can be in the form of an assigned object, which is a list of lists of length 1, or a character string specifying the file containing the output. Note that this is unlike plotPowerTri() and plotPowerCont(), which can take in output from computePower() in the form of a list of lists of length greater than 1 or a character vector. For more information, visit the plotVElatCont() help page.

Using the function when computePower() output is saved as list object pwr:

plotVElatCont(outComputePower = pwr)

Using the function when computePower() output is saved in a file with name “myFile” and location “~/myDir”:

computePower(..., saveDir = "myDir", saveFile = "myFile.RData")
plotVElatCont(outComputePower = "myFile.RData",   
              outDir = "~/myDir")

Scenario 9: vary P_lowestVE^lat | Continuous S^*(1), without replacement sampling

Run simulations and compute power with `computePower()`

pwr <- computePower(nCasesTx = 32,               
                    nControlsTx = 3654,         
                    nCasesTxWithS = 32,         
                    controlCaseRatio = 5,        # n^S_controls : n^S_cases ratio
                    VEoverall = 0.75,            # overall VE
                    risk0 = 0.034,               # placebo-group endpoint risk from tau - taumax
                    PlatVElowest = c(0.05, 0.1, 0.15, 0.2),         
                    VElowest = seq(0, VEoverall, len=100), # lowest VE for true biomarker X*<=nu
                    sigma2obs = 1,               # variance of observed biomarker S(1)
                    rho = 0.9                    # protection-relevant fraction of variance of S(1)
                    M = 1000,                    # number of simulated clinical trials
                    alpha = 0.05,                # two-sided Wald test Type 1 error rate
                    biomType = "continuous")     # "continuous" by default

Plot power curves with `plotPowerCont()`

plotPowerCont(outComputePower = pwr,          # output list of lists from 'computePower'
              legendText = paste0("PlatVElowest = ", c(0.05, 0.1, 0.15, 0.2)))

Bernoulli / case-cohort sampling of S(1) (or S^*(1))

Bernoulli sample at baseline with sampling probability p
S(1) (or S^*(1)) measured at τ in
- a subset of the sample with Y^τ = 0, and
- all cases with Y^τ = 0
Implications:
- n_cases, 1 = n_cases, 1^S
- design parameter n_{controls, 1}^S replaced by probability p because n_{controls, 1}^S is a random variable in case-cohort designs

Illustration: `CoRpower` for trichotomous S(1) and continuous S^*(1) | Bernoulli sampling

Trichotomous S(1) (Approach 1)

Scenario 10: vary p

Continuous S^*(1)

Scenario 11: vary p

Scenario 10: vary p (Approach 1) | Trichotomous S(1), Bernoulli sampling

Run simulations and compute power with `computePower()`

pwr <- computePower(nCasesTx = 32,             
                    nControlsTx = 3654,       
                    nCasesTxWithS = 32,       
                    cohort = TRUE,                # FALSE by default
                    p = c(0.01, 0.02, 0.03, 0.05),               
                    VEoverall = 0.75,             # overall VE
                    risk0 = 0.034,                # placebo-group endpoint risk from tau - taumax
                    VElat0 = seq(0, VEoverall, len=100), # grid of VE (V/P) among lower protected
                    VElat1 = rep(VEoverall, 100), # grid of VE (V/P) among medium protected
                    Plat0 = 0.2,                  # prevalence of lower protected
                    Plat2 = 0.6,                  # prevalence of higher protected
                    P0 = 0.2,                     # probability of low biomarker response
                    P2 = 0.6,                     # probability of high biomarker response
                    sens = 0.8, spec = 0.8, FP0 = 0, FN2 = 0,
                    M = 1000,                     # number of simulated clinical trials
                    alpha = 0.05,                 # two-sided Wald test Type 1 error rate
                    biomType = "trichotomous")    # "continuous" by default

Plot power curves with `plotPowerTri()`

plotPowerTri(outComputePower = pwr,  # 'computePower' output
             legendText = paste0("Cohort p = ", c(0.01, 0.02, 0.03, 0.05)))

Scenario 11: vary p | Continuous S^*(1), Bernoulli sampling

Run simulations and compute power with `computePower()`

pwr <- computePower(nCasesTx = 32,             
                    nControlsTx = 3654,        
                    nCasesTxWithS = 32,       
                    cohort = TRUE,               # FALSE by default
                    p = c(0.01, 0.02, 0.03, 0.05),                  
                    VEoverall = 0.75,            # overall VE
                    risk0 = 0.034,               # placebo-group endpoint risk from tau - taumax
                    PlatVElowest = 0.2,          # prevalence of VE_lowest
                    VElowest = seq(0, VEoverall, len=100), # lowest VE for true biomarker X*<=nu
                    sigma2obs = 1,               # variance of observed biomarker S(1)
                    rho = 0.9                    # protection-relevant fraction of variance of S(1)
                    M = 1000,                    # number of simulated clinical trials
                    alpha = 0.05,                # two-sided Wald test Type 1 error rate
                    biomType = "continuous")     # "continuous" by default

Plot power curves with `plotPowerCont()`

plotPowerCont(outComputePower = pwr,  # 'computePower' output
              legendText = paste0("Cohort p = ", c(0.01, 0.02, 0.03, 0.05)))

Introduction to R Package CoRpower

Set-up and notation | Without replacement sampling

Algorithm for trichotomous biomarker S(1) | Without replacement sampling

Illustration: hypothetical randomized placebo-controlled VE trial

Trial design

Illustration: calculation of input parameters with computeN()

Assumptions

Number of vaccine recipients observed to be at risk at τ

Number of observed cases in vaccine recipients between τ and τmax

Number of observed controls in vaccine recipients between τ and τmax

Number of observed cases (controls) in vaccine recipients between τ and τmax with measured S(1)

Compute N1, ncases, 1, ncontrols, 1, and ncases, 1S with computeN()

Illustration: CoRpower for trichotomous S(1) | Without replacement sampling

Scenario 1: vary control:case ratio (Approach 1) | Trichotomous S(1), without replacement sampling

Run simulations and compute power with computePower()

Plot power curves with plotPowerTri()

Scenario 2: vary Sens and Spec (Approach 1) | Trichotomous S(1), without replacement sampling

Scenario 3: vary P0lat, P2lat, P0, P2 (Approach 1) | Trichotomous S(1), without replacement sampling

Scenario 4: vary ρ (Approach 2) | Trichotomous S(1), without replacement sampling

Run simulations and compute power with computePower()

Plot power curves with plotPowerTri()

Plot RRt vs. RR2lat/RR0lat with plotRRgradVE()

Plot ROC curves with plotROCcurveTri()

Scenario 5: vary P0lat, P0, P2lat, P2 (Approach 2) | Trichotomous S(1), without replacement sampling

Scenario 6: vary ncases, 1 (Approach 2) | Trichotomous S(1), without replacement sampling

Illustration: CoRpower for binary S(1) | Without replacement sampling

Scenario 7: vary ncases, 1 (Approach 2) | Binary S(1), without replacement sampling

Run simulations and compute power with computePower()

Plot power curves with plotPowerTri()

Algorithm for continuous biomarker S*(1) | Without replacement sampling

Illustration: CoRpower for continuous S*(1) | Without replacement sampling

Scenario 8: vary ρ | Continuous S*(1), without replacement sampling

Run simulations and compute power with computePower()

Plot power curves with plotPowerCont()

Plot VEx*lat curves with plotVElatCont()

Scenario 9: vary PlowestVElat | Continuous S*(1), without replacement sampling

Run simulations and compute power with computePower()

Plot power curves with plotPowerCont()

Bernoulli / case-cohort sampling of S(1) (or S*(1))

Illustration: CoRpower for trichotomous S(1) and continuous S*(1) | Bernoulli sampling

Scenario 10: vary p (Approach 1) | Trichotomous S(1), Bernoulli sampling

Run simulations and compute power with computePower()

Plot power curves with plotPowerTri()

Scenario 11: vary p | Continuous S*(1), Bernoulli sampling

Run simulations and compute power with computePower()

Plot power curves with plotPowerCont()

Introduction to R Package `CoRpower`

Illustration: calculation of input parameters with `computeN()`

Number of observed cases in vaccine recipients between τ and τ_max

Number of observed controls in vaccine recipients between τ and τ_max

Number of observed cases (controls) in vaccine recipients between τ and τ_max with measured S(1)

Compute N₁, n_cases, 1, n_{controls, 1}, and n_cases, 1^S with `computeN()`

Illustration: `CoRpower` for trichotomous S(1) | Without replacement sampling

Run simulations and compute power with `computePower()`

Plot power curves with `plotPowerTri()`

Scenario 3: vary P₀^lat, P₂^lat, P₀, P₂ (Approach 1) | Trichotomous S(1), without replacement sampling

Run simulations and compute power with `computePower()`

Plot power curves with `plotPowerTri()`

Plot RR_t vs. RR₂^lat/RR₀^lat with `plotRRgradVE()`

Plot ROC curves with `plotROCcurveTri()`

Scenario 5: vary P₀^lat, P₀, P₂^lat, P₂ (Approach 2) | Trichotomous S(1), without replacement sampling

Scenario 6: vary n_cases, 1 (Approach 2) | Trichotomous S(1), without replacement sampling

Illustration: `CoRpower` for binary S(1) | Without replacement sampling

Scenario 7: vary n_cases, 1 (Approach 2) | Binary S(1), without replacement sampling

Run simulations and compute power with `computePower()`

Plot power curves with `plotPowerTri()`

Algorithm for continuous biomarker S^*(1) | Without replacement sampling

Illustration: `CoRpower` for continuous S^*(1) | Without replacement sampling

Scenario 8: vary ρ | Continuous S^*(1), without replacement sampling

Run simulations and compute power with `computePower()`

Plot power curves with `plotPowerCont()`

Plot VE_x^*^lat curves with `plotVElatCont()`

Scenario 9: vary P_lowestVE^lat | Continuous S^*(1), without replacement sampling

Run simulations and compute power with `computePower()`

Plot power curves with `plotPowerCont()`

Bernoulli / case-cohort sampling of S(1) (or S^*(1))

Illustration: `CoRpower` for trichotomous S(1) and continuous S^*(1) | Bernoulli sampling

Run simulations and compute power with `computePower()`

Plot power curves with `plotPowerTri()`

Scenario 11: vary p | Continuous S^*(1), Bernoulli sampling

Run simulations and compute power with `computePower()`

Plot power curves with `plotPowerCont()`