Preface

We consider a 2-arm group sequential trial with an experimental and control group and a time-to-event endpoint. Such trials are common, for example, in cardiovascular disease and oncology. For instance, a drug for improving lipids could be compared to a standard drug to see if cardiovascular endpoints such as heart attacks or death could be prevented. In oncology trials, a new treatment might be compared to a control to see if survival or time-to-progression might be extended. Such trials can be large and complex. This technical report documents software in the gsDesign R package for designing such trials. In fact, the report is intended as a tool to thoroughly document the sample size functionality in the gsDesign R package that is supported by the nSurv() and gsSurv() functions. Some common issues that arise that are dealt with by this software are:

In the following, we assume a given Type I error, timing of interim analyses, endpoint event rates and censoring rates. There are two general approaches to carrying out calculations.

  1. Assuming fixed accrual duration and minimum follow-up duration determine the accrual rate and sample size required to provide the desired power.
  2. Assume given accrual rates over time and vary accrual or follow-up duration to achieve a desired power.

A group sequential design is generally defined by Type I and Type II error, the number and timing of interim analyses and the boundaries at interim analyses. For many endpoint types, timing is linear in the number of evaluable observations. For time-to-event endpoints, timing is related to the number of endpoint events available for analysis, which is generally a non-linear function of accrual rates and duration, endpoint event rates, censoring rates and duration of follow-up after accrual termination. Lachin and Foulkes (1986) have provided a general formulation for calculating the expected number of events over time in the scenarios described above. They also derive the sample size method used here for the case where enrollment and study durations are fixed, which requires deriving enrollment rates required to achieve the desired Type I error and power. Their methods for calculating power are extended to compute power for similarly complex designs, but where enrollment or follow-up duration are varied to achieve power given fixed enrollment rates; such methods are analogous to those of Kim and Tsiatis (1990).

This document is organized based on supporting the following calculations:

Each section presents extensive examples to show how to use the key routines.

Summary of updates

  • August, 2015. Updated nEventsIA() to work correctly for stratified populations when the simple = TRUE option is chosen. This also impacted tEventsIA() for stratified populations.
  • August, 2023 (ONGOING)
    • Updated gsSurv() to enable power computation
    • Created gsSurvCalendar() to enable calendar-based interim analysis planning
    • Incorporated revisions of code that have been put into gsDesign package
    • Fixing stratified sample size computation
    • Added EDC0, EDC1 to output of gsSurv() to match help file output specification.
  • April, 2025. Migrate from Sweave document to Quarto book.