| | Background and methodology of MONITOR-GCSF, a pharmaco-epidemiological study of the multi-level determinants, predictors, and clinical outcomes of febrile neutropenia prophylaxis with biosimilar granulocyte-colony stimulating factor filgrastimAccepted 27 January 2010. published online 02 March 2010. Corrected Proof Abstract The MONITOR-GCSF study is an international, prospective, observational, pharmaco-epidemiological study to evaluate the multi-level factors and outcomes associated with the use of Zarzio® in the prophylaxis of febrile neutropenia in chemotherapy-treated cancer patients. Driven by a novel, integrated, multi-focal framework for post-approval observational studies, it examines determinants of response at both the patient and the physician level; integrates statistical methodologies from the social and behavioral sciences; assesses factors predictive of poor treatment response; and evaluates the congruence of treatment with EORTC guidelines and the approved label. This pan-European study will recruit at least 1000 patients from a minimum of 75 centers and follow them for maximum 6 cycles of chemotherapy. Apart from descriptive and associative procedures, statistical analysis will include variance attribution methods; hierarchical linear, logistic, and Poisson modeling; Kaplan–Meier time-to-event analysis, Mantel–Cox log-rank or generalized Wilcoxon–Breslow tests, and Cox proportional hazards modeling; and clustering and related data mining techniques. 1. Background  Chemotherapy-induced neutropenia is a frequent and potentially life-threatening complication experienced in patients undergoing cancer treatment. It regularly evolves into febrile neutropenia, a medical emergency most often requiring hospitalization. Febrile neutropenia may also trigger reductions in chemotherapy dose intensity or delays in chemotherapy regimens [1], [2] as well as delays and cancellations of surgery, thus jeopardizing the effectiveness of antineoplastic treatment. Febrile neutropenia is associated with increased morbidity, mortality, and health care costs [3]; and impacts on quality of life by increasing anxiety, depression, and symptom burden, and by decreasing activity level [4], [5]. The U.S. National Cancer Institute scale for severity of neutropenia distinguishes between four grades of chemotherapy-induced neutropenia based on absolute neutrophil count: grade 0 (≥2000/mm3 or ≥2.0 × 109/L), grade 1 (≥1500 to <2000/mm3 or ≥1.5 to <2.0 × 109/L), grade 2 (≥1000 to <1500/mm3 or ≥1.0 to <1.5 × 109/L), grade 3 (≥500 to <1000/mm3 or ≥0.5 to <1.0 × 109/L), and grade 4 <500/mm3 or <0.5 × 109/L [6]. Febrile neutropenia is defined as a patient on chemotherapy with grade 4 neutropenia and an axillary temperature ≥38.5 °C or two or more febrile episodes at ≥38 °C within a 12-h period [7]. Both patient- and regimen-specific risk factors for developing chemotherapy-induced neutropenia have been identified [8], [11], which enables clinicians to quantify subsequent febrile neutropenia risk and to initiate prophylactic treatment. Colony stimulating factors (CSFs) are biological growth factors that stimulate the production of white blood cells. Granulocyte-colony stimulating factor (G-CSF), granulocyte-macrophage colony stimulating factor (GM-CSF), and more recently pegylated G-CSF (pGCSF) have been approved for use in prophylactic or therapeutic management of febrile neutropenia [1], [9], [10], [11], [12]. CSFs promote the proliferation, differentiation, and activation of neutrophils in the bone marrow. The ensuing shorter transition time from stem cell to mature neutrophil results in a larger number of functional and mature circulating neutrophils [13]. The most common side effects are bone, joint and muscle pain, elevations in blood levels of uric acid and liver enzymes, decreases in blood glucose, leucocytosis, thrombocytopenia, anemia, headache, nose bleed, and enlarged spleen. All CSFs are recommended equally for preventing and managing febrile neutropenia [10]. Two decades of research have documented the efficacy of CSFs in both treating and preventing febrile neutropenia in cancer patients [11]. In two randomized clinical trials the incidence of febrile neutropenia in patients not receiving CSFs prophylactically exceeded 40% [14], [15]. The incidence of serious infection in patients with either grade 3 or grade 4 neutropenia was 80–100% [16]. Mortality in patients who develop febrile neutropenia is 8–10% [17] and has been found to be as high as 82% in high-risk patients [3]. Primary prophylaxis using CSFs has been shown to decrease the risk of febrile neutropenia by as much as 50% [3], [11], [17], the severity and duration of neutropenia [3], and the number of chemotherapy dose reductions or treatment delays [5]. In an effort to standardize care and promote optimal patient outcomes, evidence-based practice guidelines regarding the primary and secondary prophylaxis and treatment of both febrile and afebrile neutropenia have been developed by the European Organization for Research and Treatment of Cancer (EORTC) [10], the American Society of Clinical Oncology (ASCO) [18], the National Comprehensive Cancer Network (NCCN) [19] and the European Society of Medical Oncology (ESMO) [20]. Though largely convergent in their clinical guidance, these guidelines differ with regard to primary prophylaxis. The ESMO guidelines recommend that CSFs be administered if a patient's risk for febrile neutropenia is >40% [20]. In contrast, the ASCO, EORTC, and NCCN guidelines recommend treatment with CSFs when the risk of febrile neutropenia is >20% [10], [18], [19]. The EORTC and NCCN guidelines also advise that CSFs be considered if a patient's febrile neutropenia risk is between 10 and 20% and the patient presents with other compromising factors (e.g., advanced age, history of febrile neutropenia, etc.) [10], [19]. In fact, it was argued recently that prophylaxis with myeloid growth factors should not be based solely on the chemotherapy-associated risk of febrile neutropenia, but should include such factors as host, age, performance status, and comorbidities including organ failures [21], though further evidence is needed. It has been shown that managing anemia in chemotherapy-treated cancer patients in congruence with the applicable EORTC guidelines is associated with better hemoglobin outcomes [22]. However, the extent to which prophylaxis of febrile neutropenia in cancer patients is congruent with guidelines, and how this affects patient outcomes, has not been studied. There is a need for prospective studies evaluating prophylaxis options and patterns [11], response to prophylaxis with G-CSF, congruence with guidelines and associated patient outcomes, while also examining patient risk profiles. This will enable a better understanding of the multitude of “real-world” factors at the patient, provider, and center levels associated with treatment outcomes; promote the attainment of treatment goals and targets; and reduce the risks and sequelae associated with febrile neutropenia. In addition to these scientific issues, the recent approval of the first biosimilar CSF filgrastim (marketed as Zarzio®) brings with it the need for observational studies to examine how Zarzio® is used in daily practice and the clinical outcomes achieved. This agent was approved by the European Medicines Evaluation Agency following the Agency's biosimilar approval pathway [23]. Phase I studies confirmed bioequivalence in terms of pharmacodynamics and pharmacokinetics of Zarzio® and the reference product (Neupogen®) under intravenous and subcutaneous administration. In an open single-arm multicenter Phase III study in breast cancer patients, Zarzio® showed to be efficacious and safe [24]. 2. Integrated framework for observational effectiveness studies  Randomized controlled trials (RCT) are the indicated method for determining the efficacy of pharmacological agents. However, by necessity RCTs are constrained in terms of patients and clinicians included, and treatments must be limited to the agent under investigation so as to be able to draw unconfounded efficacy inferences. Observational studies are needed to examine the effectiveness of drugs previously documented to be efficacious: how a treatment works under ordinary and variable conditions, prescribed by licensed clinicians with varying degrees of expertise and practicing across the spectrum of health care settings, to treat a heterogeneity of eligible patients. Most observational studies focus narrowly on evaluating a treatment's effectiveness under “real-world” conditions, leaving key questions unanswered. The question of “whether the treatment works?”, while critical, does not address the equally important questions of “when does the treatment work, and when not?”, “in whom does the treatment work, and in whom not?”, “why does the treatment work in some patients but not in others?”, “why does the treatment work with some clinicians but not with others?”, and “why is the treatment tolerated by some patients but not by others?” To answer these questions, the MONITOR-GCSF study has adopted an integrated framework for observational effectiveness studies, used now in several studies, that has become a de facto quality model to assure clinical relevance, scientific value, and technical merit (see Fig. 1). 3. Aims of the MONITOR-GCSF study  Within the framework (Fig. 1), MONITOR-GCSF is an international, prospective, observational, open-label, pharmaco-epidemiologic study of cancer patients at risk for chemotherapy-induced febrile neutropenia who, in their treating physician's best clinical judgment, are receiving commercially available biosimilar filgrastim (marketed as Zarzio®) for prophylactic purposes. In first instance, the MONITOR-GCSF study aims to describe the patient population at risk for febrile neutropenia and treated prophylactically with Zarzio®; describe prophylaxis patterns involving Zarzio®; and identify the multi-level determinants of variability in hematology levels and variability in outcomes: absolute neutrophil count; impact on chemotherapy delivery, surgery, and radiotherapy; and mortality treatment-related mortality (cancer-related mortality; mortality due to causes other than G-CSF treatment, chemotherapy, or cancer; and overall mortality). Secondarily, the MONITOR-GCSF study aims to identify patient cohorts who are vulnerable to poor prophylaxis response and experience break-through episodes of febrile neutropenia; understand the differences between those patients who respond and those who do not respond to Zarzio®; and describe the degree to which prophylaxis of febrile neutropenia is in congruence with guideline recommendations. By focusing on patients treated prophylactically, the MONITOR-GCSF study aims to extend the Prospective Observational European Neutropenia Study conducted by the Impact of Neutropenia in Chemotherapy European Study Group (INC-EU; http://www.inceu.org) [25]. Importantly so but also with the inherent limitations of scope, the INC-EU study examines only the incidence of severe neutropenia in chemotherapy-treated cancer patients; its impact on dose delays and reductions; the associations between risk factors, occurrence, and impaired chemotherapy delivery; and the development of deterministic risk models. The study's specific objectives and associated research questions, as derived from the framework (Fig. 1), are summarized in Table 1. | | |  | | Objective | Research questions |  |
|---|
 | Primary objectives |  |  | Case finding and patient description |  |  |  Objective 1 | To describe the cancer patients requiring chemotherapy who, in their treating physician's best clinical judgment, are receiving Zarzio® for the prophylaxis of febrile neutropenia in terms of demographics, clinical status, medical history, concomitant comorbid conditions and current status of disease, and prior and concomitant medications. | What are the demographics, clinical status, medical history, concomitant comorbid conditions and current status of disease, and prior and concomitant medications of cancer patients at risk for febrile neutropenia receiving Zarzio® prophylactically. |  |  |
|  |  | Treatment patterns |  |  |  Objective 2 | To describe Zarzio® prophylaxis patterns for febrile neutropenia over up to 6 cycles of chemotherapy, with or without prior, concurrent, or later radiotherapy or surgery. | What different prophylaxis patterns involving Zarzio® for febrile neutropenia can be identified over up to 6 cycles of chemotherapy; with prophylaxis patterns defined as the aggregate patterns of dosing and duration of Zarzio®, any dosing modifications, and concomitant medications? Is there variability in Zarzio® patterns in patients at risk for febrile neutropenia? |  |  |
|  |  | Congruence of treatment with guidelines and dosing |  |  |  Objective 3 | To determine the extent to which the prophylaxis of febrile neutropenia in cancer patients is in congruence with the EORTC best practice guidelines and dosing recommendations, and whether this is associated with better outcomes. | To what extent is the prophylaxis of febrile neutropenia in congruence with EORTC guideline recommendations? Is this associated with better treatment outcomes? Can any variability in outcomes be explained by variability in length of treatment and/or dosing. |  |  |
|  |  | Description of observed outcomes |  |  |  Objective 4 | To describe hematological outcomes observed in association with prophylactic use of Zarzio® in patients at risk for febrile neutropenia; including break-through episodes of febrile neutropenia. | What hematological outcomes are observed over the course of prophylaxis with Zarzio® in patients at risk for febrile neutropenia? What is the incidence of break-through episodes of febrile neutropenia? What are the respective proportions of patients that meet vs. those who do not meet guideline recommended outcomes? What are the observed patterns of hematological variability over the course of treatment with Zarzio®? |  |  |  Objective 5 | To describe the distribution of chemotherapy dose delays and reductions, surgery delays and cancellations, radiotherapy delays, dose reductions, and cancellations, and mortality (GCSF-related; febrile neutropenia-related; cancer-related; not related to GCSF, febrile neutropenia, or cancer; all-cause); and estimate the time-to-event for such events over the course of Zarzio® treatment. | What are the distributions of chemotherapy dose delays and reductions, surgery delays and cancellations, and radiotherapy delay, dose reductions, and cancellations, and mortality (GCSF-related; febrile neutropenia-related; cancer-related; not related to GCSF, febrile neutropenia, or cancer; all-cause) over the course of prophylactic treatment with Zarzio®, in general and in patients with break-through febrile neutropenia episodes? When do these events occur? What variables predispose patients to the occurrence of such events? |  |  |
|  |  | Multi-level determinants of outcomes |  |  |  Objective 6 | To examine the multi-level determinants (patient, center) of hematological outcomes of prophylaxis with Zarzio® to better understand the variability in outcomes achieved. | How are patient- and center-level variables, independently and/or in interaction, related to hematological outcomes over the course of prophylaxis with Zarzio®? |  |  |
|  |  | Secondary objectives |  |  | Cohort identification and differentiation |  |  |  Objective 7 | To identify different latent clusters of end-stage cancer patients receiving chemotherapy and being treated with Zarzio® for the treatment or prophylaxis of febrile neutropenia using statistical data mining techniques to profile patients based on medical history, concomitant comorbid conditions, and current clinical status. | Can two or more clusters of chemotherapy-treated cancer patients receiving Zarzio® prophylactically be identified, quantified, tested for statistically significant difference, and described in clinical terms. |  |  |
|  |  | Non-responder analyses |  |  |  Objective 8 | To model patient- and center-level variables between patients who responded and those who did not respond to prophylaxis with Zarzio®. | Are there differences in patient- and center-level variables, independently or in interaction, between responders and non-responders to prophylaxis with filgrastim in terms of hematological levels and variability therein? |  |  |  Objective 9 | To model patient- and center-level variables between patients who had chemotherapy dose delays or reductions, surgery delays and cancellations, and radiotherapy delays, dose reductions, or cancellations vs. no such events during the course of prophylaxis with Zarzio®. | Are there differences in patient- and center-level variables, independently or interaction, between patients with and without chemotherapy dose delays and dose reductions, radiotherapy delays, dose reductions, or cancellations, and surgery delays of cancellations? |  |  |  Objective 10 | To model patient- and center-level variables between patients who died vs. survived during the course of prophylaxis with Zarzio®, in all patients and those with break-through febrile neutropenia episodes. | Are there differences in patient- and center-level variables, independently or interaction, between patient (all patients and stratified by purpose of use) who died versus those who survived during the course of treatment with Zarzio®, differentiating between GCSF-related mortality; febrile neutropenia-related mortality; cancer-related mortality; mortality due to causes other than G-CSF treatment, chemotherapy, or cancer; and all-cause mortality? |  | | | |
4. Methods and patients  4.1. Study design MONITOR-GCSF is an international, prospective, observational, multi-level, pharmaco-epidemiological study in which chemotherapy-treated cancer patients are started on Zarzio® for the prophylaxis of febrile neutropenia per their prescribing physician's best clinical judgment. Potential centers and physician-investigators are identified by the local affiliate of the study sponsor (Sandoz Biopharmaceuticals, Holzkirchen, Germany) using a Study Briefing document summarizing the study (Fig. 2). The multi-level design of this observational study is warranted for both clinical and statistical reasons. Clinically, it can be assumed that patients under the care of the same physician (or, for that matter, at the same center) are uniquely and exclusively exposed to that physician's knowledge, experience, expertise, and clinical practice patterns. At the center level, patient care may be influenced by clinical policies, procedures, and protocols. Statistically, this exposure to the same physician or center means that these patients are treated with a certain communality that may be different across investigators and centers participating in the study. Extending this across physician-investigators and centers, observations on the total sample of patients are not independent, thus violating a major assumption for statistical testing. In multi-level (or hierarchical linear) modeling, the effect of class (e.g., treating physician) on patient-level outcomes are estimated before the effect of between patient variability is determined; yielding an attribution of variance and identification of level-specific predictors of patient outcomes that separates between-class and within-patient variability. Both CSF-naive patients and patients previously treated with other CSFs will be screened for eligibility. Those meeting the inclusion and exclusion criteria will be informed about the study and written informed consent will be obtained. The enrollment period is 36 months. This long period was chosen to allow sufficient time for market entry of Zarzio® in the participating countries and the recruitment of sufficient study centers. Patients will be evaluated over six chemotherapy cycles. The total duration of the study protocol is approximately 42 months. As one aim of the study is to examine the influence of center and physician data, all participating centers and physician-investigators are asked to complete some questionnaires prior to enrolling their first patient. Fig. 3 summarizes the main procedures of the MONITOR-GCSF study. Table 2 presents the study's timetable and the assessments to be performed at each time point.  | Visit | 1 | 2–5 | 6 |  |  | Chemotherapy cycle | 1 | 2 through 5 | 6 |  |  | Inclusion/exclusion criteria | X | | |  |  | Informed consent/patient permission | X | | |  |  | Demographic data | X | | |  |  | Height and weight | X | | |  |  | Body surface area | X | | |  |  | Medical history | X | | |  |  | Cancer history | X | | |  |  | History of repeated infections | X | | |  |  | Prior cancer treatments (chemo- and non-chemotherapy) | X | | |  |  | Cancer histology | X | | |  |  | Chemotherapy-induced neutropenia (febrile and afebrile) history | X | | |  |  | Prior treatments for chemotherapy-induced (febrile and afebrile) neutropenia | X | | |  |  | Cancer stage | X | | |  |  | Chemotherapy regimen (type, dose, frequency, NCI toxicity index) | X | X | X |  |  | Changes in chemotherapy regimen (incl. dose reduction and dose delays) | X | X | X |  |  | Radiotherapy regimen | X | X | X |  |  | Changes in radiotherapy regimen (incl. delays, dose reductions, and cancellations) | X | X | X |  |  | Surgery | X | X | X |  |  | Changes in surgery (incl. delays, cancellations) | X | X | X |  |  | Performance status (ECOG or Karnovsky) | X | X | X |  |  | Recent or current infections | X | X | X |  |  | Febrile neutropenia episode (start, end, duration, temperature, hospitalization, treatments) | X | X | X |  |  | Antibiotic prophylaxis | X | X | X |  |  | G-CSF prophylaxis | X | X | X |  |  | Blood and urine cultures | X | X | X |  |  | If febrile, reason for fever | X | X | X |  |  | Target ANC for Zarzio® treatment | X | X | X |  |  | Zarzio® dose and frequency | X | X | X |  |  | Concomitant Antibiotics, antivirals, antifungals, corticosteroids, and analgesics | X | X | X |  |  | Absolute neutrophil count (ANC), complete blood count (CBC), C-reactive protein, serum albumin (incl. assessment of anemia, leukocytosis, thrombocytopenia, …) | X | X | X |  |  | Bone, joint, and muscle pain | X | X | X |  |  | Headache/neurological symptoms | X | X | X |  |  | Epistaxis, skin hemorrhage, gastro-intestinal, other bleeding | X | X | X |  |  | Adherence assessment | X | X | X |  |  | Physician's questionnaire (demographics, knowledge of neutropenia guidelines) | X | | |  |  | Center characteristics, neutropenia protocols, adoption of neutropenia guidelines | X | | |  | | | |
4.2. Study populations The MONITOR-GCSF study is a pan-European study to which Austria, Belgium, France, Germany, Italy, Poland, Spain, and the United Kingdom have already committed (and assured the required sample size). Additional countries may join as Zarzio® market entry expands in the coming years. The patient inclusion criteria are the following: •Male or female adults (age ≥18 years). •Diagnosed with one of the following types and stages of tumors: stage III or IV breast cancer; stage III or IV bladder cancer; stage III or IV lung cancer; metastatic prostate cancer; and stage III or IV diffuse large B-cell lymphoma. •Scheduled for first cycle of chemotherapy regimen (first line, second line, and third line). •Contemplated to receive at least 4 cycles of chemotherapy. •Treated with commercially available Zarzio® per physician's best clinical judgment and per current European Zarzio® label. •Female patients must be either post-menopausal for 1 year or surgically sterile or using effective contraceptive methods such as barrier method with spermicide or an intra-uterine device. Oral contraceptive use is allowed. •Informed written consent to participate in the study by patients or their legal guardian. Not eligible will be patients meeting any of the following exclusion criteria: •Patients with myeloid malignancies. •Sensitivity to Zarzio® or any other CSF. •Hypersensitivity to Escherichia coli-derived proteins. •Treated with dose-dense chemotherapy regimen. •Radiotherapy to ≥20% of total body bone. •Infection within 2 weeks of starting current line of chemotherapy. •Patients with medical condition(s) that in view of the investigator prohibits participation in the study. •Patients with willfully negligent nonadherence to their cancer treatment. •Use of any investigational agent in the 30 days prior to enrollment. •Women of childbearing potential not using the contraception method(s) described above. •Women who are breastfeeding. While patients are the main study population, the multi-level design of the study also requires a minimum number of physician-investigators. At least 1000 patients (allowing for a 25% attrition rate) are to be recruited by a minimum of 75 physicians to achieve the statistical power needed to meet the study objectives. To enable the greatest possible diversity in physician-investigators so as to better reflect the heterogeneity of providers, physician criteria are limited to any person licensed to practice medicine in his/her country of origin and practicing, at least part-time, in oncology and/or hematology. 4.3. Data collection and management Being an observational study, all data will be recorded as available. There are no mandatory treatment regimens, assessments, and tests. Investigators enter data from source documents into the electronic CRF by the investigator (see Fig. 4 for an eCRF screen shot). Centralized monitoring will be performed by a contract research organization, which will also assure query management. To the centers’ knowledge, a random selection of 10% of patients will be identified for complete on-site monitoring. 4.4. Sample size calculations Because the diversity of this study's objectives and the associated diversity in statistical analyses, several sample size calculations were performed and subsequently integrated into the recommended sample sizes for, respectively, patients and practices. All calculations assume a desired power level of 0.80, alpha at 0.05 and two-tailed tests to detect a small effect size. For Objectives 1, 2, 4, and 5: These are descriptive objectives on available data as generated by this study. The aim is mere description without inferences to populations at large. No sample size calculations based on either power or precision apply. For Objective 3: The first research questions associated with this objective are descriptive; thus, no sample size calculations based on either power or precision apply. Using binary independent variables (congruent vs. non-congruent), a sample size of 800 observations (of which 66% respond and 34% do not respond to treatment) achieves 81% power to detect a minimal change in the probability of non-response from the baseline value of 0.340 to 0.442. This change corresponds to a minimal detectable odds ratio of 1.540. A two-sided log-rank test with an overall sample size of 800 subjects achieves 81% power to detect the difference in survival proportions between 0.76 and 0.67, the proportions surviving in two groups of the same size. This corresponds to a minimal detectable hazard ratio [HR] of 1.459. Using continuous independent variables (e.g. level of congruence) a sample size of 800 observations achieves 81% power to detect a minimal change in the probability of non-response from the value of 0.340 at the mean of X to 0.392 when X is increased to one standard deviation above the mean. This change corresponds to a minimal detectable odds ratio of 1.250. A minimal detectable log HR on a covariate with a standard deviation of 1.75 based on a sample of 800 observations achieves 82% power to detect minimal regression coefficients equal to ±0.125 (HR ≥1.133 or ≤0.882), in a multivariate Cox proportional hazards model with an expected model covariance of 0.10. ANC as a continuous variable will also be considered. We assume that up to 15 variables may be entered into the models and that small increments in R2 of 0.025 must be detectable. A sample size of 747 would be required to conduct a multiple regression procedure based on these assumptions. For Objective 6: In the absence of formal sample size estimation procedures for multi-level modeling, we consider two approaches for sample size estimation for regression, one with a continuous (e.g., ANC-based outcomes; multiple regression) and one with a discrete criterion variable (e.g., ANC targets reached; logistic regression). In the case of the continuous criterion variable, we assume that up to 15 variables may be entered into the models and that small increments in R2 of .025 must be detectable. A sample size of 747 would be required to conduct a multiple regression procedure based on these assumptions. In the case of the discrete criterion variable, we conservatively assumed that analysis should be able to detect an odds ratio of 1.50, that the prevalence of the binary response with no exposure was random (0.50), and that the odds ratio for treatment with Zarzio® is 1 (as all patients would be treated with Zarzio®). Under these assumptions, a sample size of 774 patients would be required to achieve power of 0.80 at alpha = 0.05. Since this study is also focused on the center-level determinants associated with clinical and safety outcomes, we estimated the number of centers that must be recruited to allow inferences at that level. Specific to sample size estimations, we assumed that center-level variables would be secondary to patient-level variables as determinants/predictors of ANC and safety outcomes, and therefore that only robust determinants should be retained in modeling efforts. To this effect, we assumed that up to 5 variables may be entered into the models and that increments in R2 of 0.15 would be meaningful and must be detectable. In the case of the continuous criterion variable, a minimum of 75 centers would be required to conduct a multiple regression procedure. In the case of the discrete criterion variable, we conservatively assumed that analysis should be able to detect an odds ratio of 3.5 for a binary and 2.5 for a continuous independent variable. These ratios are higher than in the patient calculations, however we argue that only solid center-level differences are of interest, not minimal variations between centers. We assumed that the prevalence of the binary response with no exposure was random (0.50). Under these assumptions, a minimal sample size of 75 centers would be required to achieve power of 0.80 at alpha = 0.05. Using established techniques [26], [27], a two-sided log rank test with an overall sample size of 800 subjects achieves 81% power to detect the difference in survival proportions between 0.76 and 0.67, the proportions surviving in two groups of the same size. This corresponds to a HR of 1.459. A minimal detectable log HR on a covariate with a standard deviation of 1.75 based on a sample of 800 observations achieves 82% power to detect minimal regression coefficients equal to ±0.125 (HR ≥1.133 or ≤0.882), in a multivariate Cox proportional hazards model with an expected model covariance of 0.10. For Objective 7: There are no formal sample size estimation procedures for clustering and related data mining techniques, except the expectation that sample sizes be “as large as possible.” Assuming that the data model for this study includes X physician level data, and Y patient data, the total would be V = X + Y variables, if all variables were included in the analysis. As the latter is unlikely, we can set V < X + Y. Further, it will be important to test whether profiles/cohorts identified by the above techniques are statistically different from each other on key parameters. Sample size estimation procedures for comparing two or more groups were applied to determine minimum sample sizes for comparing two, three, four, and five cluster solutions. For univariate comparisons of two clusters, a sample size of 394 patients per cluster (total N = 788) would be required to detect a small standardized effect size of f = 0.20, with a sample standard deviation of 2.0 following the non-central t distribution. For univariate comparisons among three clusters, a sample size of 257 patients per cluster (total N = 741) would be required to detect a small standardized effect size of f = 0.114, with a sample standard deviation of 1.75 following the non-central F distribution. For univariate comparisons among four clusters, a sample size of 155 patients per cluster (total N = 620) would be required to detect a small standardized effect size of f = 0.133, with a sample standard deviation of 1.50 following the non-central F distribution. For univariate comparisons among five clusters, a sample size of 136 patients per cluster (total N = 680) would be required to detect a small standardized effect size of f = 0.133, with a sample standard deviation of 1.50 following the non-central F distribution. For Objectives 8, 9, and 10: A logistic regression of a binary response variable (target ANC achieved) on a binary independent variable (X) with a sample size of 800 observations (of which 66% achieve target ANC and 34% are non-responders) achieves 81% power to detect a minimal change in the probability of non-response from the baseline value of 0.340 to 0.442. This change corresponds to a minimal detectable odds ratio of 1.540. A logistic regression of a binary response variable (target ANC achieved) on a continuous, normally distributed variable (X) with a sample size of 800 observations achieves 81% power to detect a minimal change in the probability of non-response from the value of 0.340 at the mean of X to 0.392 when X is increased to one standard deviation above the mean. This change corresponds to a minimal detectable odds ratio of 1.250. A two-sided log-rank test with an overall sample size of 800 subjects achieves 81% power to detect the difference in survival proportions between 0.76 and 0.67, the proportions surviving in two groups of the same size. This corresponds to a minimal detectable HR of 1.459. A minimal detectable log HR on a covariate with a standard deviation of 1.75 based on a sample of 800 observations achieves 82% power to detect minimal regression coefficients equal to ±0.125 (HR ≥1.133 or ≤0.882), in a multivariate Cox proportional hazards model with an expected model covariance of 0.10. Summary and recommendation: Required patient sample size estimates range from 439 (for two-sample t-tests) to 800 (for time-to-event and non-responder analyses). Taking the higher estimates, we propose to add a 25% margin for patient attrition, thus raising the required sample size to a minimum of 1000 patients to be recruited from at least 75 centers. Each center will be asked to enroll up to 15 patients. 4.5. Statistical analysis Objectives 1 and 2: Descriptive statistics of frequency, central tendency, and dispersion will be used under consideration of applicable levels of measurement. Secondary correlational/associative analyses may be performed if descriptive analyses suggest the presence of associations among variables. In that case, associative analyses will be performed under consideration of the levels of measurement of the variables involved. Differences in treatment by patient subgroups will be evaluated using Fisher's exact, χ2, Student's t, analysis of variance, Mann–Whitney U, or Kruskal–Wallis tests where appropriate. Objective 3: Descriptive statistics of frequency, central tendency, and dispersion will be used under consideration of applicable levels of measurement. Secondary correlational/associative analyses may be performed if descriptive analyses suggest the presence of associations among variables. In that case, associative analyses will be performed under consideration of the levels of measurement of the variables involved. Hierarchical modeling will be used to determine if treatment congruence is associated with differences in clinical and safety outcomes. Objective 4: In addition to appropriate descriptive statistics as above, an index of intra-individual variation in ANC (“Intra-ANC”) levels will be calculated following Nesselroade and Salthouse's [28] proposal to calculate the standard deviation of a subject's multiple ANC measurements. While one could also use the variance, the standard deviation approach yields a metric scaled on these original data and respective means. One-way (within-subjects) repeated measures analysis of variance and factorial (within-subjects and between-subjects) using the Geisser-Greenhouse corrected F-test will be used to examine this objective. In addition, though in an exploratory fashion and data permitting, we will attempt to perform time series analysis under various scenarios of autoregressive lag to explore the presence of trend effects in the 6-cycle data. Objective 5: Proportions, median follow-up duration, median event-free survival times, and events per patient year will be used to describe the safety endpoints of hospitalization and all-cause mortality over the entire study period (0–6 cycles). The distribution of events and event-free survival will be described for the entire sample using Kaplan–Meier estimator modeling. Mantel–Cox log-rank tests (weighing all time points equally) or generalized Wilcoxon–Breslow tests (weighing time points relative to the number of cases at risk at each time point) will be used to evaluate bivariate differences in survival distribution between identified subgroups. Cox proportional hazards modeling will be used to analyze the multivariate effects of identified risk factors and other determinants on survival under the proportional hazards assumption. Adjusted HRs and 95% confidence intervals will be calculated to test the direction and strength of the influence of individual factors on event-risk and event-free survival. Omnibus tests of model coefficients, −2log likelihood and Nagelkerke pseudo R2 also will be calculated to determine variable block and overall model fit and significance. Objective 6: We will apply hierarchical modeling to test the relationship of patient- and physician/center-level variables and treatment response. This is a two-level model and consists of two sub-models, one for each level. In this study, patients are nested within centers and physicians, and two-level models will be used for patient/center and patient/physician. We will apply this method in multivariate linear regression and logistic regression models of ANC outcomes in which slope coefficients or odds ratios and 95% confidence intervals will be calculated. Adjusted R2 or Nagelkerke pseudo R2 will be calculated where appropriate. The intraclass correlation coefficient (range 0.00–1.00) will be used to quantify the variability in patient outcome attributable to within-center or within-physician variability before any patient-level determinants are considered. We will also apply the above method in Cox proportional hazards modeling (calculating adjusted hazards ratios and 95% confidence intervals) of safety outcomes. In addition, in an exploratory fashion and data permitting, time-dependent covariates will be assessed in multivariate Cox proportional hazards models, and we will evaluate the multi-level determinants of the count of overnight hospitalizations using hierarchical Poisson regression modeling, calculating incident rate ratios and 95% confidence intervals. Objective 7: To identify cohorts (“clusters”) we will apply sequentially a series of increasingly more complex procedures: aggregation techniques (“classification and clustering methods”), differentiation techniques (“tree-based models”), and associative and pattern recognition techniques (e.g., “neural networks”). Aggregation procedures will be done on the sample as a whole. However, if we need to migrate to differentiation and associative/pattern recognition procedures, we will randomly divide the sample into a training set (for model development) and a testing set (for model validation). Once (two or more) cohorts have been identified, we will use both univariate and multivariate statistical significance testing procedures to measure differences between the two or more groups on key differentiating variables. In the event of 2 cohorts, the univariate test will be the Student's t-test for independent samples and the multivariate test will be the Hotelling's T2. In the event of 3 or more cohorts, the univariate test will be analysis of variance and the multivariate will be multivariate analysis of variance. As some key differentiating variables may be at ordinal or nominal levels of measurement, appropriate adjustments will be made in the choice of statistical models – recognizing that multivariate comparisons may not be possible. Objectives 8, 9, and 10: Multivariate logistic regression, Kaplan–Meier estimation, and Cox proportional hazards modeling will be used to address these objectives. 4.6. Interim analyses In addition to the end-of-study analyses, interim analyses are planned after patients enrolled in the first 3 months of the enrollment period have completed 6 cycles of follow-up (approx. month 9); and 12 (approx. month 21) and 24 (approx. month 33) months later, using a Pocock-adjusted level of statistical significance [29], [30]. 5. Summary  The MONITOR-GCSF study is an international, prospective, observational, pharmaco-epidemiological study to evaluate the multi-level factors and outcomes associated with the use of Zarzio® in the prophylaxis of febrile neutropenia in cancer patients being treated with chemotherapy. Driven by a novel, integrated, multi-focal framework for post-approval observational studies, it examines determinants of response at both the patient and the physician level; integrates an advocated statistical methodology hence to be used mainly in the social and behavioral sciences; assesses factors potentially predictive of poor treatment response; and evaluates the extent to which treatment is congruent with evidence-based guidelines and the approved label. Authorship contributions  Study design: M. Aapro, I. Abraham, P. Gascón, H. Ludwig, K. MacDonald, M. Muenzberg, N. Rosencher, and M. Turner. Study preparation: I. Abraham, C. Lee, K. MacDonald, M. Turner, and M. Song. Data management plan: C. Lee, K. MacDonald, and M. Song. Statistical analysis plan: I. Abraham, C. Lee, and M. Song. Manuscript draft: I. Abraham. Critical review of manuscript: M. Aapro, I. Abraham, P. Gascón, H. Ludwig, K. MacDonald, M. Muenzberg, N. Rosencher, and M. Turner. Conflicts of interest  All authors completed the ICMJE uniform disclosure form for potential conflicts of interest. M. Turner and M. Muenzberg are employees of Sandoz Biopharmaceuticals. M. Song, K. MacDonald, C. Lee, and I. Abraham are employees of Matrix45. By company policy, employees are prohibited from owning equity in client organizations (except through mutual funds or other independently administered collective investment instruments) or contracting independently with client organizations. Matrix45 provides similar services to other biopharmaceutical companies. Reviewers  Dr. Cesare Gridelli, S G Moscati Hospital-Vellino, Division of Medical Oncology, Via Circumvallazione, I-83100 Avellino, Italy. Prof. Jean Klastersky, Institut Jules Bordet, Department of Medicine, 121 Boulevard de Waterloo, B-1000 Brussels, Belgium. Acknowledgments  •This study is sponsored by Sandoz Biopharmaceuticals. •The authors thank Erin Arizmendi for editorial, proofreading, and administrative assistance. References  [1]. [1]Lalami Y, Paesmans M, Aoun M, et al. A prospective randomized evaluation of G-CSF or G-CSF plus oral antibiotics in chemotherapy-treated patients at high risk of developing febrile neutropenia. Supportive Care in Cancer. 2004;12:725–730. MEDLINE |
CrossRef
[2]. [2]Lyman GH, Lyman CH, Agboola O. Risk models for predicting chemotherapy-induced neutropenia. The Oncologist. 2005;10:427–437. MEDLINE |
CrossRef
[3]. [3]Kuderer N, Dale D, Crawford J, Cosler L, Lyman G. Impact of primary prophylaxis with granulocyte colony-stimulating factor or febrile neutropenia and mortality in adult cancer patients receiving chemotherapy: a systematic review. Journal of Clinical Oncology. 2007;25:3158–3167.
CrossRef
[4]. [4]Calhoun E, Chang CH, Welshman E, Cella D. The impact of chemotherapy delays on quality of life in patients with cancer. Journal of Supportive Oncology. 2004;2:64–65. [5]. [5]Wingard JR, Elmongy M. Strategies for minimizing complications of neutropenia: prophylactic myeloid growth factors or antibiotics. Critical Reviews in Oncology and Hematology. 2009;. [6]. [6]National Cancer Institute . Cancer therapy evaluation program, common toxicity criteria, version 2.0. 2009; http://ctep.info.nih.gov/protocolDevelopment/electronic_applications/docs/ctcv20_4-30-992.pdf. [7]. [7]Lalami Y, Paesmans M, Muanza F, et al. Can we predict the duration of chemotherapy-induced neutropenia in febrile neutropenic patients, focusing on regimen-specific risk factors? A retrospective analysis. Annals of Oncology. 2006;17:507–514. MEDLINE |
CrossRef
[8]. [8]Crawford J, Dale D, Lyman GH. Chemotherapy-induced neutropenia. Cancer. 2003;100:228–237. [9]. [9]Komorokji R, Lyman G. The colony-stimulating factors: use to prevent and treat neutropenia and its complications. Expert Opinion on Biological Therapies. 2004;4:1897–1910. [10]. [10]Aapro M, Cameron D, Pettengell R, et al. EORTC guidelines for the use of granulocyte-colony stimulating factor to reduce the incidence of chemotherapy-induced febrile neutropenia in adult patients with lymphomas and solid tumours. European Journal of Cancer. 2006;42:2433–2453. [11]. [11]Gridelli C, Aapro M, Barni S, et al. Role of colony stimulating factors (CSFs) in solid tumours: results of an expert panel. Critical Reviews in Oncology and Hematology. 2007;63:53–64. [12]. [12]Zielinski C, Awada A, Cameron D, Cufer T, Martin M, Aapro M. The impact of new European Organisation for Research and Treatment of Cancer guidelines on the use of granulocyte colony-stimulating factor on the management of breast cancer patients. European Journal of Cancer. 2008;44:353–365. [13]. [13]Raposo CG, Marin AP, Barón MG. Colony-stimulating factors: clinical evidence for treatment and prophylaxis of chemotherapy-induced febrile neutropenia. Clinical Translational Oncology. 2006;8:729–734.
CrossRef
[14]. [14]Crawford J, Ozer H, Stoller R, et al. Reduction in granulocyte colony-stimulating factor of fever and neutropenia induced by chemotherapy in patients with small-cell lung cancer. New England Journal of Medicine. 1991;325:164–170. MEDLINE |
CrossRef
[15]. [15]Trillet-Lenoir V, Green J, Manegold C, et al. Recombinant granulocyte colony-stimulating factor reduces the infectious complications of cytotoxic chemotherapy. European Journal of Cancer. 2006;29A:319–324. [16]. [16]Mangi MH, Newland AC. Febrile neutropenia: prophylactic and therapeutic use of GM-CSF. European Journal of Cancer. 2000;35:4–7. [17]. [17]Morrison V, Wong M, Hershman D, Campos L, Ding B, Malin J. Observational study of the prevalence of febrile neutropenia in patients who received filgrastim or pegfilgrastim associated with 3–4 week chemotherapy regimens in community oncology practices. Journal of Managed Care Pharmacy. 2007;13:337–348. MEDLINE [18]. [18]Smith T, Khatcheressian J, Lyman G, et al. 2006 update of recommendations for the use of white blood cell growth factors: an evidence-based clinical practice guideline. Journal of Clinical Oncology. 2006;24:3187–3205.
CrossRef
[19]. [19]Lyman GH. Guidelines of the National Comprehensive Cancer Network on the use of myeloid growth factors with cancer chemotherapy: a review of evidence. Journal of the National Comprehensive Cancer Network. 2005;3:557–571. MEDLINE [20]. [20]Greil R, Jost LM. ESMO recommendations for the application of hematopoietic growth factors. Annals of Oncology. 2005;16(Suppl. 1):80–82. [21]. [21]Klastersky J, Awada A, Aoun M, Paesmans M. Should the indications for the use of myeloid growth factors for the prevention of febrile neutropenia in cancer patients be extended?. Current Opinion in Oncology. 2009;21:297–302.
CrossRef
[22]. [22]Aapro M, MacDonald K, Van Erps J, et al. Managing cancer-related anemia in congruence with the EORTC guidelines is an independent predictor of haemoglobin outcome: initial evidence from the RESPOND study. European Journal of Cancer. 2009;45:8–11. [23]. [23]Last accessed, 30 November, 2009. http://www.emea.europa.eu/humandocs/Humans/EPAR/zarzio/zarzio.htm. [24]. [24]Gascón P, Fuhr U, Sörgel F, et al. Development of a new G-CSF product based on biosimilarity assessment. Annals of Oncology. 2009;. [25]. [25]Pettengell R, Bosly A, Szucs T, et al. Multivariate analysis of febrile neutropenia occurrence in patients with non-Hodgkins lymphoma: data from the INC-EU prospective observational European neutropenia study. British Journal of Haematology. 2008;144:677–685.
CrossRef
[26]. [26]Hsieh FY, Lavori PW. Sample-size calculations for the Cox proportional hazards regression model with nonbinary covariates. Controlled Clinical Trials. 2000;21:552–560. Abstract | Full Text |
Full-Text PDF (171 KB)
|
CrossRef
[27]. [27]Machin D, Campbell M, Fayers P, Pinol A. Sample size tables for clinical studies. 2nd ed.. Malden, MA: Blackwell Science; 1997;. [28]. [28]Nesselroade JR, Salthouse TA. Methodological and theoretical implications of intra-individual variability in perceptual-motor performance. Journal of Gerontology: Psychological Science. 2004;59B:49–55. [29]. [29]Geller NL, Pocock SJ. Interim analyses in randomized clinical trials: ramifications and guidelines for practitioners. Biometrics. 1987;43:213–223.
CrossRef
[30]. [30]Schulz KF, Grimes DA. Multiplicity in randomised trials II: subgroup and interim analyses. The Lancet. 2005;365:1657–1661. P. Gascón is Head of the Division of Medical Oncology of the Department of Hematology-Oncology of the Hospital Cliníc de Barcelona, where he also serves as the department's Scientific Director. He is also a Professor in the Faculty of Medicine of the Universitat de Barcelona (Spain). M. Aapro is the Dean of the Institut Multidisciplinaire d’Oncology at the Clinique de Genolier (Switzerland). He is also a Consulting Physician in the Division of Oncology and Hematology of the Hôpital Cantonal Universitaire, Genève (Switzerland). H. Ludwig is Chairman of the Department of Oncology and Hematology of the Wilhelminenspital in Wien. He is also a Professor at the Medizinische Universität Wien (Austria). N. Rosencher is an anesthesiologist at the Hôpital Cochin in Paris and a Professor at Paris Descartes University in Paris (France). M. Turner is the Global Medical Leader for oncology and nephrology in the biopharmaceuticals division of Sandoz (a Novartis company) in Holzkirchen (Germany). M. Song is an Associate Research Scientist at Matrix45 (Philadelphia, PA, USA). She will be starting a postdoctoral fellowship in epidemiology at the Centers for Disease Control and Prevention in Atlanta, GA in July 2010. K. MacDonald is a Principal at Matrix45 (Earlysville, VA, USA), where she serves as Chief Executive. C. Lee is a Research Scientist at Matrix45 (Philadelphia, PA). He was a postdoctoral fellow in clinical outcomes and effectiveness research in the Center for Health Outcomes and Pharmacoeconomic Research, College of Pharmacy, The University of Arizona (Tucson, AZ, USA) at the time of the development of the study and this paper. He will be starting as Assistant Professor, School of Nursing, Oregon Health and Science University (Portalnd, OR, USA) in June 2010. M. Muenzberg is the Global Head of Medical Affairs in the biopharmaceuticals division of Sandoz (a Novartis company) in Holzkirchen (Germany). I. Abraham is a Principal at Matrix45 (Earlysville, VA, USA), where he serves as Chief Scientist. He is also a Professor and Investigator in the Center for Health Outcomes and Pharmacoeconomic Research, College of Pharmacy, The University of Arizona (Tucson, AZ, USA). a Division of Medical Oncology, Department of Hematology-Oncology, Hospital Clínic de Barcelona, University of Barcelona, Calle Villarroel 170, 08036 Barcelona, Spain b Institut Multidisciplinaire d’Oncologie, Clinique de Genolier, 1, route du Muids, 1272 Genolier, Switzerland c Medizinische Abteilung I – Onkologie und Haematologie, Wilhelminenspital, Montleartstrasse 37, 1160 Wien, Austria d Département d’Anesthésiologie et Soins Intensifs, Hôpital Cochin and Université Paris 5 Descartes, 27, rue du Faubourg-Saint-Jacques, 75014 Paris, France e Sandoz Biopharmaceuticals, Holzkirchen, Germany f Matrix45, Earlysville, VA, USA g School of Nursing, University of Pennsylvania, 418 Curie Blvd., Philadelphia, PA 19104, USA h Center for Health Outcomes and Pharmacoeconomic Research, The University of Arizona, Tucson, AZ, USA Corresponding author at: Matrix45, 620 Frays Ridge Road, Earlysville, VA 22936, USA. Tel.: +1 434 978 1045; fax: +1 978 945 8374.
PII: S1040-8428(10)00029-6 doi:10.1016/j.critrevonc.2010.01.014 © 2010 Elsevier Ireland Ltd. All rights reserved. | |
|