ARTICLE |
a Department of Pediatrics, Rainbow Babies & Children's Hospital, Case Western Reserve University, Cleveland, Ohio
b Department of Pediatrics, Women & Infants Hospital, Brown University, Providence, Rhode Island
c Department of Pediatrics, Wayne State University, Detroit, Michigan
d Department of Pediatrics, Riley Hospital for Children, Indiana University, Indianapolis, Indiana
e Research Triangle Institute, Research Triangle Park, North Carolina
f Department of Pediatrics, Sharp Mary Birch Hospital, University of California, San Diego, San Diego, California
g Department of Pediatrics, Emory University, Atlanta, Georgia
h Department of Pediatrics, University of California, San Diego, San Diego, California
i Department of Pediatrics, Lucille Packard Children's Hospital, Stanford University, Palo Alto, California
j National Institute of Child Health and Human Development, Rockville, Maryland
| ABSTRACT |
|---|
|
|
|---|
METHODS. A cluster-randomized trial enrolled 4093 inborn neonates with birth weights of <1250 g at 17 centers of the National Institute of Child Health and Human Development Neonatal Research Network. Three centers were selected as best performers, and the remaining 14 centers were randomized to intervention or control. Changes in rates of survival free of bronchopulmonary dysplasia were compared between study year 1 and year 3.
RESULTS. Intervention centers implemented potentially better practices successfully; changes included reduced oxygen saturation targets and reduced exposure to mechanical ventilation. Five of 7 intervention centers and 2 of 7 control centers implemented use of high-saturation alarms to reduce oxygen exposure. Lower oxygen saturation targets reduced oxygen levels in the first week of life. Despite these changes, rates of survival free of bronchopulmonary dysplasia were all similar between intervention and control groups and remained significantly less than the rate achieved in the best-performing centers (73.3%).
CONCLUSIONS. In this cluster-randomized trial, benchmarking and multimodal quality improvement changed practices but did not reduce bronchopulmonary dysplasia rates.
Key Words: quality improvement randomized trial bronchopulmonary dysplasia
Abbreviations: QIquality improvement BPDbronchopulmonary dysplasia PMApostmenstrual age CPAPcontinuous positive airway pressure ORodds ratio CIconfidence interval
In 2000, publication of the seminal report To Err Is Human by the Institute of Medicine rocked the medical world and shook the confidence of patients in the health care system.1 The Institute of Medicine then examined deficiencies in common health care practices in the report Crossing the Quality Chasm and found that health care processes were subject to significant deviations from expected practice that led to suboptimal care and poor patient outcomes.2 Quality improvement (QI) techniques, adapted from industry, have been used to improve care and patient outcomes. However, the National Academies of Sciences found that research on QI lagged behind real-world applications.3 The Institute of Medicine concluded that most health care professionals were trained inadequately in QI.3 The optimal techniques for training teams and changing practice are not known.
One technique in QI is benchmarking, in which best-performing centers are identified and practices are examined and emulated at other centers to improve outcomes. The rationale is that institutions with excellent performance for a given outcome apply specific clinical practices that are most effective. They may also display structural or cultural organizational features that contribute to excellent outcomes. By visiting these centers and reviewing the evidence in the literature, teams from other institutions can identify these practices and organizational features. Then, by applying methods learned in QI training, the teams should be able to implement the identified practices and to modify their organizations in ways that lead to better outcomes. Although such QI teams are used increasingly in health care organizations, their efficacy has not been evaluated rigorously. Evaluations of the impact of QI teams have revealed mixed results. Shojania et al4 reviewed systematically the results of QI studies and found evidence for only modest effects, together with evidence of publication bias, with small trials being more likely to demonstrate positive results and larger trials more often yielding negative results. Also clouding the evaluation of QI is the fact that teams are often self-selected and highly motivated. It is not known whether the successful improvements in outcomes reported by such teams can be generalized when applied on a wider scale. In addition, it is unclear whether teams can select from the potentially thousands of clinical practices used to identify a subset of practices that can be applied to produce improved outcomes.
Survival rates for very low birth weight neonates (<1500 g) have improved steadily, with 83% of all such neonates surviving.5 Although most of the survivors are healthy, many develop a chronic lung injury, bronchopulmonary dysplasia (BPD), which is a significant health burden.68 In 1998, 55% of neonates with birth weights of <1250 g who were born at centers in the National Institute of Child Health and Human Development Neonatal Research Network either died or developed BPD. The incidence rates of BPD vary by center and are not explained by differences in birth weight, gestational age, race, frequency of prenatal steroid use, or incidence of respiratory distress syndrome.9 Therefore, differences in treatment practices may contribute to the development of BPD.10,11 We conducted a cluster-randomized, controlled trial to test whether NICUs trained in benchmarking and multimodal QI techniques could improve rates of survival without BPD for neonates with birth weights of <1250 g, compared with centers with usual practice.
| METHODS |
|---|
|
|
|---|
Randomization
Our intention was to improve the use of potentially better practices by the entire neonatal care team. The NICU, rather than the patient, was the unit of randomization, because the intervention was applied to a team representing the NICU. In June 2001, the 14 remaining centers were assigned randomly, with computer-generated codes prepared in sealed opaque envelopes by the data center, to the intervention group (N = 7) or the control group (N = 7). Envelopes were distributed in person, and all were opened simultaneously. A flow diagram of study participants and units is shown in Fig 1.
|
Training in QI Practices
All team members attended an 8-hour training session on QI led by a team of experts (see "Acknowledgments"). Sessions introduced systems thinking, cycles of rapid change, measurement tools, and the concept of potentially better practices.12 Teams were provided with literature reviews of care practices, including published meta-analyses and reviews from the Cochrane Collaboration. Core teams met face to face on 2 occasions and then in teleconferences. Teleconferences at 4- to 8-week intervals throughout the 2-year intervention period supported initial training. One control site and 2 intervention sites had participated in previous rapid-cycle QI processes.
Site Visits and Selection of Interventions
The teams from the intervention centers visited each benchmark center in November 2001. Benchmark centers delivered a presentation on their self-assessments of practices responsible for their high rates of survival free of BPD to the intervention teams. Teams also observed care directly at each benchmark site, collaborated in document care practices, and compared these with the benchmark self-assessments. In addition, intervention teams scrutinized extensive data collected by research nurses during the preintervention period at the benchmark centers and at their own centers.
From these data-driven assessments, teams identified 27 potentially better practices at the benchmark centers, in 3 domains, namely, delivery room care, ventilation practices, and nutrition and fluid practices (Table 1). Two other domains evaluated originally, that is, infection rates and infection control practices and organizational structure, were not different at the better-performing centers and were not selected for implementation. Overall care in the benchmark centers was characterized by ventilation with lower tidal volumes (2 centers with emphasis on use of nasal continuous positive airway pressure [CPAP] and 1 center with continued mechanical ventilation with low tidal volumes) and lower oxygen saturation targets.
|
Implementation of all selected potentially better practices was tracked with statistical control charts. Each intervention was assigned a predefined method of objective measurement based on observation of practices at the better-performing centers. For example, implementation of the use of high-saturation alarms was tracked with random audits of the use of alarms at the intervention sites and the control sites. The metrics used to track practice changes are summarized in the Appendix. Control charts (example shown in Fig 2) were generated by the data center with SAS software (SAS Institute, Cary, NC), provided to the intervention teams at 4- to 6-week intervals throughout the 2-year intervention period, and shared with all members of the NICUs to reinforce practice changes. Each team received control charts of all practices selected for implementation by any team. In this study, successful implementation of an intervention was defined as a statistically significant improvement from preintervention performance achieved within the 2-year intervention period and maintained at the end of the period. Data on practices in the best-performing centers before the study were available to the teams, but data on performance in the control centers were masked. Control centers were provided with annual summary data provided routinely to all network centers and were masked with respect to the work at the intervention centers. Control centers were prohibited from participating in other QI collaborative efforts focused on BPD. Because of ethical concerns with prohibiting practice changes in control centers during the 3-year trial period, local quality efforts initiated by clinicians who were not members of the research team were permitted. One control center changed its management approach to deemphasize mechanical ventilation and to emphasize nasal CPAP during the trial.
|
Human Subject Protection
The institutional review board at every site approved the study. One center provided families with a letter of information, and all others were given a waiver of consent requirements to collect deidentified data. The trial was registered at inception with the US National Library of Medicine trial registry (trial registration NCT00067613 [see www.clinicaltrials.gov]). In April 2003, data were reviewed by an independent data monitoring and safety committee, which recommended trial continuation.
Study Time Line and Statistical Methods
The preintervention year (study year 1) began in March 2001. Centers were assigned randomly to intervention or control in June 2001, to permit centers to free investigators for site visits in October and November 2001. Centers selected interventions, began implementation in May 2002, and continued interventions through a 2-year intervention period (study year 2 and year 3). Outcomes were compared between year 1 (March 2001 to May 2002) and year 3 (April 2003 to May 2004).
Analyses included all neonates with birth weights of <1250 g who were born at the centers and were free of major malformations. All analyses were based on an intent-to-treat model according to center assignment. Comparisons of the intervention and control centers were assessed by using mixed-model methods (SAS 9.1 loadable PROC GLIMMIX for binary outcomes and PROC MIXED for continuous outcomes), with the center entered as a random effect. These analyses accounted for the intraclass correlation within each center attributable to clustering from randomization according to center. The model for the analysis included the following terms: group (intervention or control), study year (year 1 or year 3), and group-year interaction. The group-year interaction term measures the difference between the 2 groups in changes from year 1 to year 3, which is the parameter of interest. A 0 coefficient for this term indicates a null treatment effect between the 2 groups. Other characteristics of the infants present at the time of birth were added to the model, including birth weight, gestational age of <26 weeks, race, gender, and prenatal steroid exposure. An interaction between gestational age (<26 weeks versus
26 weeks) and the main effect (group-year interaction) was added to the model if it showed a significant effect. Summary outcomes are shown when the gestational age interaction was not significant, and outcomes according to gestational age group are shown when the interaction was significant. Binary outcomes are presented as adjusted odds ratios (ORs) for year 3 versus year 1, with 95% confidence intervals (CIs), and continuous outcomes as the adjusted difference of year 3 versus year 1.19 A model with a term for severity of illness that included the Score for Neonatal Acute Physiology II with perinatal extension was also studied; the results obtained by using the additional term for severity of illness were not different from the results of the first model and are not shown.18 We prespecified secondary analyses that evaluated the impact of the intervention according to center and according to gestational age (26 weeks versus
26 weeks). All study analyses were completed with SAS 9.1 (SAS Institute).
Sample Size
In this trial, with the center rather than the individual patient as the unit of randomization, sample size calculations accounted for both interhospital and intrahospital variability. The comparison of interest was the intervention/control difference in the change in rates of survival free of BPD between study years 1 and 3. The methods of Gail et al20 from the Community Intervention Trial for Smoking Cessation were used in these calculations. Based on network data for 1999 and 2000, the rate of survival free of BPD (defined as oxygen use at 36 weeks) in inborn neonates of <1250 g was 45%. We calculated a sample size of 1400 neonates in each year of the study (100 patients per center in 14 centers) to yield 80% power (2-sided
= .05) to detect an absolute difference of 14% in the change in rates of survival free of BPD between year 1 and year 3 in the intervention versus control groups. The magnitude of the effect selected was relatively large and was based on effect size seen in the only published work of collaborative QI to reduce BPD.21
| RESULTS |
|---|
|
|
|---|
|
Interventions
The majority of intervention centers implemented their selected practices, although the rate of success did vary according to center (median rate of success: 75%; range: 40%100%). The intervention group did change respiratory care practices (Table3). Both intervention and control centers decreased the time of delivery of the first surfactant dose, with intervention centers decreasing from a median of 51 minutes to 31 minutes and control centers decreasing from a median of 41 minutes to 21 minutes. The intervention group significantly increased the use of CPAP on the first day of life (year 1: 16.9%; year 3: 24.2%) but, despite the 7.3% increase, usage was still below the rate of CPAP use in the control centers in both year 1 (26.5%) and year 3 (28.2%). Intervention centers also decreased the duration of mechanical ventilation in the first week of life (from 4.0 ± 2.7 days to 3.5 ± 2.8 days), whereas the control centers did not change significantly (3.5 ± 2.8 days versus 3.4 ± 2.8 days). Despite having persistently higher rates of intubation on day 1 of life than did control centers between study year 1 and study year 3, intervention centers decreased the total duration of respiratory support by 5.3 days, whereas the control group decreased the duration by 4.1 days.
|
The final practice changes selected by intervention centers were restrictions of intravenous fluid volumes. Four centers selected this intervention, and 3 were successful in implementation, with reductions in delivered intravenous fluid (intervention: 126.8 mL/kg per day on day 3 of life in year 1 and 117 mL/kg per day in year 3; control: 125.1122.6 mL/kg per day).
Outcomes
Intervention centers did not change the frequency of survival free of BPD faster than control centers (change score for intervention group adjusted for clustering: 0.01; 95% CI: 0.06 to 0.04; control group: +0.003; 95% CI: 0.04 to 0.05). The changes between study year 1 and study year 3 in overall survival rates, rates of survival free of BPD, and severity of BPD did not differ between the intervention and control centers (Table 4 and Fig 3). Infants cared for in intervention centers did not differ in the incidence of common neonatal morbidities, compared with those cared for in control centers (Tables 3 and 5). There was a nonsignificant trend toward increased rates of severe intraventricular hemorrhage in the intervention centers between year 1 and year 3 (14.4% vs 18.1%), compared with the control centers (13.3% vs 14.1%). The incidence rates of periventricular leukomalacia, severe retinopathy of prematurity, necrotizing enterocolitis, patent ductus arteriosus, and growth failure were similar between the 2 groups in study years 1 and 3.
|
|
|
26 weeks. As anticipated, for infants with PMA of
26 weeks across the 3 years of the trial, rates of survival to PMA of 36 weeks (94% vs 66%) and survival free of BPD (74% vs 30%) were significantly higher than those for infants with PMA of <26 weeks. Overall, results did differ according to PMA group (Fig 4). For infants with PMA of
26 weeks, there were no differences in rates of survival to PMA of 36 weeks between intervention and control centers, but a statistically nonsignificant trend toward reduced BPD rates was seen in intervention centers. In contrast, in the subgroup of infants with PMA of <26 weeks, trends toward reduced survival rates and increased rates of BPD were seen at the intervention centers (Table 4 and Fig 4). Of note, this difference was driven by a lower-than-anticipated rate of BPD in year 1. The interaction between PMA and treatment assignment was significant (P = .01) (gestational age of <26 weeks: BPD in intervention: OR: 2.53; 95% CI: 1.314.90; BPD in control: OR: 1.01; 95% CI: 0.631.63; gestational age of
26 weeks: BPD in intervention: OR: 0.80; 95% CI: 0.561.16; BPD in control: OR: 0.96; 95% CI: 0.701.33).
|
26 weeks (OR: 1.33; 95% CI: 0.752.36) but worsened outcomes were seen for infants born at PMA of <26 weeks (OR: 0.18; 95% CI: 0.060.55). Those 2 centers drove the gestational age differences described in the overall trial. In the control centers, 1 center significantly improved the rate of survival free of BPD (OR: 1.96; 95% CI: 1.003.84), whereas 5 centers showed no significant change (OR: 0.401.35) and 1 center worsened significantly (OR: 0.53; 95% CI: 0.290.99). | DISCUSSION |
|---|
|
|
|---|
By chance, centers randomized to intervention had higher rates of endotracheal intubation (77% vs 66%), lower rates of CPAP on day 1 of life (16.9% vs 26.5%), and more days of mechanical ventilation in the first 7 days of life (4.0 ± 2.7 vs 3.5 ± 2.8 days). The goal of the study was to assess the utility of benchmarking to accelerate practice changes. As is normal in QI studies, intervention centers were focused initially on the performance of the 3 benchmark centers and later on performance at their own centers. Intervention centers were masked with respect to processes at the control centers. Differences in practices at centers were anticipated in the design and represent the reason why a change score between year 1 and year 3 was used as the primary outcome statistic; such an approach corrects for potential baseline differences in patient characteristics and practices. Despite differences in practices, intervention centers did not improve rates of survival free of BPD faster than control centers. If in fact the practices on which the centers focused were important determinants of BPD, then the intervention centers should have been biased toward a greater likelihood of finding an improvement, because they had greater opportunities to improve practice.
BPD is a complex disease with multifactorial pathophysiologic processes.6,11,14 Disruption of the development of fragile lung parenchyma at a critical period of alveolar and vasculature maturation is thought to be a primary determinant of BPD. Key contributors to such disruption are oxidative injury from oxygen, ventilator-induced lung injury, and inflammation.11,2228 Interventions in this trial reduced ventilator pressures, reduced time with mechanical ventilation, and reduced oxygen concentrations. Interventions to avoid volutrauma were supported by evidence from preclinical studies in animals and human neonates and were reinforced by data and direct observation at the benchmark centers.2528 However, as shown in the Appendix, the strength of the evidence for these interventions in human neonates is weak, with few randomized trials. Existing evidence equally supports 2 philosophically divergent interventions, that is, endotracheal intubation and the early delivery of surfactant versus avoidance of endotracheal intubation and the use of CPAP.26,27 It is currently unclear which is the superior approach. Concern regarding potential injury from oxidative stress from high-oxygen environments for which preterm neonates are poorly prepared biologically led centers to emphasize oxygen reduction, the only intervention selected by all 7 intervention sites. In a posthoc analysis using both intervention and control centers, interventions focused on pressure reduction rather than oxygen reduction were more successful in reducing BPD rates (data not shown).
Although interest in QI to improve health care and outcomes is not new, rigorous randomized trials evaluating the method have been conducted only recently, with mixed results. The majority of the trials focused on improving delivery of evidence-based services to adults (eg, use of ß-receptor blockers) or improving the efficiency of care delivery (eg, reducing waiting times). Several trials tested the utility of multimodal interventions. In a cluster-randomized trial, Ferguson et al29 demonstrated improved preoperative ß-receptor blocker and internal mammary graft use in a nationwide QI effort. Although a statistically significant increase in prescription of preoperative ß-receptor blocker therapy was seen, the magnitude of the increase was modest (7.3% vs 3.6% in control centers). Mehta et al30 tested a multimodal intervention led by local opinion leaders to measure the impact on 11 indicators of the quality of acute myocardial infarction care. Some indicators improved in intervention centers, whereas others improved more in the control centers. Overall, the absolute gains ranged from 4% to 12%. Kiefe et al31 used "achievable benchmarks of care," that is, levels of performance achieved by top-performing centers. Those receiving the benchmark feedback improved delivery of influenza vaccine by 18%, compared with the control group. In contrast to these positive results, other trials seeking to improve delivery of ß-receptor blocker use after myocardial infarction, clinical preventive services, compliance with national guidelines for the treatment of hypertension and depression, and compliance with protocols for the care of patients with AIDS failed to demonstrate significant changes in practice.3235 A recently published meta-analysis of QI strategies for patients with diabetes mellitus demonstrated that most trials generated only modest improvements in glycemic control.4 The investigators also found evidence strongly suggesting publication bias, with smaller studies being more likely to show positive effects than larger studies.
Trials of QI in neonatology and pediatrics are more limited. Lozano et al36 reported the results of an intensive QI intervention conducted by Pediatric Asthma Care Patient Outcomes Research Team II. A resource-intensive intervention using organizational change plus physician peer leader education was more effective than physician education alone.36 In neonatology, the Vermont Oxford Network used multifaceted QI techniques to improve patient outcomes focused on rates of nosocomial infections (5.5% reduction, compared with 1.6% for nonparticipants) and BPD (12.5% reduction from 43.5% to 31%, compared with 8.3% reduction for nonparticipants).21 The Vermont Oxford Network group reported a subsequent study that enrolled self-selected centers focused on BPD and showed reductions in rates of BPD in before/after comparisons.37 In another study, the Vermont Oxford Network investigators demonstrated outcomes similar to those of the current trial.38 Implementation of a multimodal QI intervention resulted in earlier administration of surfactant, compared with control centers, but those improved practices did not translate into improved patient outcomes, measured as death or pneumothorax.
One important difference between the current trial and previous studies of QI to reduce BPD rates was the outcome measure we used. Previous studies used a clinical definition of BPD defined by oxygen and/or ventilation exposure at 36 weeks, without controlling for oxygen saturation values delivered. As a prelude to this trial, we developed a rigorous definition of BPD that included a room air challenge for selected infants (those receiving <30% effective oxygen).13 This physiologic definition of BPD was applied equally in the intervention and control centers. As we reported previously, the definition resulted in a mean reduction of 10% in the rates of BPD (range: 0%44%). The implementation of this definition could be considered an intervention that focused clinicians attention on the importance of integrating oxygen delivery, especially through a nasal cannula, with saturation monitoring at both intervention and control centers.39 It might have been the most effective intervention in the trial, dwarfing the effects of other potentially beneficial practices.
What accounts for our finding that the multimodal intervention failed to improve patient outcomes? One possibility is that, despite randomization, there were important random differences between intervention and control centers in the preexisting rates of survival free of BPD. However, the rates were comparable (63.3% in the intervention centers and 62.8% in the control centers in year 1). Another possibility is that our trial was not large enough to identify a clinically important benefit from benchmarking. The 95% CIs for the changes in rates of survival free of BPD resulting from benchmarking and QI excluded a benefit greater than a 4.4% improvement in the rate of survival without BPD and included a hazard as great as a 6.1% increase in the rate of death or BPD. These CIs indicate that the trial was large enough to exclude important larger effects.40,41Another possibility is that the QI training was ineffective. We think that this was not the case, because intervention centers demonstrated greater practice changes than did control centers.
A final (and we think more likely) possibility is that adopting practices from centers with exemplary outcomes may not be beneficial when there is only weak evidence supporting these practices. Well-controlled studies reporting benefits from benchmarking largely have been studies promoting the use of interventions established previously as beneficial in randomized trials. The interventions in our study were those with the strongest available evidence. However, few (such as early administration of surfactant) have been shown to be beneficial in randomized trials. Of the myriad of practice differences between centers, it remains to be established whether the practices that result in superior outcomes in benchmark centers can be reliably recognized and implemented by visiting health care teams. It is even possible that some interventions selected in QI efforts affect outcomes adversely.42,43 An example of this is the selection of a skin emollient as an intervention to reduce infection by one Vermont Oxford Network collaborative group. Emollient was shown to reduce infection rates in a single-center study but was later shown to increase infection rates in a large randomized trial.44,45 Introducing change, no matter how well intentioned, may perturb a stable system, with potentially adverse outcomes. The apparent increase in rates of severe intraventricular hemorrhage among neonates with PMA of <26 weeks in intervention centers may be a statistical anomaly or a real but unintended adverse consequence of changes in care. It is possible that interventions in the delivery room prolonged the time spent in the delivery room and contributed to hypothermia and associated intraventricular hemorrhage.
In this cluster-randomized, controlled trial, NICU teams trained in benchmarking and QI techniques benefited from the intervention with practice changes but did not improve rates of survival free of BPD in neonates with birth weights of <1250 g, compared with centers continuing usual practice. These results have implications for the design of future QI trials, in that other interventions may be required to produce change. Additional refinements are needed to create and to maintain larger magnitudes of change and to improve patient outcomes.
| APPENDIX: DESCRIPTION OF POTENTIALLY BETTER PRACTICES |
|---|
|
|
|---|
2. Respiratory therapist present at every high-risk delivery. The resuscitation team included a respiratory therapist at the benchmark sites. Level of evidence: not available; metric: audit during site visits.
3. Consistent equipment in all delivery rooms. Each resuscitation site was configured with identical equipment, to facilitate the resuscitation of high-risk patients. Level of evidence: not available; metric: audit during site visits.
4. Limited tidal volumes used in manual ventilation during resuscitation. At the benchmark sites, teams focused on limiting tidal volumes during resuscitation by assessing chest wall excursion visually or measuring delivered tidal volumes. The goal was to have barely visible chest wall movements. Evidence in animal models supports this concept. Level of evidence: level 2C; metric: audit during site visits; metric used: first peak inspiratory pressure on admission to the NICU.
5. Prophylactic use of surfactant. At 2 benchmark centers, infants at gestational ages of <28 weeks were immediately intubated and given surfactant. The third center emphasized CPAP beginning in the delivery room. Strong evidence supports this concept to minimize the severity of respiratory distress syndrome. Level of evidence: level 1A46; metric: time to surfactant use in the delivery room.
6. Use of device to provide positive end-expiratory pressure and to limit tidal volume. This practice was not in place at the benchmark centers. Intervention teams added this to the list of potentially better practices to support limitation of delivered tidal volumes. Level of evidence: not available; metric: audit during site visits.
Respiratory Care Practices
7. Selective intubation with liberal use of CPAP. One benchmark center emphasized CPAP beginning in the delivery room. The other 2 centers used intubation with prophylactic surfactant treatment. The evidence for a primary CPAP strategy is weak, with reports from small case series and observational studies. Level of evidence: indeterminate47; metric: proportion of infants treated with CPAP on admission to the NICU.
8. Early use of surfactant if intubated. The benchmark center using a primary CPAP strategy administered surfactant at once if a decision to intubate the infant was made. Evidence supports this concept to minimize the severity of respiratory distress syndrome. Level of evidence: level 148; metric: time to surfactant administration in the NICU.
9. Assessment of volume/pressure and targeting of lowest levels to achieve modest chest rise and to avoid exuberant chest wall motion if intubated. All 3 benchmark centers focused on low tidal volumes for intubated infants. All 3 centers used pressure-controlled, time-cycled ventilators. None had the capacity to measure tidal volume, and instead they used physical examination and assessment of PCO2 to limit tidal volumes. Level of evidence: level 5B; metric: mean peak inspiratory pressure on days 1 and 3 for all intubated neonates.
10. Aggressive weaning and early extubation if intubated. Two benchmark centers weaned patients aggressively and extubated them without birth weight or postnatal age limitations. In these 2 centers, teams noted during the site visits that it was common to see tiny infants at 24 hours of age receiving CPAP. The third center did not extubate patients but weaned them in the first 24 hours to low ventilatory rates and tidal volumes that were comparable to those delivered with CPAP. Level of evidence: level 5; metric: duration of ventilation in the first 7 days of life.
11. Higher PaCO2 targets for all patients. The 3 benchmark centers accepted higher PaCO2 levels to permit weaning from ventilators and/or the use of CPAP. Experimental animal data and uncontrolled human observational studies support permissive hypercapnia as a protective strategy. Level of evidence: indeterminate49; metric: mean PaCO2 for all patients with measurements on days 1 and 3 of life.
12. Lower oxygen saturation goals. The 3 benchmark centers accepted lower oxygen saturation targets of 85% to 90%. In addition, caregivers were noted, during the site visits, to observe infants during desaturation events without increasing oxygen supplementation, with the goal of allowing the infant to resolve the desaturation event independently. Level of evidence: indeterminate50; metric: mean PaO2 measured at 4 time points daily during the first 7 days of life for all those with measurements.
13. High-saturation alarm set at 95%. The 3 benchmark centers set the oxygen saturation alarm at 95% and rapidly weaned patients from supplemental oxygen when the saturation range exceeded the target. Level of evidence: not available; metric: monthly random audits of enabled saturation alarms for all infants enrolled in the trial who were receiving oxygen at control and intervention sites.
14. Avoidance of routine suctioning for patients undergoing ventilation. The 3 benchmark centers avoided suctioning for patients undergoing ventilation that was set by a time schedule and instead assessed the patients at intervals and suctioned as needed. In addition, 2 centers used an inline suction system. Level of evidence: indeterminate51; metric: audit during site visits.
15. Avoidance of hand-bagging for patients undergoing ventilation. One benchmark center prohibited the practice of ventilating with an anesthetic or self-inflating bag, as a method to limit exposure to unregulated tidal volumes. Level of evidence: not available; metric: audit during site visits.
16. Nonroutine use of analgesics/sedatives for patients undergoing ventilation. None of the benchmark centers routinely administered analgesics/sedatives to patients treated with mechanical ventilation. Instead, comfort techniques such as swaddling were used. Level of evidence: indeterminate52; metric: audit during site visits.
17. Prophylactic use of methylxanthines before extubation. Routine administration of methylxanthines before extubation was not used at the benchmark centers. Level of evidence: level 2; metric: not selected.
18. Consensus regarding ventilatory management. At 2 benchmark centers, there was high consistency in ventilator management practices among individual physicians and teams. Level of evidence: level 253; metric: not selected.
Fluid and Nutrition Practices
19. Limited intravenous fluids. At all 3 benchmark centers, intravenous fluids were initiated at 80 to 100 mL/kg per day and were adjusted by using daily weight goals. The intent was for weight loss to occur in the first 7 days of life. Level of evidence: level 254; metrics: mean intravenous fluid intake on days 1 and 3 and percentages of weight loss on days 3 and 7.
20. High-humidity environments. Two benchmark centers used high-humidity environments to limit intravenous fluid administration. Level of evidence: level 5; metric: audit during site visits.
21. Limited volume expansion to treat low blood pressure. One benchmark center used protocols to decrease treatment of low blood pressure, to limit intravenous fluid administration. Level of evidence: level 5; metric: not selected.
22. Aggressive approach to patent ductus arteriosus. One benchmark center used prophylactic indomethacin treatment, and 1 center screened for patent ductus arteriosus and treated patients with indomethacin in the first 24 hours. The third center had high rates of patent ductus arteriosus ligation. The patent ductus arteriosus was ligated for patients who experienced failure of indomethacin treatment and had persistent oxygen requirements exceeding 40%. Level of evidence: level 5; metric: not selected.
23. Early introduction of parenteral protein intake. Two benchmark centers began total parenteral nutrition administration on admission to the NICU. Level of evidence: level 5; metric: not selected.
24. Early introduction of lipids. Two benchmark centers began intravenous lipid administration by 24 hours of age. Level of evidence: indeterminate; metric: not selected.
25. Full total parenteral nutrition with increasing enteral feeding. Two benchmark centers maintained total parenteral nutrition at 100 to 120 mL/kg per day as enteral nutrition was increased. Level of evidence: not available; metric: not selected.
26. Frequent use of human milk. Two benchmark centers promoted the use of human milk and had programs in place to support human milk feedings. Both centers had human milk administered to >60% of their patients. Level of evidence: indeterminate55; metric: not selected.
27. Vitamin A prophylaxis. One benchmark center used vitamin A prophylaxis. The other 2 centers chose not to implement prophylaxis because their BPD rates were low and nurses objected to intramuscular injections. Level of evidence: level 2; metric: audit during site visits.
| ACKNOWLEDGMENTS |
|---|
Study participants were as follows: advisory committee: M. C. Walsh, MD, MS, A. A. Fanaroff, MD, Case Western Reserve University (Cleveland, OH); A. H. Jobe, MD, PhD, University of Cincinnati (Cincinnati, OH); R. Higgins, MD, National Institute of Child Health and Human Development (Bethesda, MD); N. Finer, MD, University of California, San Diego (San Diego, CA); K. Poole, PhD, Research Triangle Institute (Research Triangle Park, NC); training committee: Duncan Neuhauser, PhD, Case Western Reserve University (Cleveland, OH); Leslie Clarke, RN, MSN, MBA, Rainbow Babies & Children's Hospital (Cleveland, OH); Lynn Lostocco, RN, MSN, National Association of Children's Hospitals (Warwick, RI); Neil Finer, MD, University of California, San Diego (San Diego, CA); intervention centers: S. Nadya Kazzi, MD, MPH, K. Hayes-Hart, RN, M. Betts, RRT, S. Shankaran, MD, G. Muran, RN, Wayne State University (Detroit, MI); A. Laptook, MD, M. Martin, RN, J. Allen, RRT, University of Texas Southwestern (Dallas, TX); W. A. Engle, MD, L. Miller, RN, R. Hooper, RRT, J. Lemons, MD, Indiana University (Indianapolis, IN); W. Rhine, MD, C. Kibler, RN, J. Parker, RRT, D. Stevenson, MD, B. Ball, BS, Stanford University (Palo Alto, CA); M. Rasmussen, MD, M. Grabarczyk, BSN, C. Joseph, RRT, K. Arnell, BSN, Sharp Mary Birch Hospital for Women (San Diego, CA); G. Heldt, MD, R. Bridge, RN, J. Goodmar, RRT, N. Finer, MD, C. Henderson, RCP, University of California, San Diego (San Diego, CA); S. Buchter, MD, M. Berry, RN, I. Seabrook, RRT, B. Stoll, MD, E. Hale, RN, Emory University (Atlanta, GA); benchmark centers: S. Duara, MD, R. Everette, RN, University of Miami (Miami, FL); W. Carlo, MD, M. Collins, RN, University of Alabama (Birmingham, AL); W. Oh, MD, A. Hensman, RN, Brown University (Providence, RI); control centers: M. T. O'Shea, MD, MPH, N. Peters, RN, Wake Forest University (Winston-Salem, NC); J. Tyson, MD, MPH, G. McDavid, RN, University of Texas (Houston, TX); A. A. Fanaroff, MD, M. C. Walsh, MD, MS, N. Newman, RN, Case Western Reserve University (Cleveland, OH); D. Phelps, MD, L. Reubens, RN, University of Rochester (Rochester, NY); R. A. Ehrenkranz, MD, P. Gettner, RN, Yale University (New Haven, CT); C. Michael Cotten, MD, K. Auten, RN, Duke University (Durham, NC); E. Donovan, MD, C. Grisby RN, University of Cincinnati (Cincinnati, OH); statistical center: Qing Yao, PhD, Ken Poole, PhD, Research Triangle Institute (Research Triangle Park, NC).
We thank the nursing and medical staff members and parents of the patients in the units for their diligent implementation of this complex trial. We also thank the Neonatal Research Network Research coordinators and study nurses, without whom the trial could not have been completed.
| FOOTNOTES |
|---|
Address correspondence to Michele Walsh, MD, MS, Rainbow Babies & Children's Hospital, 11100 Euclid Ave, Mailstop 6010, Cleveland, OH 44106-6010. E-mail: michele.walsh{at}cwru.edu
The authors have indicated they have no financial relationships relevant to this article to disclose.
| REFERENCES |
|---|
|
|
|---|
| ||||||