ELECTRONIC ARTICLE |
,
,||
,¶
* Department of Cardiology, Children's Hospital Boston, Boston, Massachusetts
Clinical Research Program, Children's Hospital Boston, Boston, Massachusetts
Department of Pediatrics, Harvard Medical School, Boston, Massachusetts
|| Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts
¶ Department of Medicine, Children's Hospital Boston, Boston, Massachusetts
# Department of Neurology, Children's Hospital Boston, Boston, Massachusetts
** Department of Neurology, Harvard Medical School, Boston, Massachusetts
| ABSTRACT |
|---|
|
|
|---|
Methods. In the Boston Circulatory Arrest Study, 135 children with D-transposition of the great arteries repaired by arterial switch operation were administered the Bayley Scales of Infant Development and the Fagan Test of Infant Intelligence at 1 year of age and the Wechsler Intelligence Scale for Children, Third Edition and the Wechsler Individual Achievement Test at 8 years.
Results. Although most 1-year test scores were significantly associated with 8-year test scores, the amounts of shared variance were modest (<10%). All 1-year test scores had poor sensitivity (16%32%) and poor positive predictive value (35%42%) but good specificity (80%93%) and negative predictive value (78%79%). More than half of the children with low scores at 8 years (
85) had had scores >84 at 1 year.
Conclusion. This pattern suggests that although test scores at 1 year are modestly associated with test scores at 8 years, many children who are at risk for poor late outcomes will not be identified on the basis of 1-year test scores. Long-term follow-up of children with CHD is necessary to draw inferences about the developmental sequelae of preoperative, intraoperative, and postoperative factors.
Key Words: infant heart surgery prediction development congenital heart defects
Abbreviations: BSID, Bayley Scales of Infant Development DTGA, D-transposition of the great arteries MDI, Mental Development Index PDI, Psychomotor Development Index CI, confidence interval PPV, positive predictive value NPV, negative predictive value OR, odds ratio
Children with congenital heart defects are at increased risk for later educational and behavioral difficulties,1 although it is not certain whether these difficulties are attributable to aspects of the reparative surgeries that they undergo in infancy, preexisting central nervous system anomalies, or a combination of these and other factors. Regardless of cause, the early identification of children who are at developmental risk is necessary to implement appropriate diagnostic evaluations and interventions in a timely manner.
One obstacle to early identification of cognitive problems is the view that an infant's performance on the most commonly used neurodevelopmental tests, such as the Bayley Scales of Infant Development (BSID), have poor predictive validity.2 Scores on sensory-motor tests such as the BSID are, however, more predictive among high-risk compared with low-risk children.3 Infant tests that assess information processing skills, such as the Fagan Test of Infant Intelligence, are purported to be better than sensory-motor tests in identifying children who are at risk for later mental retardation.4
We present here secondary analyses of data collected as part of a clinical trial comparing the developmental and neurologic outcomes associated with the 2 major vital organ support methods used in infant heart surgery: deep hypothermia with either total circulatory arrest or continuous low-flow cardiopulmonary bypass.5 We address 2 questions: (1) to what extent can the developmental status of children at 8 years be predicted on the basis of their neurodevelopmental status at 1 year? and (2) in a cohort of children with a history of open heart surgery in infancy, is a test that assesses early information processing skills more predictive than a test that assesses early sensory-motor skills?
| METHODS |
|---|
|
|
|---|
Data Collection
One-Year Assessment
Two neurodevelopmental tests were administered by 1 of 2 examiners.6
1. BSID.7 This is a widely used test of infant development. It yields 2 scores, the Mental Development Index (MDI) and the Psychomotor Development Index (PDI), both of which have a normative population mean of 100 and an SD of 16. Thus, a score
84, 1 or more SDs below the mean, is considered low. In the restandardization of the BSID, however, the means of contemporary infants on this version of the test were found to be considerably higher: 112 for MDI and 110 for PDI.8 Thus, an MDI or PDI score of 84 on the 1969 version of the BSID is, in effect,
1.5 SD below expected. The MDI assesses skills such as imitation, vocalization, and rudimentary problem solving. The PDI assesses fine and gross motor skills.
2. Fagan Test of Infant Intelligence.9 This test assesses visual recognition memory using a 10-trial habituation format. In each trial, familiarization to a photograph of a face is followed by paired presentation of the face with either a photograph of a similar but distinct face or with the same face in a different orientation. By observing the infant's corneal reflections of the 2 stimuli, a "percent Novelty Preference" score is obtained and summed across trials. A score <53% identifies children who are at risk for later mental retardation.
Eight-Year Assessment
A single examiner (D.C.B.) conducted all 8-year evaluations10 and did not review a child's 1-year test results before conducting an evaluation. The evaluation included the following tests.
1. Wechsler Intelligence Scale for Children, Third Edition.11 This test yields 3 IQ scores: Full Scale, Verbal, and Performance. Each score is normed to have a mean of 100 and an SD of 15. Thus, a score
85 is 1 or more SDs below the population mean and therefore was considered to be low.
2. Wechsler Individual Achievement Test.12 This test of key domains of academic achievement yields standard scores (means of 100, SD of 15) for Composite Reading and Composite Mathematics. These 2 composite scores were averaged to derive an overall Achievement score.
Statistical Analyses
Pearson correlation coefficients and linear regression methods were used to estimate the associations among the 3 neurodevelopmental test scores obtained at 1 year (MDI, PDI, and percent Novelty Preference) and 4 developmental test scores obtained at 8 years (Full-Scale IQ, Verbal IQ, Performance IQ, and Achievement). For each linear regression analysis, a 95% confidence interval (CI) was calculated for the predicted score at 8 years of age given the test score at 1 year of age. The odds (and 95% CI) of having a low Full-Scale IQ score at 8 years of age (
85), given a low test score at 1 year (
84 for MDI and PDI, <53% for novelty preference) was computed. In addition, for each 8-year outcome, the sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated for low scores on each of the three 1-year tests.
| RESULTS |
|---|
|
|
|---|
At age 8, 165 of the 171 enrolled children were alive. Of the 160 children living in the United States, 155 (97%) were evaluated, including all 135 children whose data were included in analyses of developmental data obtained at 1 year of age.
The mean scores (± SD) on the MDI, PDI, and Novelty Preference were 104.9 ± 14.5, 95.2 ± 15.6, and 58.5 ± 7.7, respectively. Twelve (9%) children had MDI scores
84, 28 (21%) children had PDI scores
84, and 23 (23%) children had Novelty Preference scores <53%.
BSID Scores and 8-Year Outcomes
Both MDI (Fig 1) and PDI (Fig 2) scores were significantly associated with IQ and achievement test scores at age 8, although the correlations were modest in magnitude, ranging from 0.33 (Full-Scale IQ and MDI) to 0.16 (Achievement and PDI). One-year MDI or PDI scores thus accounted for, at most, 10% of the variance in these 8-year outcomes. For all pairs of 1-year and 8-year scores, the 95% CI for the predicted 8-year scores was wide. In other words, among children who achieved a given score at 1 year, considerable variability would be expected in the IQ and achievement scores achieved at 8 years.
|
|
85) at age 8 was 2.5 times the risk among children with MDI scores >85 (95% CI: 0.88.6; sensitivity: 16%; specificity: 93%; PPV: 42%; NPV: 78%). Among children with a low PDI score at 1 year, the estimated odds of having a low Full-Scale IQ score
85 at 8 years was 2.1 (95% CI: 0.95.3; sensitivity: 31%; specificity: 82%; PPV: 36%; NPV: 79%). Among children with low scores on both MDI and PDI at 1 year (n = 5), the estimated odds of having a Full-Scale IQ score
85 at 8 years old was 5.2 (95% CI: 0.832.4).
We also calculated the odds of having a low Full-Scale IQ at 8 years using an MDI score
96 and a PDI score
94 as cutoffs. As noted, because of the upward drift in infants' scores on the 1969 version of the BSID, these scores are
1 SD below contemporary means. For MDI, the estimated odds ratio (OR) was unchanged (2.5), although the 95% CI was narrower (1.15.8) as a result of the larger number of infants classified as having a low score. For PDI, the OR was only 1.1, approximately half the size of the OR obtained using 84 as the cutoff. As with MDI, the 95% CI was narrower (0.52.5).
Fagan Test of Infant Intelligence Novelty Preference Score and 8-Year Outcomes
Novelty Preference scores were significantly correlated with Full-Scale (P = .04) and Performance IQ (P = .04) scores at age 8 and marginally correlated with Verbal IQ (P = .055) and Achievement (P = .06) scores. The correlations were modest (0.190.21), indicating that the amounts of variance shared between Novelty Preference score and all 8-year outcomes were <5%. Among the 23 children who achieved "at risk" scores at age 1 year, the OR for achieving a Full-Scale IQ score of 85 or lower was 1.9 (95% CI: 0.75.2; sensitivity: 32%; specificity: 80%; PPV: 35%; NPV: 78%). The odds of having a full-scale IQ
85 at 8 years given low scores on all 3 infant test scores (MDI, PDI, and Novelty Preference; n = 3) was 6.4, but the 95% CI was very wide (0.674.4).
| DISCUSSION |
|---|
|
|
|---|
84) were between 2 and 3 times as likely as children with scores of 85 or greater to achieve a Full-Scale IQ score at 8 years that was 1 or more SDs below the population mean. In general, a low MDI score was more predictive of developmental difficulties at 8 years of age than were either a low PDI score or a low Novelty Preference score on the Fagan Test. The greater prediction of later IQ afforded by the MDI score than the PDI score is not surprising, given that, at 1 year of age, the PDI score reflects largely a child's gross motor function. The fact that the MDI score was more predictive than the Novelty Preference score is somewhat more surprising in light of the claim that scores in infancy on information processing tests are better predictors of later IQ than are scores on sensory-motor tests. Although these 3 scores varied somewhat in their characteristics as screening tests, all had poor sensitivity (16%32%) and poor PPV (35%42%). However, all had reasonably good specificity (80%93%) and NPV (78%79%). This pattern suggests that a relatively small subset of children who scored 1 or more SDs below the population norm at age 1 year continued to score poorly at 8 years of age. Furthermore, a large majority of children who scored in the normal range at 1 year also scored in the normal range at 8 years. Perhaps most important, however, the relatively low PPVs of 1-year test scores indicates that the majority of children with reduced performance at 8 years had test scores that were in the normal range at 1 year. Therefore, although children with low scores at 1 year should be placed under the closest developmental surveillance, children with higher scores should also be monitored for indications of emerging problems.
The predictive utility of the BSID, a test of sensory-motor skills, did not differ appreciably from the predictive utility of the Fagan Test, an information-processing test. The OR associated with a low MDI score at 1 year was
30% higher than the OR associated with a low Fagan Test score (2.5 vs 1.9). Although a low score on the Fagan test had twice the sensitivity of a low MDI score, it had somewhat lower specificity and PPV. The screening test statistics associated with a low PDI score were very similar to those associated with a low score on the Fagan Test. It seemed that accuracy of prediction was greater when classification was based on MDI and PDI scores together or on MDI, PDI, and Fagan Test scores together. For both classification strategies, the ORs exceeded those obtained when classification was based on a single test score. Because of the relatively small number of children who achieved low scores on >1 test, however, the upper bounds of the CIs for these ORs were very wide (up to 74). In a larger sample than ours, joint classifications might have proved to yield significantly more accurate predictions of 8-year scores than classifications based on single test scores.
We used the expected mean and SD published in the manual for the 1969 version of the BSID to identify children with low scores at 1 year. As noted earlier, the upward drift over time in children's scores on this test has resulted in a current expected mean of
112 for the MDI and 110 for the PDI.8 Although the secular trend in test scores does not affect the internal validity of our findings, it does mean that our definition of a "low" score at age 1 year (<84) corresponds to >1.5 SDs below the expected score rather than 1 SD. Defining a low MDI score as one that was
96 (16 points lower than 112) afforded prediction that was the same as that obtained using
84 as the cutoff (ie, ORs of 2.5). This was not the case when a low PDI score was defined as one that was
94 (16 points lower than 110), as the OR was only 1.1 (vs 2.1 using the lower cutoff).
Our findings are similar to those of other studies of the predictive utility of scores achieved on neurodevelopmental tests by children at increased medical risk. In a study of 196 low birth weight, premature infants,13 low (
84) scores on the MDI of the BSID at 4 months of age had the following screening characteristics for predicting cognitive delay between 3 and 8 years of age: sensitivity, 39%; specificity, 92%; PPV, 52%; and NPV, 87%. In a study of 196 infants for whom Novelty Preference scores on the Fagan Test at 7 to 10 months of age and IQ scores at 5 years of age were available,14 the correlations between test scores were 0.20 and 0.16 at the 2 clinics that contributed study subjects. The sensitivity of novelty preference scores for identifying children with IQ scores below the 10th percentile was 0 at 1 clinic and 22% at the other.
In our study cohort, fewer than half of the children who had low test scores at 1 year had low test scores at 8 years (true positives), and nearly all of those who had IQ scores within normal limits at 8 years had scores within the normal range at 1 year (true negatives). More important, however, more than half of the children who had low IQ scores at 8 years did not have low scores at 1 year (false negatives). Targeting children with low scores at 1 year for intervention services will capture a substantial subgroup of the children at increased risk of cognitive problems at school age. At the same time, however, it is clear that conclusions drawn about the long-term developmental outcomes of children with congenital heart disease based only on evaluations conducted at 1 year would, for many children, be incorrect. Long-term follow-up remains necessary to discern the contributions of preoperative, intraoperative, and postoperative factors to children's developmental risk.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Reprint requests to (D.C.B.) Department of Neurology, Farley Basement, Box 127, Children's Hospital Boston, 300 Longwood Ave, Boston, MA 02115. E-mail: david.bellinger{at}childrens.harvard.edu
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
G. Wernovsky Improving neurologic and quality-of-life outcomes in children with congenital heart disease: Past, present, and future J. Thorac. Cardiovasc. Surg., February 1, 2008; 135(2): 240 - 242. [Full Text] [PDF] |
||||
![]() |
J. W. Newburger, R. A. Jonas, J. Soul, B. D. Kussman, D. C. Bellinger, P. C. Laussen, R. Robertson, J. E. Mayer Jr, P. J. del Nido, E. A. Bacha, et al. Randomized trial of hematocrit 25% versus 35% during hypothermic cardiopulmonary bypass in infant heart surgery J. Thorac. Cardiovasc. Surg., February 1, 2008; 135(2): 347 - 354. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Karsdorp, W. Everaerd, M. Kindt, and B. J.M. Mulder Psychological and Cognitive Functioning in Children and Adolescents with Congenital Heart Disease: A Meta-Analysis J. Pediatr. Psychol., June 1, 2007; 32(5): 527 - 541. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Gaynor, G. Wernovsky, G. P. Jarvik, J. Bernbaum, M. Gerdes, E. Zackai, A. S. Nord, R. R. Clancy, S. C. Nicolson, and T. L. Spray Patient characteristics are important determinants of neurodevelopmental outcome at one year of age after neonatal and infant cardiac surgery J. Thorac. Cardiovasc. Surg., May 1, 2007; 133(5): 1344 - 1353. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Gaynor, G. P. Jarvik, J. Bernbaum, M. Gerdes, G. Wernovsky, N. B. Burnham, J. A. D'Agostino, E. Zackai, D. M. McDonald-McGinn, S. C. Nicolson, et al. The relationship of postoperative electrographic seizures to neurodevelopmental outcome at 1 year of age after neonatal and infant cardiac surgery J. Thorac. Cardiovasc. Surg., January 1, 2006; 131(1): 181 - 189. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. H. Schultz, G. P. Jarvik, G. Wernovsky, J. Bernbaum, R. R. Clancy, J. A. D'Agostino, M. Gerdes, D. McDonald-McGinn, S. C. Nicolson, T. L. Spray, et al. Effect of congenital heart disease on neurodevelopmental outcomes within multiple-gestation births J. Thorac. Cardiovasc. Surg., December 1, 2005; 130(6): 1511 - 1516. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||