ARTICLE |
a Robert Wood Johnson Clinical Scholars Program
b Department of Pediatrics
c Child Health Institute, University of Washington, Seattle, Washington
d Veterans Affairs Outcomes Group, White River Junction, Vermont
e Center for the Evaluative Clinical Sciences, Dartmouth Medical School, Hanover, New Hampshire
| ABSTRACT |
|---|
|
|
|---|
METHODS. We obtained data from the National Newborn Screening and Genetics Resource Center regarding the screening practices for each of the 50 states, to determine the number of mandated disorders added to state newborn screening panels between 1995 and 2005. Combining these data with reported specificities from the literature and the number of births in each state, we estimated the number of infants who would have received false-positive results through screening with tandem mass spectrometry in 2005.
RESULTS. The average state mandated screening for 5 disorders in 1995 (range: 08 disorders). Wyoming was the only state that decreased its panel size over the next decade. Kansas and Texas were the only states that did not add disorders to their panels between 1995 and 2005; the average state added 19. Iowa, Minnesota, Mississippi, South Dakota, and Tennessee each added
40 disorders. Assuming that an individual test for a disorder had a specificity of 99.995%, we estimated that
2575 infants would have received false-positive results through screening with tandem mass spectrometry in 2005. If specificity was assumed to be 99.9%, then the number increased to >51000.
CONCLUSIONS. State newborn screening programs have expanded dramatically in the past decade. Because the benefit of such testing may be unclear in some cases and because the number of infants who may receive false-positive results and may be labeled falsely as having disease is potentially sizeable, a more cautious approach is needed.
Key Words: newborn screening metabolic disorders false-positive results
In the United States, decisions about newborn screening are made by 50 different state legislatures. Shortly after Massachusetts began a mass voluntary screening program for phenylketonuria in 1962, other states passed legislation mandating newborn phenylketonuria screening.1 Currently, all states have laws mandating that every newborn be screened for selected inborn errors of metabolism and endocrine disorders (among the most common are phenylketonuria, congenital adrenal hyperplasia, galactosemia, homocystinuria, hypothyroidism, and maple syrup urine disease).2 However, because each state maintains the authority to decide which tests are mandated, the disorders screened vary from state to state.
Approximately 1 decade ago, the advent of tandem mass spectrometry made it possible for states to test newborns for multiple disorders more efficiently. Previous testing methods, such as chromatography and bacterial inhibition assays, were able to detect only 1 disorder at a time. Now, with a single drop of dried blood, a tandem mass spectrometer can detect numerous metabolites that accumulate with various inborn errors of metabolism.3,4 Tandem mass spectrometry is quick (
12 minutes),5 relatively inexpensive ($10$30),5,6 and readily scalable, because increasing the number of conditions screened does not change the time or price substantially.5
How the state newborn screening programs have changed their screening practices in response to the opportunities presented by this technology has not been evaluated systematically. In this article, we examine the newborn screening programs in each of the 50 states and we explore how the number of mandated disorders screened has changed in the past decade. We also consider the number of infants who would receive false-positive results and so might be labeled falsely as having disease, given the current number of mandated tests that are performed with tandem mass spectrometry. In light of recent calls for additional research on expanded newborn screening,7,8 we think that this article represents an important first step in understanding the potential scope of false-positive results with expanded newborn screening.
| METHODS |
|---|
|
|
|---|
Our approach to counting the number of disorders screened was as follows: (1) only disorders screened with dried blood spots were counted (eg, hearing tests were excluded), (2) only universally mandated disorders were counted (ie, disorders screened on a voluntary basis, those offered only to select populations, and those detected as a byproduct of screening with multiple-reaction monitoring were excluded), (3) the entry "variant hemoglobins" was excluded from our count because the report did not indicate the number of variants screened, and (4) the counting method was consistent for 1995 and 2005. This latter criterion was necessary because some disorders were recorded as a single disorder in 1995 but as multiple disorders in 2005 (eg, hemoglobinopathies in 1995/sickle cell disease, S-ß-thalassemia, and sickle-C disease in 2005; galactosemia in 1995/classic galactosemia, galactose epimerase deficiency, and galactokinase deficiency in 2005; and phenylketonuria in 1995/benign hyperphenylalaninemia, defects of biopterin cofactor biosynthesis and regeneration, and phenylketonuria in 2005). To make the count consistent across years, and because we were unable to determine from the 1995 National Newborn Screening Report which states screened for the additional disorders listed in 2005, we counted hemoglobinopathies, galactosemia, and phenylketonuria each as 1 disorder in both 1995 and 2005.
When states mandated screening for subtypes and variants, we counted these as individual disorders (eg, cobalamin A/B, cobalamin C/D, and methylmalonyl mutase were counted as 3 separate disorders). We did this because an infant has the potential to be diagnosed with any of these disorders, and this potential varies according to state.
Counting of Individual Tests
We counted each disorder as a single test unless multiple disorders were screened through detection of identical metabolites by tandem mass spectrometry. In such cases, we counted the group of disorders as 1 test. To determine which metabolites are evaluated in screening for the various disorders, we obtained metabolite screening profiles from the Pediatrix Screening Laboratory (www.pediatrix.com), a private laboratory that specializes in newborn screening with tandem mass spectrometry. Profiles not available from Pediatrix were obtained from states that screened for the particular disorder. We found the following disorders to be screened with the same metabolites (grouped according to identical metabolites): phenylketonuria, benign hyperphenylalaninemia, and defects of biopterin biosynthesis and regeneration; sickle cell disease, S-ß-thalassemia, and sickle-C disease; classic galactosemia (transferase deficient), galactose epimerase deficiency, and galactokinase deficiency; tyrosinemia types I, II, and III; homocystinuria and hypermethioninemia; long-chain hydroxyacyl-CoA dehydrogenase deficiency and trifunctional protein deficiency; isovaleric acidemia and 2-methylbutyryl-CoA dehydrogenase deficiency; short-chain acyl-CoA dehydrogenase deficiency, isobutyryl-CoA dehydrogenase deficiency, and ethylmalonic encephalopathy; carnitine acylcarnitine translocase deficiency and carnitine palmitoyl transferase deficiency type 2; 3-hydoxy 3-methylglutaryl-CoA lyase deficiency, 3-methylcrotonyl-CoA carboxylase deficiency, ß-ketothiolase deficiency, multiple carboxylase deficiency, and 3-methylglutaconic aciduria; argininosuccinate acidemia, citrullinemia type I, and citrullinemia type II; methylmalonic acidemias (including cobalamin A/B, cobalamin C/D, and methylmalonyl-CoA mutase) and propionic acidemia.
Specificity of Individual Tests
We performed a literature search for publications that provided specificities for individual tests performed with tandem mass spectrometry for newborn screening. We searched Medline from 1966 to May 1, 2005, using the following search strategies with limits to English-language publications: (1) the MeSH headings "spectrometry, mass, electrospray ionization" and "sensitivity and specificity" and "neonatal screening" and (2) the keywords "tandem mass spectrometry" and "specificity" and "newborn screening." Our search strategy returned 54 citations. We then reviewed the bibliographies of all relevant articles to find additional studies of specificity.
We reported the "gold standard" for each specificity as laboratory confirmation if it entailed serum, urine, or enzymatic analysis or comparison with results of another laboratory technique (such as fluorometry) and as genetic confirmation if it involved DNA analysis. Our findings for the various individual tests are summarized in Table 1.
|
|
Number of False-Positive Results
To estimate the number of expected false-positive tests in each state, we combined data on (1) the number of screening tests performed with tandem mass spectrometry that were mandated by the state in 2005, (2) the 3 scenarios for the specificity of an individual test (as described above), and (3) the number of births in each state (from the National Vital Statistics Reports preliminary birth data for 2004).9 The vast majority of the specificities from the literature that we used to estimate false-positive results (including the specificity for our best-case scenario) were determined from positive results that required either obtaining a repeat specimen from the newborn or referring the newborn for additional confirmatory testing.
For each state, we used the following equation: no. of false-positive results = [1 [1 (1 prevalence)(1 specificity)]k] x (no. of births in state), where k represents the number of tests performed with tandem mass spectrometry and prevalence refers to the average prevalence for an individual disorder screened with tandem mass spectrometry. We calculated the average prevalence for an individual disorder (1 case per 111997 infants) from a pilot study of expanded newborn screening with tandem mass spectrometry in New England,10 where the overall prevalence for all disorders screened was 1 case per 4870 infants. We used the following equation to calculate the average prevalence for an individual disorder: average prevalence for individual disorder = 1 (1 overall prevalence)(1/no. of disorders).
Because the most recent birth data are from 2004 and the numbers of births are increasing in most states, our estimates of the number of infants who receive false-positive results in 2005 are biased downward. Because the National Newborn Screening Reports published since the expansion of newborn screening do not contain data on the number of false-positive results, we were not able to compare our estimates with the actual number of recorded false-positive results.
| RESULTS |
|---|
|
|
|---|
30 disorders during the period; 5 states (Iowa, Minnesota, Mississippi, South Dakota, and Tennessee) added
40. Figure 1 shows that the states with the most rapid growth in testing were generally in the coastal regions and the Midwest, whereas states in the Great Plains and Rocky Mountain regions displayed more modest growth.
|
|
2575 infants in the United States would receive false-positive results each year and so might be labeled falsely as having disease. With the intermediate scenario (specificity: 99.95%), we estimated that
25644 infants would receive false-positive results. With the worst-case scenario (specificity: 99.9%), the number increased to >51059 (Table 4).
|
| DISCUSSION |
|---|
|
|
|---|
Our study is limited because the literature on the specificity of newborn screening tests is incomplete. Specificity estimates are simply not readily available for every test now used in state newborn screening panels. Furthermore, frequently the available data are of questionable quality; the standard method may not be specified and/or the reporting may not be explicit (ie, specificity needed to be calculated with values that appeared in various passages in the text). We found no data that addressed the question of whether the probability of an infant receiving a false-positive result for one condition influences the probability of receiving a false-positive result for other conditions.
To deal with these data limitations, we were forced to make a number of simplifying assumptions. First, to deal with the incomplete (and probably unreliable) database on the specificity of individual tests, we assumed the same specificity (derived from reported specificities for tandem mass spectrometry) for every test in a state's panel. Although not all tests are performed with tandem mass spectrometry, the majority are. We addressed the uncertainty about the true specificities by testing a range of values that are plausible, on the basis of the available data. Given the limited data available, we were conservative in our choice of specificities for the best-case and intermediate scenarios, and we are confident that most of the individual specificities, if known, would lie either at or below these specificity estimates. We suspect that the true number of false-positive results, if known, would lie somewhere between our best-case and worst-case scenarios. Second, we assumed that the probabilities of receiving false-positive results for the various tests are independent. We are confident that our most conservative estimate,
2575 cases per year, can be viewed as a best-case scenario for the number of false-positive results that occur currently. However, as states continue to add more tests to their state panels, to enroll a majority of the population in pilot/voluntary programs, and to identify disorders as byproducts of multiple-reaction monitoring screening,2 our predictions will underestimate the number of false-positive results obtained.
We were unable, in the context of this study, to address specifically the consequences experienced by the parents of infants who might have a false-positive test. Generally, when an initial abnormal result is detected, the laboratory contacts the infant's physician, who then notifies the parents.17 The urgency and extent of follow-up testing depend on the abnormality. For some families, the journey may end with a repeated blood spot specimen that yields normal results. We suspect, however, that the journey from initially positive results to subsequently normal results is not an easy one for some families. The ultimate diagnosis may be difficult to achieve.18,19 Positive test results may initiate diagnostic cascades, when confirmatory testing produces an ambiguous result and one test stimulates another as more questionable abnormalities are found.20 Although some have argued that newborn screening will help to avoid this problem later in life,21 there is at least some reason to be concerned that it will increase the number of diagnostic cascades early in life.
Furthermore, studies suggest that false-positive results have sequelae even after subsequent testing confirms that the infant is normal. More than one third of parents still have concerns about the health of their infant,11 on average, mothers report more stress,16 the average parent-child relationship is more dysfunctional,16 and infants are hospitalized more frequently for unrelated illnesses.16 Research suggests that certain steps might lessen the potential anxiety of families that is triggered by a false-positive result.16 It is possible that some parents are not well prepared to deal with the screening results. Studies have demonstrated that education of parents about false-positive results is lacking.22 Moreover, although many states do not require informed consent for newborn screening, a few do not even require parental notification before the initial specimen is obtained.17 As we proceed with newborn screening in this country, we must provide both sufficient and appropriate resources to families.
By focusing solely on false-positive results, we have not considered the infants who have benefited from screening. Although it is not clear that early detection decreases morbidity or mortality rates for all of the disorders being screened,23 some infants may benefit from expanded newborn screening. Furthermore, knowing how many infants are identified as having disease through expanded newborn screening would allow us to place the absolute number of false-positive results in a useful relative context, such as a positive predictive value.24 Unfortunately, recent National Newborn Screening Reports2,25 do not contain data on true-positive results with expanded newborn screening, and producing estimates is difficult because, for many conditions, prevalence data are unknown, vary widely, or are based heavily on case report findings.21
It is also important to note that, although some infants may benefit from screening, other infants may be diagnosed as having pseudodisease, that is, infants whose positive test results are "true" in confirmatory testing but who are diagnosed with a disease that will never become clinically apparent. These infants and their parents suffer the harms of unnecessary diagnosis (with its attendant psychological burden) and unnecessary treatment. Although the concept is relatively unfamiliar, pseudodisease has been reported in the context of newborn screening.2628 Additional research regarding the natural history and phenotypic variation of some of these diseases is needed.
Given some of the uncertainties about benefits and the unintended side effects of false-positive tests and pseudodisease, we urge a more cautious approach to newborn screening. There is much we need to learn, and additional research is critical. As a first step, better estimates of actual specificities achieved in real-world settings are needed. We hope that all states will document both their true-positive and false-positive results. It also will be helpful to examine more closely the course of events after infants receive false-positive results. Their care accrues a cost to newborn screening that needs to be quantified adequately. Finally, research also should be directed toward the questions of benefits and pseudodisease, perhaps by making use of the natural experiment afforded by states with very different screening practices. Because newborn screening can be a double-edged sword, researchers and policymakers must be attentive to both benefit and harm. Undoubtedly, the tradeoffs will be challenging; we must be careful to avoid harming thousands of children inadvertently in an effort to help a few.
| ACKNOWLEDGMENTS |
|---|
We thank Tom Koepsell, MD, MPH, for his helpful suggestions and guidance regarding methods. We thank Wylie Burke, MD, PhD, for her thoughtful comments and suggestions regarding preparation of the manuscript. We thank Michael Glass, MS, Director of Newborn Screening for Washington State Department of Health, for his guidance regarding technical aspects of tandem mass spectrometry and newborn screening. We thank Donna Williams of the National Newborn Screening and Genetics Resource Center for her assistance in obtaining the National Newborn Screening Reports.
| FOOTNOTES |
|---|
Address correspondence to Beth A. Tarini, MD, University of Michigan Health System, General Pediatrics, 300 N Ingalls St, Room 6C11, Ann Arbor, MI 48109-0456. E-mail: btarini{at}umich.edu
The authors have indicated they have no financial relationships relevant to this article to disclose.
The views expressed herein do not necessarily represent the views of the Robert Wood Johnson Foundation, the Department of Veterans Affairs, or the US government.
Dr Tarini had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
K. A. Chapman, J. Ganesh, and C. Ficicioglu A False-positive Newborn Screening Result: Goat's Milk Acidopathy Pediatrics, July 1, 2008; 122(1): 210 - 211. [Full Text] [PDF] |
||||
![]() |
B. A. Tarini, D. A. Christakis, and H. G. Welch A False-positive Newborn Screening Result: Goat's Milk Acidopathy: In Reply Pediatrics, July 1, 2008; 122(1): 211 - 211. [Full Text] [PDF] |
||||
![]() |
H. Bishop Hubbard Policy Issues Related to Expanded Newborn Screening: A Review of Three Genetic/Metabolic Disorders Policy Politics Nursing Practice, August 1, 2007; 8(3): 201 - 209. [Abstract] [PDF] |
||||
![]() |
B. A. Tarini The Current Revolution in Newborn Screening: New Technology, Old Controversies Arch Pediatr Adolesc Med, August 1, 2007; 161(8): 767 - 772. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. la Marca, S. Malvagia, E. Pasquini, M. Innocenti, M. A. Donati, and E. Zammarchi Rapid 2nd-Tier Test for Measurement of 3-OH-Propionic and Methylmalonic Acids on Dried Blood Spots: Reducing the False-Positive Rate for Propionylcarnitine during Expanded Newborn Screening by Liquid Chromatography-Tandem Mass Spectrometry Clin. Chem., July 1, 2007; 53(7): 1364 - 1369. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. H. Arn Newborn Screening: Current Status Health Aff., March 1, 2007; 26(2): 559 - 566. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. B. Hanley False-positive Results in Expanded Newborn Screening Pediatrics, February 1, 2007; 119(2): 414 - 415. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||