From the Johns Hopkins Bloomberg School of Public Health, Department of Health Policy and Management, Baltimore, Maryland
| ABSTRACT |
|---|
|
|
|---|
Methods. Children who were younger than 2 years were followed up
4 years later, after they had entered school. The likelihood of significant behavior problems at follow-up was estimated in multivariate analyses that controlled for baseline spanking frequency and other characteristics. Participants were mothers from a large-scale national study and their children. Statistical analysis included an ethnically diverse sample of 1966 children aged 0 to 23 months at baseline. Two dichotomous indicators of behavior problems were used. The first indicated that maternal rating of child behavior problems exceeded a threshold. The second indicated that a mother met with a school administrator to discuss her child's behavior problems.
Results. White non-Hispanic children who were spanked more frequently before age 2 were substantially more likely to have behavior problems after entry into school, controlling for other factors. For Hispanic and black children, associations between spanking frequency and behavior problems were not statistically significant and were not consistent across outcome measures.
Conclusion. Among white non-Hispanic children but not among black and Hispanic children, spanking frequency before age 2 is significantly and positively associated with child behavior problems at school age. These findings are consistent with those reported in studies of children older than 2 years but extend these findings to children who are spanked beginning at a relatively early age.
Key Words: spanking physical punishment behavior problems parenting
Abbreviations: NLSY-MC, National Longitudinal Survey of Youth Mother-Child Sample BPI, Behavior Problems Index ME, marginal effect
In the United States, spanking is 1 of the most widely used practices for disciplining preschool-aged children. Approximately 94% of 3- and 4-year-old children have been spanked at least once during the past year.1 Although not as common as among older children, a substantial minority of parents report spanking infants and toddlers. In a national survey conducted by the Commonwealth Fund, 11% of parents reported having spanked a child 6 to 11 months of age, 36% reported having spanked a child 12 to 17 months of age, and 59% reported having spanked a child 18 to 23 months of age.2 This raises the concern that spanking a child at these ages may not achieve the benefits claimed for its use with children aged 2 and older and could negatively affect developmental transitions that take place before age 2.
Although there have been numerous empirical studies of spanking's effects, almost all previous studies have been based on children aged 2 and older. In reviews of this research,3,4 Larzelere concluded that on balance, the evidence suggests that "nonabusive" and "customary" use of spanking for children 2 to 6 years has neutral effects on child emotional well-being and may have beneficial effects on child behavior, because it may increase child compliance and it may reduce the need for future punishments by deterring children's misbehavior. Also, citing research showing a positive association of spanking frequency with child behavior problems for white (ie, European American) children but not for children from African American backgrounds, Larzelere concluded that the effects of spanking probably depend on how spanking is used and on its "normative acceptance" within particular cultures. These conclusions have been endorsed by several observers,57 whereas opposing opinions have been expressed by others,811 including many pediatricians.12
Even if one accepts these conclusions of earlier reviews, it may be invalid to extend conclusions that are based on evidence for children aged 2 and older to children younger than 2. Compared with older children, infants and young toddlers have relatively limited capacity to understand the intent of and rationale for punishment, they have limited capacity for verbal communication, and they have limited capacity to plan behaviors that comply with parents' verbal directives. These factors suggest that infants and young toddlers could be particularly vulnerable to emotional trauma and stress as a result of punishment, including spanking, because developmentally they are less equipped to understand punishment and to change their behavior to comply with their parents' expectations.
Moreover, introduction of physical punishment before age 2 could be riskier than at older ages, because before age 2, children undergo several fundamental transitions in emotional and cognitive development, including the initiation of relationships with adult caregivers and the development of a sense of reliance on adults for safety and security.13,14 If spanking is applied too frequently or too severely during these transitions or if its use is increased by other aversive circumstances within a household, such as acrimony, stress, or poverty, then it could inhibit a child's development of trust and feelings of security with parents. This possibility is a concern, because establishment of an emotionally secure and affectionate relationship with parents early in childhood is thought to influence the quality of their later interactions,1417 and because children who do not develop a secure attachment to their parents before age 2 are at greater risk for adjustment problems at preschool ages and later, including persistent behavior problems and poorer competence with peers.1821
These considerations suggest the need for empirical evidence on the association of spanking before age 2 with later child behavior problems. As far as we are aware, only 1 study of spanking and later child behavior problems has even included children younger than 2. Johannesson22 found no relationship between the extent of spanking ("never" to "constantly") at the study baseline, when children were 2 years of age or younger, and teacher ratings of child conflict with peers at ages 10 to 13 years. However, that study did not provide information that is relevant to children in the United States, because it was based on a small convenience sample of Swedish children (N = 212) who were born in the mid-1950s, and did not adjust for other possibly important influences such as child temperament and family socioeconomic status.
In this article, we explore the hypothesis that spanking frequency before age 2 is positively associated with the probability of having significant behavior problems 4 years later, after entry into school. Behavior problems after the transition into school is of interest because the presence of significant behavior problems early in one's school career is a risk factor for several adverse outcomes, including school failure, need for special educational services, and mental health problems.2327 In multivariate analyses, we controlled for several factors that could confound the importance of spanking as a risk factor for child behavior problems. Nevertheless, it is important to state at the outset that the evidence presented here is correlational and therefore can neither confirm nor rule out a causal effect of spanking on later child behavior.
| METHODS |
|---|
|
|
|---|
75 000 dwellings, with oversampling of black, Hispanic, and economically disadvantaged white households. Beginning in 1986 and continuing every 2 years, the NLSY has administered supplementary interviews to all female respondents who have children. These supplements include all children younger than 21. The interviews focus on the home environment and on child development. Data collected during these interviews compose the NLSY-MC. Our analysis included children younger than 24 months at the time of interview in 1986, 1988, 1990, 1992, or 1994. This interview was chosen as this study's baseline. Interviews that occurred
4 years later, ie, the second biennial interview after baseline in 1990, 1992, 1994, 1996, or 1998, became this study's follow-up. The NLSY-MC has 4845 children younger than 24 months at baseline. From these children, 2879 were excluded from the analysis (Table 1). Follow-up interviews were not completed for 1010 children. Also, we excluded from our analysis 1104 children who had not yet attended preschool or elementary school at the time of the follow-up interview so that we could examine a school-based measure of behavior problems. Other substantial exclusions were attributed to missing values for child behavior problems (477 children), use of spanking (101 children), and temperament score (100).
|
Measures
Behavioral Outcomes
We used 2 different measures of significant child behavior problems at follow-up. First, we created a dichotomous indicator for scores exceeding a threshold score on the Behavior Problems Index (BPI).29 The BPI was adapted from the Child Behavior Checklist30 and other scales.28 It aggregates mother ratings of the frequency of 27 child behavior problems during a 1-year reference period for children at least 4 years of age. Bradley et al31 reported a test-retest correlation for the total BPI of .92 and a Cronbach
of .89 among young children in the NLSY-MC. The BPI was validated against data from the 1981 National Health Interview Survey.28 Using standardized BPI scores, we created an indicator equal to 1 for children with BPI standardized scores >119, which represents approximately the top 10% of children in the United States ranked by ratings of behavior problems. Thus, children with BPI scores >119 have more frequent and/or more severe behavior problems than 90% of children in the United States.
The second measure of behavior problems was a dichotomous indicator equal to 1 when a mother reported that she had ever been asked to come to her child's school to discuss her child's behavior problems. The precise wording of the interview question is, "Has your child ever had any behavior problems at school resulting in your receiving a note or being asked to come in and talk to the teacher or principal?" This measure was included in the analysis because of concern that BPI ratings may be subjective and therefore unreliable as a measure of actual child behavior. Compared with the BPI, the school problems measure presumably leaves less room for a subjective response.
Spanking Frequency
Spanking frequency in the previous week was reported by mothers. The questionnaire asks mothers whether they spanked their child in the past week, and a follow-up question asks mothers to report how many times they spanked their child. The NLSY does not request information on spankings delivered by fathers, father figures, or other adults in the household and does not ask about the severity or type of spanking used by parents.
Infant Temperament
Child temperament at baseline was rated by mothers using the How My Infant Usually ActsDifficult Temperament scale, which was developed specifically for the NLSY on the basis of Rothbart's Infant Behavior Questionnaire32 and other temperament scales.33 A higher score indicates a more difficult temperament (or perception thereof). Although little information is available on the validity and the reliability of this measure, it has been demonstrated that the NLSY's infant temperament scales are significantly correlated with parent ratings of behavior problems among school-aged children.33
Parent-Child Interaction
The frequency of parent-to-child reading and interviewer observations of positive mother-child interactions served as proxy measures of cognitive stimulation and emotional support, respectively. Mothers reported whether they read books to their child "never," "several times a year," "several times a month," "once a week," "about 3 times a week," or "every day." Greater reading frequency presumably indicates greater stimulation and may also be indicative of more frequent positive parent-child interaction. A dichotomous indicator for frequent readingie, "about 3 times a week" or "every day"was created. Approximately 54% of mothers met this condition. The frequency of positive parent-child interactions was also recorded by NLSY interviewers. Specifically, interviewers recorded whether during the baseline interview mothers "spontaneously spoke to child twice or more (excluding scolding)"; "responded verbally to child's speech"; and/or "caressed, kissed, or hugged child at least once." On the basis of these interviewer observations, a dichotomous indicator for mothers who exhibited all 3 of these behaviors was created. Approximately 60% of mothers met this condition.
Other Parent, Child, and Household Characteristics
Maternal age and marital status, child age and gender, household income, and indicators for year of interview were also included as explanatory variables. Low maternal age at birth has been associated with several risk factors for poor child outcomes, including authoritarian child-rearing practices34 and greater abruptness in parent-child interactions.35 Household income and mother's educational attainment are proxies for a variety of differences between households, including family stress level, quality of the home environment, and child behavior.3638 Marital status, especially father absence from the household and previous divorce, has been associated with behavior problems in young children and with parent stress level.3941
Analyses
Probit models42 were estimated for each dependent variable using both the full sample and 3 racial/ethnic subsamples: white non-Hispanics, black non-Hispanics (henceforth African Americans), and Hispanics. The importance of separate analyses by race/ethnicity was indicated by previous research showing racial/ethnic differences in the association of physical discipline with child behavior problems, with associations tending to be positive for European Americans but not for African Americans.4346 Likelihood ratio statistics47 were used to test the null hypothesis that probit model coefficient estimates were the same across the 3 racial/ethnic subsamples. Rejection of (or failure to reject) this hypothesis was used to determine whether separate analyses by race/ethnicity provided a better fit to the data.
Spanking frequency, ie, the reported number of spankings in the past week, was entered in the probit models as a quadratic term by including both spanking frequency and the square of spanking frequency. This specification was suggested by past research showing that the relationship between spanking frequency and child behavior problems is nonlinear, with larger marginal effects observed at greater spanking frequencies, ie, a convex relationship between frequency and behavior problems.4,44 We were also concerned that spanking frequency could be related to qualitative differences in the type of spanking that occurred. For example, research suggests that slaps that are light to moderate in severity have a qualitatively different and perhaps less adverse effect on children than more severe slaps.4 In the NLSY-MC data, mothers who reported very high spanking frequencies (eg, 20 in the previous week) could have been more likely to have been using frequent but relatively light slaps than mothers who reported only 1 or 2 spankings. If this is the case, then we would expect to observe a concave relationship (ie, a positive but decreasing association) between spanking frequency and child behavior problems.
For ease of interpretation, probit regression coefficient estimates were converted to "marginal effects": the estimated change in proportion of children who exhibited the outcome of interest given a change in the value of an explanatory variable.48 For continuous explanatory variables (eg, spanking frequency, age, income, temperament ratings), the marginal effect (ME) represents the estimated effect of a 1-unit increase in value of a predictor on the expected proportion of children who had BPI scores >119 and/or who had behavior problems in school that required a parent-teacher meeting; for dichotomous indicator variables, the ME represents the estimated effect of a change in value of an indicator from 0 to 1 on the proportion of children with those outcomes.
Finally, probit model variances were estimated using the Huber-White, or sandwich, variance estimator4951 to adjust for clustering effects. Some measures could have been correlated across observations, because the sample included >1 child per mother on average. Specifically, of the 1540 mothers represented in the sample used here, 1165 (75.7%) mothers had 1, 330 (21.4%) mothers had 2, 39 (2.5%) mothers had 3, and 6 (0.4%) mothers had 4 children in the sample. The adjusted variances adjust for intracorrelation by applying unequal weights to observations in different clusters and equal weights within each cluster.
| RESULTS |
|---|
|
|
|---|
|
2 = 9.807, P = .007) and with the proportion of children with BPI scores >119 (
2 = 8.901, P = .012). However, associations differed across the 3 racial/ethnic subgroups. In white families, spanking frequency was significantly and positively related to the proportion of children with behavior problems that required a parent-teacher meeting (
2 = 17.768, P < .001); in African American families, the association was slightly positive and not statistically significant (
2 = .666, P = .717); and in Hispanic families, the association was slightly negative and not statistically significant (
2 = .010, P = .995). Also, in white families, spanking frequency was positively and significantly associated with the proportion of children with BPI scores >119 (
2 = 6.662, P= .036); in African American families, there was no apparent association of spanking frequency with BPI scores >119 (
2 = .544, P = .762); and in Hispanic families, the association was positive but not statistically significant (
2 = 1.074, P = .584).
|
2[16] = 40.398, P = .0007 in the model for behavior problems requiring a parent-teacher meeting, and
2[16] = 30.981, P = .0135 in the model for BPI scores >119), indicating that separate models for each racial/ethnic subgroup provided a better fit to the data.
|
|
Probability of BPI >119
In Table 5, among white non-Hispanic children, the coefficient estimate for spanking frequency at baseline was positive and statistically significant (ME: 0.044; P = .003), and the coefficient estimate for the square of spanking frequency was negative and statistically significant (ME: 0.005; P = .022), indicating a positive and concave relationship of spanking frequency with the probability of a BPI rating >119 at follow-up. Among African American children, spanking frequency was positively associated with risk for a BPI rating >119, but the coefficient estimate was not statistically significant (ME: 0.023; P = .251). The coefficient estimate for the square of spanking frequency (ME: .002; P = .871) also was not significant. Among Hispanic children, spanking frequency was positively associated with risk for a BPI rating >119, but the coefficient estimate was not statistically significant (ME: 0.027; P = .253). The coefficient estimate for the square of spanking frequency (ME: 0.006; P = .234) also was not significant.
Predictions From Multivariate Estimates
Figures 1 and 2 summarize the multivariate estimates (from Tables 4 and 5) of the relationship of spanking frequency with the proportion of children whose behavior problems required a parent-teacher meeting and with the proportion of children with a BPI rating greater than the 90th percentile of the US child population (BPI >119), respectively. Spanking frequencies ranging from 0 to 5 times in the past week were used in predictions; this range represents the central 95% of the spanking frequency distribution reported in our NLSY-MC sample.
|
|
2 spankings in the previous week; at greater spanking frequencies, the relationship was negative. One other characteristic of the associations represented in Figs 1 and 2 is notable. The probit models predicted that, compared with African American and Hispanic children, white children were relatively unlikely to meet either of the 2 criteria for behavior problems when they were not spanked. However, when spanking was relatively frequent, the proportion of white children who were predicted to have behavior problems that required a parent-teacher meeting was greater than the proportion of African American and the proportion of Hispanic children who were predicted to meet that criterion; also, when spanking was relatively frequent, the proportion of white children who were predicted to have BPI ratings >119 was greater than the proportion of Hispanic children and was similar to the proportion of African American children who were predicted to exceed that threshold.
| DISCUSSION |
|---|
|
|
|---|
4 years later, after they had entered school. Among children in this sample, spanking frequency before age 2 was a substantial predictor of a child's risk for behavior problems at school age for white non-Hispanic children only. For example, marginal effects estimates suggested that compared with the white non-Hispanic children who were not spanked in a week-long period before age 2, white non-Hispanic children who were spanked 5 times in a week had an
4.2-times greater risk of behavior problems that required a parent-teacher meeting 4 years later (0.121 vs 0.029) and had an
2.3-times greater risk of maternal ratings of behavior problems exceeding the 90th percentile 4 years later (0.159 vs 0.068). Among Hispanic and African American children, spanking frequency before age 2 was not consistently associated with child behavior 3 years later, although some results (shown in Fig 2) were suggestive of a positive association in these families. These associations are consistent with those reported in previous studies of European American and African American children older than 2. Deater-Deckard et al43 reported that spanking was positively correlated with child externalizing behaviors in subsequent years among European American children but was not significantly correlated with behavior problems among African American children. Gunnoe and Mariner45 reported that spanking frequency at ages 5 to 11 was significantly associated with an increase in fighting 5 years later among European American children but was also significantly related to a decrease in fighting among African American children. Finally, McLeod et al,46 who also used data from the NLSY-MC, reported that spanking frequency at ages 4 and above predicted significantly greater antisocial behavior among European Americans 2 years later, but the association was not statistically significant among African Americans.
Several possible explanations for these differences in association have been proposed.4,44 Spanking is thought to have greater "normative acceptance" in African American families,7,52,53 which could mean that African American children and parents are relatively less likely to perceive spanking as harsh or unfair. Alternatively, in white non-Hispanic communitieswhere spanking is used less frequently and where its use is thought to be more stigmatizingfrequent spanking before age 2 could be associated with other factors that indicate relatively greater developmental risk, such as high parent stress. Both interpretations are consistent with our finding (shown in Fig 1) that compared with African American and Hispanic families, child behavior problems that require a parent-teacher meeting were predicted to be relatively less common in white non-Hispanic families when spanking before age 2 was infrequent, whereas the relative ordering was the reverse in families in which spanking was relatively frequent. However, in predictions of risk for behavior problem ratings above the 90th percentile (Fig 2), African American children were predicted to be at greater risk at all spanking frequencies, suggesting that in both groups, more frequent spanking was associated with greater behavioral risk.
With respect to the proposition that spanking children younger than 2 is potentially detrimental to their development, our findings should be viewed cautiously. First, although greater spanking frequency was a substantial risk factor for behavior problems in white non-Hispanic families, even relatively frequent spanking before age 2 did not represent an overwhelming risk for significant behavior problems 4 years later. More than 87% of white non-Hispanic children in this sample who were spanked 5 times in a week-long period before age 2 were not expected to have behavior problems that require a parent-teacher meeting, and >84% of these children were not expected to have behavior problem ratings exceeding the 90th percentile of the US population.
In addition, the significant associations found for white non-Hispanic children could be partially or completely attributable to correlation of spanking frequency with other problems in family functioning and home environment that adversely affect child outcomes. Although we adjusted for several observable factors that may be confounded with a causal effect of spankingincluding mother's educational attainment and marital status, child temperament, family income, independent observation of parent-child interaction, and frequency of parent-child readingmany other aspects of parent-child interactioneg, the emotional tone of the household, other aspects of parenting, the category of child-parent attachmentwere not captured in our study. For example, Larzelere4 emphasized the importance of measuring all forms of punishment, not just spanking, although other forms of punishment, such as time-outs, may be far less common among children younger than 2. For these reasons, there is no way to tell whether the significant association of spanking frequency before age 2 and behavior problems in white non-Hispanic children was indicative of a causal relationship between spanking and child outcome.
Nevertheless, it is worth pointing out that spanking is 1 concrete experience that mediates interactions between parents and children and therefore is both a proxy for characteristics within a family that promote its use and a modifierfor better or worseof subsequent developmental risks. Moreover, because the specific context within which spanking (and other forms of punishment) is used often involves parent feelings of irritation, anger, or frustration, which can be mild or more severe, children and parents could associate spanking with underlying sources of intrafamily conflict that promote its use, eg, negative parent-child attitudes, parent level of distress, marital discord.5459 These factors might also influence how spanking is ultimately perceived by both parties and therefore its subsequent effects on children.
Finally, our results suggest that studies of children older than 2 need take into account the possible effects of spanking and other punishments that began at earlier ages, although most have not. We found that
40% of families reported spanking their infant or toddler at least once in the previous week, and spanking frequency at this age was associated with child behavior problems at an older age. Thus, in studies of children older than 2, failure to account for effects of punishments from earlier ages could result in incorrect inferences about the effects of punishments received at ages beyond 2. In fairness, although previous evidence suggests that spanking is more normative and not detrimental at ages 2 to 6 years,4 the converse argument can be applied to this study in that we do not account for the possible effects of spanking received after age 2. However, that actions that take place during parent-child interactions are often recurrent indicates a need for research on this topic that models the dynamics of child behavior and parent response, starting at a relatively early point in childhood.
Limitations
Several limitations of the data and of the analysis may affect interpretation of these results. First, fathers' and other adults' participation in spanking children was not measured, so it is not known whether children's behavioral outcomes were associated with spanking by mothers, fathers, or both. This limitation of the data also may have influenced direction and magnitude of the estimated associations between spanking frequency and behavior problems; for example, compared with other children, children who were not spanked in the previous week might have been more likely to be spanked by fathers, and these unreported spankings may have influenced these children's later behavioral outcomes relative to children who are spanked more frequently. Second, no information was available on other punishments used by parents, although among children younger than 2, other forms of punishment are unlikely to have been common. Other forms of punishment may also have been associated with subsequent child behavior problems, but it was not possible to explore those associations using these data. Third, ratings of behavior problems were reported by mothers and were not confirmed by independent observation. However, subjective factors that may have affected parent ratings of child behavior may have had less influence on parent report of factual events, such as the occurrence of a parent-teacher meeting to discuss child behavior. Fourth, it is notable that fewer than half of the 1966 children were in the Hispanic and African American subsamples, suggesting that in these subsamples there was limited power to detect associations of spanking frequency with behaviors that applied only to 8% to 13% of the overall sample.
Fifth, many age-eligible children in the NLSY-MC data were excluded from this analysis because of missing information, representing 59.5% of age-eligible children. Most of these exclusions resulted from the absence of a follow-up interview, discontinuation of the "economically disadvantaged white" subsample after 1990, and the restriction that children had to have attended school before the age of follow-up. These factors tended to bias the sample toward greater African American and Hispanic composition, toward slightly greater average socioeconomic status, and toward greater average child age. These biases might have reduced the magnitude of the association of spanking frequency with child behavior problems, because spanking frequency and behavior problems are likely to have been relatively more common among low-income whites whose children were excluded from the sample.
| CONCLUSION |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Reprint requests to (E.P.S.) 624 North Broadway, Rm 433, Baltimore, MD 21205-1901. E-mail: eslade{at}jhsph.edu
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
K. J. Ripoll-Nunez and R. P. Rohner Corporal Punishment in Cross-Cultural Perspective: Directions for a Research Agenda Cross-Cultural Research, August 1, 2006; 40(3): 220 - 249. [Abstract] [PDF] |
||||
![]() |
A. Grogan-Kaylor Corporal Punishment and the Growth Trajectory of Children's Antisocial Behavior Child Maltreat, August 1, 2005; 10(3): 283 - 292. [Abstract] [PDF] |
||||
![]() |
Spanking infants is associated with behaviour problems BMJ, July 24, 2004; 329(7459): . [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||