Genome-Wide Association and Exome Sequencing Study of Language Disorder in an Isolated Population
BACKGROUND AND OBJECTIVE: Developmental language disorder (DLD) is a highly prevalent neurodevelopmental disorder associated with negative outcomes in different domains; the etiology of DLD is unknown. To investigate the genetic underpinnings of DLD, we performed genome-wide association and whole exome sequencing studies in a geographically isolated population with a substantially elevated prevalence of the disorder (ie, the AZ sample).
METHODS: DNA samples were collected from 359 individuals for the genome-wide association study and from 12 severely affected individuals for whole exome sequencing. Multifaceted phenotypes, representing major domains of expressive language functioning, were derived from collected speech samples.
RESULTS: Gene-based analyses revealed a significant association between SETBP1 and complexity of linguistic output (P = 5.47 × 10−7). The analysis of exome variants revealed coding sequence variants in 14 genes, most of which play a role in neural development. Targeted enrichment analysis implicated myocyte enhancer factor–2 (MEF2)-regulated genes in DLD in the AZ population. The main findings were successfully replicated in an independent cohort of children at risk for related disorders (n = 372).
CONCLUSIONS: MEF2-regulated pathways were identified as potential candidate pathways in the etiology of DLD. Several genes (including the candidate SETBP1 and other MEF2-related genes) seem to jointly influence certain, but not all, facets of the DLD phenotype. Even when genetic and environmental diversity is reduced, DLD is best conceptualized as etiologically complex. Future research should establish whether the signals detected in the AZ population can be replicated in other samples and languages and provide further characterization of the identified pathway.
- ACES —
- Academic Competence Evaluation Scales
- CNV —
- copy number variant
- DLD —
- developmental language disorder
- GWAS —
- genome-wide association study
- HWE —
- Hardy-Weinberg equilibrium
- MEF2 —
- myocyte enhancer factor–2
- MLM —
- mixed linear modeling
- PCR —
- polymerase chain reaction
- PolyPhen —
- polymorphism phenotyping
- QC —
- quality control
- ROH —
- runs of homozygosity
- SNP —
- single nucleotide polymorphism
- SNV —
- single nucleotide variant
- SVS —
- SNP & Variation Suite
- WES —
- whole exome sequencing
What’s Known on This Subject:
Genetic underpinnings of common forms of pediatric disorders of language are heavily understudied. Recent association studies identified several tentative candidate genes. However, thus far, none of these candidates has received strong support in replication or confirmation analyses.
What This Study Adds:
We established a statistically significant association between SETBP1 and language disorders in a geographically isolated population. Whole exome sequencing convergently implicated the myocyte enhancer factor–2–regulated pathways (of which SETBP1 is part) in language disorders in this special population.
Developmental language disorder (DLD) is a prevalent neurodevelopmental disorder, with 7-10% of children1,2 exhibiting atypical patterns of language development despite not having apparent sensorimotor/cognitive impairments or other medical conditions.3 DLD is lifelong,4 comorbid with other neurodevelopmental5 and psychiatric6 disorders, and associated with adverse academic7 and socio-emotional8 outcomes. It is phenotypically complex and genetically heterogeneous; although highly heritable,9 the etiology and pathogenesis of DLD are poorly understood.
A rare Mendelian type of DLD has been attributed to deleterious variants in the FOXP2 gene10–12 (7q31); however, it is not associated with the disorder’s common forms.13 For the latter, linkage studies have identified 3 susceptibility regions: 16q24, 19q13,14 and 13q2.15 Targeted association studies implicated CNTNAP216 (7q35; downregulated by FOXP2) and CMIP and ATP2C217 (16q) genes in phonological memory deficits. Four genome-wide association studies (GWAS) divulged no genome-wide significant signals,18–21 with the exception of gene-based associations for CDC2L1, CDC2L2, LOC728661, and RCAN3.19 A whole exome sequencing (WES) study of DLD in an admixed Chilean founder population suggested the involvement of a nonsynonymous single nucleotide variant (SNV) in NFXL122; however, its location is not in the previously identified linkage regions in this population.23
This pattern of findings highlights the complexity of DLD’s etiology, driven by the exclusionary nature of the diagnosis, the multicomponential nature of the phenotype, and the heterogeneity of the samples studied. The main objective of the present study was to identify genetic bases of DLD in a unique population (small, geographically secluded, and with an elevated prevalence of DLD [hereafter, the AZ population]) in which genetic and environmental variability is constrained. Genetic profiles of isolated populations are characterized by restricted genetic and allelic heterogeneity, thus rendering them ideal for studying the genetic bases of complex disorders.24
The study population resides in a remote cluster of villages in Russia’s rural north; it was founded in the 15th century by several nuclear families. Currently, the AZ population comprises ∼860 individuals (∼120 children aged 3–18 years). Of these, 74.6% are represented by a set of multigenerational family structures (6391 individuals), of whom 82% are interconnected through a single 11-generational pedigree. The environmental conditions in the population are relatively uniform: all children go to the same kindergarten and school, and the socioeconomic indicators such as parental education and income show little variation. The AZ population is relatively geographically isolated and is characterized by an atypically high prevalence of DLD25 (ie, ∼30% compared with 9% in the control rural population). This finding suggests the presence of a shared genetic component, potentially attributable to the founder effect(s).
Population and Sample
Altogether, 474 AZ individuals donated DNA. When considered in combination with first-degree relatives, 405 of these donors represented 79 nuclear and extended pedigrees (N = 1152; range, 3–474; median, 6). Of these, 359 underwent phenotyping and constituted the GWAS sample: 124 children (62 male subjects; age, 5.33–17.92 years) and 235 adults (102 male subjects; age, 18.83–83.42 years). A total of 149 were classified as affected (DLD) and 210 as typically developing individuals (Supplemental Information).
Phenotyping was performed by clinical linguists using elicited semi-structured speech samples. These samples were scored by using previously described phenotyping procedures25 to produce 5 quantitative phenotypes representing the major facets of DLD: phonetic/prosodic characteristics (eg, phonological omissions, misarticulations); well-formedness (rate of grammatical/lexical errors); complex structures (frequency of complex syntactic structures); mean length of utterance in words; and semantic/pragmatic errors (rate of errors in sentence meaning). Age-adjusted z scores were computed by using data from healthy control subjects from the comparison population to determine impairment status (ie, a z score below –1). Individuals were classified as overall DLD if they met the impairment criterion for ≥2 facets. Principal component analysis revealed that the 5 phenotypes formed 2 independent components: linguistic errors (phonetic/prosodic characteristics, well-formedness, and semantic/pragmatic errors) and syntactic complexity (complex structures and mean length of utterance in words).
Single Nucleotide Polymorphism Genotyping
The DNA extracted from peripheral blood (n = 384) or saliva/buccal swabs (n = 21) underwent quality control (QC) assessment for purity and degradation after standard collection, storage, and extraction procedures recommended by the manufacturers (Qiagen N.V. [Hilden, Germany] and DNA Genotek, Inc [Ottawa, ON, Canada]), and prepared at a concentration of 50 ng/µL.
Samples were genotyped at the Yale Center for Genome Analysis using HumanCNV 370k-Duo (n = 315) or 610k-Quad (n = 90) BeadChips (Illumina, Inc, San Diego, CA). Language status and gender distributions across the plates were not statistically different from random. Allele calling was performed by using the GenCall algorithm in GenomeStudio version 2011.1.
Samples and markers underwent QC review with GenomeStudio and SNP & Variation Suite (SVS) version 7.7.8 (GoldenHelix, Inc, Bozeman, MT). Samples with call rates >95% and verified gender were retained. A total of 223 580 autosomal single nucleotide polymorphisms (SNPs) common to 2 genotyping platforms were retained after QC so that the GenCall score was >0.30, the call rate was >95%, and minor allele frequency was >1%.
Whole Exome DNA Sequencing
Four subpedigrees were chosen for WES based on the results of complex segregation analysis26 that suggested possible Mendelian transmission. From these subpedigrees, 12 severely affected individuals were selected. Nine control non-AZ individuals without DLD from the same geographical region also underwent sequencing.
Exome capture was completed by using NimbleGen EZ Exome SeqCap v2 (Roche NimbleGen, Madison, WI). One microgram of fragmented genomic DNA was used to prepare the library using the manufacturer’s protocol (Supplemental Information). The bar-coded libraries were sequenced by using Illumina’s HiSeq 2500 platform, producing 75-bp paired-end reads that were aligned to the hg19 human genome build using NovoAlign (http://www.novocraft.com; Novocraft Technologies Sn Bhd, Selangor, Malaysia). Variant calling was performed jointly for all samples by using the HaplotypeCaller algorithm in GATK.
Genetic Association Analysis
All of the quantitative trait loci association analyses were performed within the AZ sample for a set of 5 quantitative phenotypes. SNP-based association analysis of age- and gender-adjusted quantile-normalized phenotypes was performed by using mixed linear modeling (MLM) as implemented in GEMMA27 version 0.94. MLM tests for genetic association of SNPs with quantitative traits were performed under the additive model while controlling for sample structure estimated directly from data as a genetic relatedness matrix. MLM can be considered an example of the de-correlation approach to family-based data, and we chose it as our analytical framework for several reasons. First, although a number of transmission-based approaches (eg, family-based association testing, FBAT) have been developed, their use in a large complex multigenerational pedigree is problematic and computationally intensive in the presence of missing genotypic or phenotypic data, requiring splitting the larger pedigree into smaller units; this approach, coupled with conditioning on the founders’ genotypes, can lead to a loss of power. Second and most importantly, comparative studies suggest that decorrelation approaches (and MLM among them) tend to have higher (or at least comparable) statistical power than transmission approaches even in large and complex pedigrees.28
All 5 phenotypes were first used in a multivariate MLM analysis. Two multivariate MLMs were then fitted: 1 that modeled the genetic effects on the indicators of linguistic errors and the second that used syntactic complexity. We performed gene-based association analyses as implemented in KGG3 software29 version 3.0 by using the hybrid set–based test.
Copy number variant (CNV) association analysis was performed in the FBAT30 framework. Samples underwent additional CNV-specific QC (Supplemental Information). CNVs were identified by using a univariate Copy Number Analysis Method algorithm as implemented in SVS with a minimum of 5 markers per segment. Permutation testing was used to identify cut-points, and average segment intensity was used in the analyses.
Homozygosity mapping was completed by using the runs of homozygosity (ROH) detection algorithm in SVS. The minimum size was set to 250 kb and 25 SNPs, allowing for up to 1 heterozygote and 5 missing genotypes. The maximum gap between SNPs was 100 kb, and the minimum density was 18 kb. The total length of ROHs in 5 different length brackets was log-transformed to ensure normality before analysis. ROH association and burden analyses were performed by using univariate linear and logistic regression in R (R Foundation for Statistical Computing, Vienna, Austria).
WES data were analyzed by using a set of annotation and filtering tools. This analysis assumed that the most severely affected individuals in the AZ population from familial substructures with suggestive evidence for Mendelian transmission could provide additional information about the genetic architecture of DLD in the sample by focusing on: (1) the coding variants in candidate genes highlighted in the larger GWAS sample; or (2) the disruptive coding variants in other genes that could be conferring additional DLD risk in a subsample of the AZ population. Thus, we focused on coding sequence variants that were frequent among severely affected AZ probands (present in at least 4 of the 12 affected AZ individuals) but were not present in the control sample of 9 exomes. We then excluded variants observed in >5% of the National Heart, Lung, and Blood Institute Go Exome Sequencing Project (https://esp.gs.washington.edu) and the 1000 Genomes project (http://www.1000genomes.org/) Phase 1 exomes. We then retained only those variants that were located within the genes associated with any of the phenotypes in the GWAS (P < .05 for gene-level tests), disruptive frameshift variants, and variants prioritized by eXtasy31 based on the fusion of the information about their pathogenicity, haploinsufficiency predictions, and similarity to other genes linked to related phenotypes (Supplemental Information).
P values were corrected by using either standard or adjusted Bonferroni procedures (Supplemental Information). The study protocol was approved by the Yale University (New Haven, CT) and Northern State Medical University (Arkhangelsk, Russia) internal review boards.
Genome-wide SNP Associations
No single SNP reached genome-wide statistical significance (Fig 1). Table 1 lists the top 10 nominally significant SNPs for each analysis. For linguistic errors, the strongest association (P = 5.35 × 10−7) was for rs3787751 (21q22), located in the noncoding region of the HLCS (holocarboxylase synthetase) gene, involved in the biotinylation of apocarboxylases. Holocarboxylase synthetase deficiency syndrome (MIM#253270) is characterized by neurologic, developmental, and metabolic abnormalities in infancy.32 For syntactic complexity, the top 10 SNPs included 4 SNPs (rs378968, rs3789867, rs2480933, and rs2482078) located in intronic regions of the TNC gene on chromosome 9q33 (Fig 2). TNC codes for an extracellular matrix protein implicated in cochlear development33 and autosomal dominant deafness (MIM#615629). The univariate GWAS analyses produced similar results (Supplemental Information).
We found no genome-wide significant gene-based associations for the 5-phenotype multivariate analysis or linguistic errors. Importantly, such an association was established between SETBP1 (SET binding protein 1; 18q21) and the multivariate syntactic complexity phenotype (P = 5.47 × 10−7) (Fig 2). The nuclear protein encoded by SETBP1 binds the SET nuclear oncogene protein involved in DNA replication, apoptosis, transcription, and nucleosome assembly. Rare variants in SETBP1 are associated with Schinzel-Giedion syndrome (MIM#269150) characterized by severe developmental delays.
Table 2 presents the top 10 genes for the gene-based analyses. After SETBP1, the 2 next strongest associations with syntactic complexity were found for 2 genes on chromosome 11q23: PPP2R1B (P = 4.77 × 10−5), encoding a constant regulatory subunit of protein phosphatase 2A, and SIK2 (P = 5.00 × 10−5), a gene hypothesized to play a role in neuronal protection. These findings are likely driven by the top hit SNP rs585149 (P = 1.70 × 10−5), assigned to both genes and located in the 3′-UTR region of SIK2. We also found a nominally significant association of syntactic complexity with TNC (P = .0068).
Nominally significant associations were also established between linguistic errors and several genes (ABCG4, HYOU1, and HINFP) 7 Mb away from PPP2R1B (11q23); these genes and DPAGT1 and H2AFX were associated with the combined multivariate DLD phenotype, with the top hits being rs639373 (P = 1.21 × 10−5) and rs643788 (P = 1.22 × 10−5). The functional significance of the ABCG4 product is unknown; the DPAGT1 product is crucial for glycoprotein biosynthesis; and H2AFX encodes a histone involved in the maintenance of chromatin structure. The transcription factor encoded by HINFP plays an important role in DNA methylation. We found an association between the combined multivariate phenotype and estrogen-receptor 1 (ESR1; P = 4.76 × 10−5), with rs722208 (P = 3.09 × 10−6) as a top hit. There was a nominally significant association between linguistic errors and the HLCS gene (P = 4.40 × 10−5). Neither linguistic errors nor syntactic complexity was associated with previously identified candidate DLD genes.
CNV Analysis and Homozygosity Mapping
The multivariate FBAT CNV analysis revealed several nominally statistically significant and 1 highly statistically significant CNV. However, follow-up confirmation using real-time polymerase chain reaction (PCR) failed to substantiate the presence of these CNVs. An alternative pipeline that integrated 3 CNV detection algorithms yielded no genome-wide significant associations (Supplemental Information).
Overall, AZ-affected individuals, compared with unaffected individuals, had longer cumulative lengths of ROHs that were 250 to 500 kb long (P = .006) and 1000 to 1750 kb long (P = .004), corresponding to ∼10% and ∼1% increases in estimated autosomal homozygosity, respectively (Supplemental Fig 3). The association analysis did not reveal any ROHs that were genome-wide significantly enriched in affected individuals. None of the top 20 regions overlapped with the regions identified in the SNP analyses. Several potentially relevant identified regions are discussed in Supplemental Information.
Whole Exome DNA Sequencing
We identified 14 coding sequence variants, frequent in affected AZ individuals: 4 frameshift indels, 1 inframe insertion, 2 stop gain/loss, and 7 missense variants (Table 3). SNVs were predicted by polymorphism phenotyping (PolyPhen) to be possibly or probably damaging. Although any or all of these 14 variants could be implicated in the etiology of DLD in AZ, 2 sets of findings deserve special attention.
First, multiple individuals in the AZ population carried coding sequence variants in genes that regulate neural development or are highly expressed in the brain; that is, a frameshift insertion in NT5DC2 (3p21.1) and missense SNVs in NECAB1 (8q21.3) and ILK (11p15.4). NT5DC2 has been implicated in schizophrenia34 and borderline personality disorder.35 NECAB1 is a member of the neuronal calcium-binding family of proteins essential to Ca2+-mediated signaling and is highly expressed in the temporal lobe.36 The protein encoded by ILK is 1 of the key regulators of neural stem cell astrocytic differentiation37 and neurite outgrowth.38 We also found that 7 (58%) of 12 individuals in the AZ population carried a known missense variant in CDH2 (18q12) that was found only at a 2% frequency in the 1000 Genomes data set. CDH2 codes for a major cadherin that is widely expressed prenatally in neural stem cells and supports their differentiation and migration,39 regulating the laminar organization of the cortex.40 Moreover, 7 of 12 AZ individuals carried a stop-gain variant in TCP10L2 (6q27). It is unknown whether TCP10L2 codes for a functional protein; it is highly similar to TCP10L, a primate-specific transcription factor thought to evolve via segmental duplication41 from TCP10L2 or TCP10.
Second, a missense SNV in TRIP6 (7q22.1) and a frameshift deletion in ENTHD1 (22q13) indicate commonalities between the genetic pathways identified through GWAS and WES. TRIP6 is a transcription factor that has been identified as a regulator of postnatal neural stem cell maintenance in the subventricular zone.42 ENTHD1 codes for ENTH domain-containing protein 1. ENTH domain-containing proteins are involved in synaptic vesicle endocytosis at nerve terminals at the crucial stages that precede synapse formation.43 Importantly, TRIP6 interacts with and ENTHD1 is upregulated by the same family of genes, myocyte enhancer factor–2 (MEF2), labeled MEF2A-D. MEF2 are transcription factors implicated in muscle and central nervous system differentiation. In addition to ENTHD1, MEF2 targets in human neural stem cells include SETBP1, TNC, and DKGB (3 genes highlighted by our GWAS), as well as individual genes (BDNF, DMD, and NCAM2) and gene families (cadherins, contactins, semaphorins, and serpins) implicated in (a)typical central nervous system development. A targeted formal analysis of gene list enrichment using the Enrichr tool44 suggested that, combined, GWAS and WES hits in this population are indeed enriched for MEF2 targets (for MEF2A, P = 1.28 × 10−6) (Supplemental Information), providing support to this hypothesis.
Our WES analysis also revealed the presence of 2 heterozygous missense mutations in SETBP1, carried by 2 (rs3744825) and 1 (rs1064204) sequenced AZ individual, respectively. Both were common (for European ancestry, minor allele frequency >10% in National Heart, Lung, and Blood Institute exome database) known SNPs, projected to be tolerated according to 5 different functional prediction algorithms.
We interrogated the main loci highlighted in the GWAS or WES analyses of DLD in the AZ population in an independent sample (n = 372) of children at risk for developmental disorders of language (spoken and written) by using teachers’ ratings of student’s spoken and written language skills as the main phenotype (details are given in the Supplemental Information). Association analysis controlled for age and gender and was performed by using EMMAX,45 a MLM algorithm implemented in SVS.
Both main findings were replicated. First, a significant gene-based association was found between language scores and SETBP1 (P = .009360). The top signal originated at exm1383999/rs11082414 (P = .000359), a missense SNP located within exon 4 of SETBP1 that explained 3.41% of the variance in children’s language skills. Predicted to be tolerated according to sorting intolerant from tolerant/PolyPhen, this SNP may play a role in the regulation of expression of SETBP1. The analysis of the Braineac46 brain expression quantitative trait loci database suggested that it differentiates levels of SETBP1 expression in the brain, including the cerebellar cortex, hippocampus, and temporal cortex.
Second, genes nominally associated (at P < .05) with teacher ratings of students’ spoken and written language skills were enriched for MEF2A targets (P = .0007024), replicating the finding from the discovery cohort.
We established a genome-wide association between syntactic complexity and the SETBP1 gene in the AZ sample and then replicated it in an independent sample. SETBP1 is relatively large (388 337 bp), has 2 isoforms, and is expressed widely. Although little is known about its function, it is implicated in several neurodevelopmental conditions: SETBP1 haploinsufficiency is documented in expressive DLD47–49 and intellectual disability.50 Moreover, several tentative SNP associations were found between syntactic complexity and TNC that encodes tenascin, an extracellular matrix glycoprotein involved in neural development; TNC-deficient mice exhibit structural and functional cortical abnormalities, including atypical neuronal density and abnormal dendrite morphology.51 However, the combined multivariate phenotype was also nominally associated with ESR1, a nuclear hormone receptor involved in regulation of gene expression, cell proliferation, and differentiation. Estrogen is involved in synaptogenesis, regulates neurotransmission, and modulates the activity of all types of neural cells.52 This finding is intriguing given the male bias in incidence of DLD and the recent report of associations between early postnatal gender hormone concentrations and later language development.53
Our WES highlighted 14 coding variants in a set of genes implicated in neural development and/or differentiation. Intriguingly, 2 of the WES-identified genes (ENTHD1 and TRIP6) and 3 of the GWAS-identified genes (SETBP1, TNC, and DKGB) interact with or are regulated by the MEF2 transcription factors. MEF2 isoforms are widely expressed in neural cells,54 and their activity is regulated by extracellular factors (eg, in neurons via neurotrophin stimulation or Ca2+ influx after the release of neurotransmitters). MEF2 targets show enriched expression in the central nervous system and implicate multiple signaling pathways, rendering MEF2 as a key regulator of activity-dependent synapse development.55 The complex transcriptional program of MEF2 results in the restriction of excitatory synaptic transmission via the reduction of the number of excitatory neurons, elimination of glutamatergic synapses,56 and postsynaptic differentiation of neurons (dendrite morphogenesis).57
The cascade of events regulated by the transcriptional activity of MEF2 is critical for learning and memory.58,59 A recent electrophysiological study partially attributed the DLD phenotype in the AZ population to atypicalities in the functioning of neural circuits that support attention and memory60 that were linked to syntactic complexity. It is plausible they at least partially stem from the dysregulation of common genetic pathways that orchestrate neural development.
This dysregulation can take multiple forms. Given the partial convergence of the results from the GWAS and WES, we hypothesized that the DLD phenotype in the AZ population emerged as the result of the interaction between common genetic variants that conferred background DLD susceptibility and rare variants that altered the development of language and memory circuits against that background. This extension of the threshold-dependent response model suggests that common variants in several genes (eg, SETBP1, TNC) formed the probabilistic landscape(s) of DLD vulnerability, and that coding variants in multiple different genes (eg, regulated by MEF2 such as such ENTHD1 and TRIP6 or other genes important for neural development such as CDH2 or NECAB1) conferred the critical amount of vulnerability and pushed this landscape into a critical state.
Finally, we established a higher rate of autosomal ROH burden among the affected AZ individuals compared with unaffected AZ individuals; this finding is not surprising given the isolated nature of the population and the role of ROHs in several developmental disorders.61 However, no single specific ROH was strongly associated with DLD. In addition, there was little overlap between the genetic loci identified in the GWAS analyses of the 2 multivariate phenotypes; this outcome raises an interesting hypothesis that the 2 global facets of DLD may be relatively independent at the level of their molecular neurobiology.
Our study has several limitations. First, it has a small sample size. Although it was modest for a GWAS study, however, the sample size was almost one-half of the total AZ population. Second, the unique nature of the population poses a complex issue for future research seeking to replicate these signals in other samples. Although we replicated the association finding for SETBP1 and the enrichment findings for GWAS-highlighted DLD genes for MEF2 targets in an independent sample of children at risk for a related disorder, further molecular and analytical studies in larger samples are necessary to better characterize the joint contribution of common and rare variants in the identified genes to DLD susceptibility and decipher the molecular pathways they affect.
This study presented a set of novel candidate genes and coding DNA sequence variants contributing to DLD phenotypes in the AZ population; the chief findings from this population have been replicated in an independent sample. Overall, the findings suggest that multiple genes (including a novel genome-wide significant candidate SETBP1) and genetic pathways (including the suggested MEF2-regulated pathway) are involved in DLD. This study underlines the complexity of the genetic architecture of DLDs and illustrates that even in populations with reduced genetic and environmental diversity, DLD is best conceptualized as a polygenic and etiologically complex disorder.
The authors thank the families who participated in the study for their cooperation and patience, and the local medical, kindergarten, and school officials of the AZ community for their help with data collection. They also thank Igor Pushkin, Anastasia Strelina, Liudmila Kniazeva, and various students, trainees, and employees from Northern State Medical University for their help with the logistics of the study; Dr Lesley Hart for her contributions to the early stages of the project; Drs Seongmin Han and Dean Palejev for their involvement at various stages of the project; Ms Mei Tan for her editorial assistance; and the late Dr Maria Babyonyshev for her contribution to the linguistic component of the study.
- Accepted January 26, 2016.
- Address correspondence to Elena L. Grigorenko, PhD, Child Study Center, Yale School of Medicine, 230 South Frontage Rd, New Haven, CT 06519. E-mail:
FINANCIAL DISCLOSURE: The authors have indicated they have no financial relationships relevant to this article to disclose.
FUNDING: Supported by National Institute of Health grants R01 DC007665 (Dr Grigorenko, Principal Investigator) and P50 HD052120 (Richard Wagner, Principal Investigator), NIH Centers for Mendelian Genomics (5U54HG006504), National Science Foundation Integrative Graduate Education and Research Traineeship grant 114399 (Dr Magnuson, Principal Investigator), and grant 14.Z50.31.0027 from the Government of the Russian Federation (Dr Grigorenko, Principal Investigator). Funded by the National Institutes of Health (NIH).
POTENTIAL CONFLICT OF INTEREST: The authors have indicated they have no potential conflicts of interest to disclose.
- American Psychiatric Association
- McGrath LM,
- Hutaff-Lee C,
- Scott A,
- Boada R,
- Shriberg LD,
- Pennington BF
- Im-Bolter N,
- Cohen NJ
- Palka C,
- Alfonsi M,
- Mohn A, et al.
- Villanueva P,
- Nudel R,
- Hoischen A, et al; SLI Consortium
- Villanueva P,
- Newbury DF,
- Jara L, et al.
- ↵S.A.G.E., Statistical Analysis for Genetic Epidemiology [computer program]. Version Release 6.3; 2012. Available at: http://darwin.cwru.edu. Accessed February 1, 2014
- Li D,
- Zhou J,
- Thomas DC,
- Fardo DW
- Sifrim A,
- Popovic D,
- Tranchevent LC, et al.
- Ripke S,
- O'Dushlaine C,
- Chambert K, et al.
- Prados J,
- Stenz L,
- Courtet P, et al.
- Mills J,
- Digicaylioglu M,
- Legg AT, et al.
- Zhang J,
- Woodhead GJ,
- Swaminathan SK, et al.
- Kadowaki M,
- Nakamura S,
- Machon O,
- Krauss S,
- Radice GL,
- Takeichi M
- Zhong Z,
- Qiu J,
- Chen X, et al.
- Ryan TA
- Filges I,
- Shimojima K,
- Okamoto N, et al
- Marseglia G,
- Scordo MR,
- Pescucci C, et al.
- Bouquillon S,
- Andrieux J,
- Landais E, et al.
- Irintchev A,
- Rollenhagen A,
- Troncoso E,
- Kiss JZ,
- Schachner M
- Lyons GE,
- Micales BK,
- Schwarz J,
- Martin JF,
- Olson EN
- Flavell SW,
- Cowan CW,
- Kim TK, et al.
- Shalizi A,
- Gaudillière B,
- Yuan Z, et al.
- Barbosa AC,
- Kim MS,
- Ertunc M, et al
- Cole CJ,
- Mercaldo V,
- Restivo L, et al.
- Gamsiz E D,
- Viscidi E W,
- Frederick A M, et al.
- Copyright © 2016 by the American Academy of Pediatrics