BGI Europe

Copenhagen, Denmark

BGI Europe

Copenhagen, Denmark
Time filter
Source Type

Francioli L.C.,University Utrecht | Menelaou A.,University Utrecht | Pulit S.L.,University Utrecht | Van Dijk F.,University of Groningen | And 93 more authors.
Nature Genetics | Year: 2014

Whole-genome sequencing enables complete characterization of genetic variation, but geographic clustering of rare alleles demands many diverse populations be studied. Here we describe the Genome of the Netherlands (GoNL) Project, in which we sequenced the whole genomes of 250 Dutch parent-offspring families and constructed a haplotype map of 20.4 million single-nucleotide variants and 1.2 million insertions and deletions. The intermediate coverage (∼13×) and trio design enabled extensive characterization of structural variation, including midsize events (30-500 bp) previously poorly catalogued and de novo mutations. We demonstrate that the quality of the haplotypes boosts imputation accuracy in independent samples, especially for lower frequency alleles. Population genetic analyses demonstrate fine-scale structure across the country and support multiple ancient migrations, consistent with historical changes in sea level and flooding. The GoNL Project illustrates how single-population whole-genome sequencing can provide detailed characterization of genetic variation and may guide the design of future population studies. © 2014 Nature America, Inc.

Boomsma D.I.,VU University Amsterdam | Wijmenga C.,University of Groningen | Slagboom E.P.,Leiden University | Swertz M.A.,University of Groningen | And 53 more authors.
European Journal of Human Genetics | Year: 2014

Within the Netherlands a national network of biobanks has been established (Biobanking and Biomolecular Research Infrastructure-Netherlands (BBMRI-NL)) as a national node of the European BBMRI. One of the aims of BBMRI-NL is to enrich biobanks with different types of molecular and phenotype data. Here, we describe the Genome of the Netherlands (GoNL), one of the projects within BBMRI-NL. GoNL is a whole-genome-sequencing project in a representative sample consisting of 250 trio-families from all provinces in the Netherlands, which aims to characterize DNA sequence variation in the Dutch population. The parent-offspring trios include adult individuals ranging in age from 19 to 87 years (mean=53 years; SD=16 years) from birth cohorts 1910-1994. Sequencing was done on blood-derived DNA from uncultured cells and accomplished coverage was 14-15x. The family-based design represents a unique resource to assess the frequency of regional variants, accurately reconstruct haplotypes by family-based phasing, characterize short indels and complex structural variants, and establish the rate of de novo mutational events. GoNL will also serve as a reference panel for imputation in the available genome-wide association studies in Dutch and other cohorts to refine association signals and uncover population-specific variants. GoNL will create a catalog of human genetic variation in this sample that is uniquely characterized with respect to micro-geographic location and a wide range of phenotypes. The resource will be made available to the research and medical community to guide the interpretation of sequencing projects. The present paper summarizes the global characteristics of the project. © 2014 Macmillan Publishers Limited All rights reserved.

de Been M.,University Utrecht | Lanza V.F.,University of Cantabria | de Toro M.,University of Cantabria | Scharringa J.,University Utrecht | And 12 more authors.
PLoS Genetics | Year: 2014

Third-generation cephalosporins are a class of β-lactam antibiotics that are often used for the treatment of human infections caused by Gram-negative bacteria, especially Escherichia coli. Worryingly, the incidence of human infections caused by third-generation cephalosporin-resistant E. coli is increasing worldwide. Recent studies have suggested that these E. coli strains, and their antibiotic resistance genes, can spread from food-producing animals, via the food-chain, to humans. However, these studies used traditional typing methods, which may not have provided sufficient resolution to reliably assess the relatedness of these strains. We therefore used whole-genome sequencing (WGS) to study the relatedness of cephalosporin-resistant E. coli from humans, chicken meat, poultry and pigs. One strain collection included pairs of human and poultry-associated strains that had previously been considered to be identical based on Multi-Locus Sequence Typing, plasmid typing and antibiotic resistance gene sequencing. The second collection included isolates from farmers and their pigs. WGS analysis revealed considerable heterogeneity between human and poultry-associated isolates. The most closely related pairs of strains from both sources carried 1263 Single-Nucleotide Polymorphisms (SNPs) per Mbp core genome. In contrast, epidemiologically linked strains from humans and pigs differed by only 1.8 SNPs per Mbp core genome. WGS-based plasmid reconstructions revealed three distinct plasmid lineages (IncI1- and IncK-type) that carried cephalosporin resistance genes of the Extended-Spectrum Beta-Lactamase (ESBL)- and AmpC-types. The plasmid backbones within each lineage were virtually identical and were shared by genetically unrelated human and animal isolates. Plasmid reconstructions from short-read sequencing data were validated by long-read DNA sequencing for two strains. Our findings failed to demonstrate evidence for recent clonal transmission of cephalosporin-resistant E. coli strains from poultry to humans, as has been suggested based on traditional, low-resolution typing methods. Instead, our data suggest that cephalosporin resistance genes are mainly disseminated in animals and humans via distinct plasmids. © 2014 de Been et al.

Melchionda L.,Fondazione Instituto Neurologico Carlo Besta | Fang M.,BGI Shenzhen | Wang H.,BGI Shenzhen | Fugnanesi V.,Fondazione Instituto Neurologico Carlo Besta | And 13 more authors.
Orphanet Journal of Rare Diseases | Year: 2013

Background: We studied a family including two half-siblings, sharing the same mother, affected by slowly progressive, adult-onset neurological syndromes. In spite of the diversity of the clinical features, characterized by a mild movement disorder with cognitive impairment in the elder patient, and severe motor-neuron disease (MND) in her half-brother, the brain Magnetic Resonance Imaging (MRI) features were compatible with adult-onset Alexander's disease (AOAD), suggesting different expression of the same, genetically determined, condition. Methods. Since mutations in the alpha isoform of glial fibrillary acidic protein, GFAP-α, the only cause so far known of AOAD, were excluded, we applied exome Next Generation Sequencing (NGS) to identify gene variants, which were then functionally validated by molecular characterization of recombinant and patient-derived cells. Results: Exome-NGS revealed a mutation in a previously neglected GFAP isoform, GFAP-, which disrupts the GFAP-associated filamentous cytoskeletal meshwork of astrocytoma cells. To shed light on the different clinical features in the two patients, we sought for variants in other genes. The male patient had a mutation, absent in his half-sister, in X-linked histone deacetylase 6, a candidate MND susceptibility gene. Conclusions: Exome-NGS is an unbiased approach that not only helps identify new disease genes, but may also contribute to elucidate phenotypic expression. © 2013 Melchionda et al.; licensee BioMed Central Ltd.

Pujolar J.M.,University of Aarhus | Jacobsen M.W.,University of Aarhus | Als T.D.,Technical University of Denmark | Als T.D.,University of Aarhus | And 10 more authors.
Heredity | Year: 2014

The two North Atlantic eel species, the European eel (Anguilla anguilla) and the American eel (Anguilla rostrata), spawn in partial sympatry in the Sargasso Sea, providing ample opportunity to interbreed. In this study, we used a RAD (Restriction site Associated DNA) sequencing approach to identify species-specific diagnostic single-nucleotide polymorphisms (SNPs) and design a low-density array that combined with screening of a diagnostic mitochondrial DNA marker. Eels from Iceland (N=159) and from the neighboring Faroe Islands (N=29) were genotyped, along with 94 larvae (49 European and 45 American eel) collected in the Sargasso Sea. Our SNP survey showed that the majority of Icelandic eels are pure European eels but there is also an important contribution of individuals of admixed ancestry (10.7%). Although most of the hybrids were identified as F1 hybrids from European eel female × American eel male crosses, backcrosses were also detected, including a first-generation backcross (F1 hybrid × pure European eel) and three individuals identified as second-generation backcrosses originating from American eel × F1 hybrid backcrosses interbreeding with pure European eels. In comparison, no hybrids were observed in the Faroe Islands, the closest bodies of land to Iceland. It is possible that hybrids show an intermediate migratory behaviour between the two parental species that ultimately brings hybrid larvae to the shores of Iceland, situated roughly halfway between the Sargasso Sea and Europe. Only two hybrids were observed among Sargasso Sea larvae, both backcrosses, but no F1 hybrids, that points to temporal variation in the occurrence of hybridization. © 2014 Macmillan Publishers Limited All rights reserved.

Pujolar J.M.,University of Aarhus | Jacobsen M.W.,University of Aarhus | Frydenberg J.,University of Aarhus | Als T.D.,Technical University of Denmark | And 7 more authors.
Molecular Ecology Resources | Year: 2013

Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the European eel using the RAD sequencing approach that was simultaneously identified and scored in a genome-wide scan of 30 individuals. Whereas genomic resources are increasingly becoming available for this species, including the recent release of a draft genome, no genome-wide set of SNP markers was available until now. The generated SNPs were widely distributed across the eel genome, aligning to 4779 different contigs and 19 703 different scaffolds. Significant variation was identified, with an average nucleotide diversity of 0.00529 across individuals. Results varied widely across the genome, ranging from 0.00048 to 0.00737 per locus. Based on the average nucleotide diversity across all loci, long-term effective population size was estimated to range between 132 000 and 1 320 000, which is much higher than previous estimates based on microsatellite loci. The generated SNP resource consisting of 82 425 loci and 376 918 associated SNPs provides a valuable tool for future population genetics and genomics studies and allows for targeting specific genes and particularly interesting regions of the eel genome. © 2013 John Wiley & Sons Ltd.

Roos L.,Copenhagen University | Fang M.,BGI Shenzhen | Dali C.,Copenhagen University | Jensen H.,Copenhagen University | And 8 more authors.
Clinical Genetics | Year: 2014

Anomalies of eye development can lead to the rare eye malformations microphthalmia and anophthalmia (small or absent ocular globes), which are genetically very heterogeneous. Several genes have been associated with microphthalmia and anophthalmia, and exome sequencing has contributed to the identification of new genes. Very recently, homozygous variations within ALDH1A3 have been associated with autosomal recessive microphthalmia with or without cysts or coloboma, and with variable subphenotypes of developmental delay/autism spectrum disorder in eight families. In a consanguineous family where three of the five siblings were affected with microphthalmia/coloboma, we identified a novel homozygous missense mutation in ALDH1A3 using exome sequencing. Of the three affected siblings, one had intellectual disability and one had intellectual disability and autism, while the last one presented with normal development. This study contributes further to the description of the clinical spectrum associated with ALDH1A3 mutations, and illustrates the interfamilial clinical variation observed in individuals with ALDH1A3 mutations. © 2013 John Wiley & Sons A/S.

De Keersmaecker K.,Catholic University of Leuven | De Keersmaecker K.,Center for the Biology of Disease | Atak Z.K.,Catholic University of Leuven | Li N.,BGI Europe | And 41 more authors.
Nature Genetics | Year: 2013

T-cell acute lymphoblastic leukemia (T-ALL) is caused by the cooperation of multiple oncogenic lesions. We used exome sequencing on 67 T-ALLs to gain insight into the mutational spectrum in these leukemias. We detected protein-altering mutations in 508 genes, with an average of 8.2 mutations in pediatric and 21.0 mutations in adult T-ALL. Using stringent filtering, we predict seven new oncogenic driver genes in T-ALL. We identify CNOT3 as a tumor suppressor mutated in 7 of 89 (7.9%) adult T-ALLs, and its knockdown causes tumors in a sensitized Drosophila melanogaster model. In addition, we identify mutations affecting the ribosomal proteins RPL5 and RPL10 in 12 of 122 (9.8%) pediatric T-ALLs, with recurrent alterations of Arg98 in RPL10. Yeast and lymphoid cells expressing the RPL10 Arg98Ser mutant showed a ribosome biogenesis defect. Our data provide insights into the mutational landscape of pediatric versus adult T-ALL and identify the ribosome as a potential oncogenic factor. © 2013 Nature America, Inc. All rights reserved.

Smith C.G.,University of Cardiff | Naven M.,University of Cardiff | Harris R.,University of Cardiff | Colley J.,University of Cardiff | And 10 more authors.
Human Mutation | Year: 2013

Inherited factors account for around one third of all colorectal cancers (CRCs) and include rare high penetrance mutations in APC, MSH2, MSH6, and POLE. Here, we sought novel tumor-suppressor genes that predispose to CRC by exome resequencing 50 sporadic patients with advanced CRC (18 diagnosed ≤35 years of age) at a mean coverage of 30×. To help identify potentially pathogenic alleles, we initially sought rare or novel germline truncating mutations in 1,138 genes that were likely to play a role in colorectal tumorigenesis. In total, 32 such mutations were identified and confirmed, and included an insertion in APC and a deletion in POLE, thereby validating our approach for identifying disease alleles. We sought somatic mutations in the corresponding genes in the CRCs of the patients harboring the germline lesions and found biallelic inactivation of FANCM, LAMB4, PTCHD3, LAMC3, and TREX2, potentially implicating these genes as tumor suppressors. We also identified a patient who carried a germline truncating mutation in NOTCH3, part of the Notch signaling cascade that maintains intestinal homeostasis. Our whole exome analyses provided further gene lists to facilitate the identification of potential predisposition alleles. We exomeresequenced 50 sporadic patients with advanced CRC. To identify predisposition alleles, we sought rare/novel germline truncating mutations in 1138 genes considered likely to play a role in colorectal tumorigenesis. Thirty-two such mutations were identified, including an insertion in APC, thereby validating our approach. We sought somatic mutations in the corresponding genes in the CRCs of the patients harbouring the germline lesions and found biallelic inactivation of FANCM, LAMB4, PTCHD3, LAMC3 and TREX2, implicating their potential role in CRC-predisposition. © 2013 WILEY PERIODICALS, INC.

Loading BGI Europe collaborators
Loading BGI Europe collaborators