PubMed | China National GeneBank Shenzhen, Zoologisches Forschungsmuseum Alexander Koenig ZFMK Zentrum fur Molekulare Biodiversitatsforschung ZMB, University of California at Davis and BGI Shenzhen
Type: Journal Article | Journal: Molecular ecology resources | Year: 2016
Biodiversity analyses based on next-generation sequencing (NGS) platforms have developed by leaps and bounds in recent years. A PCR-free strategy, which can alleviate taxonomic bias, was considered as a promising approach to delivering reliable species compositions of targeted environments. The major impediment of such a method is the lack of appropriate mitochondrial DNA enrichment ways. Because mitochondrial genomes (mitogenomes) make up only a small proportion of total DNA, PCR-free methods will inevitably result in a huge excess of data (>99%). Furthermore, the massive volume of sequence data is highly demanding on computing resources. Here, we present a mitogenome enrichment pipeline via a gene capture chip that was designed by virtue of the mitogenome sequences of the 1000 Insect Transcriptome Evolution project (1KITE, www.1kite.org). A mock sample containing 49 species was used to evaluate the efficiency of the mitogenome capture method. We demonstrate that the proportion of mitochondrial DNA can be increased by approximately 100-fold (from the original 0.47% to 42.52%). Variation in phylogenetic distances of target taxa to the probe set could in principle result in bias in abundance. However, the frequencies of input taxa were largely maintained after capture (R(2) = 0.81). We suggest that our mitogenome capture approach coupled with PCR-free shotgun sequencing could provide ecological researchers an efficient NGS method to deliver reliable biodiversity assessment.
PubMed | University of Tübingen, University of Bonn, University of Kassel, University of Cologne and 4 more.
Type: Journal Article | Journal: BMC genomics | Year: 2016
Body plan development in multi-cellular organisms is largely determined by homeotic genes. Expression of homeotic genes, in turn, is partially regulated by insulator binding proteins (IBPs). While only a few enhancer blocking IBPs have been identified in vertebrates, the common fruit fly Drosophila melanogaster harbors at least twelve different enhancer blocking IBPs. We screened recently compiled insect transcriptomes from the 1KITE project and genomic and transcriptomic data from public databases, aiming to trace the origin of IBPs in insects and other arthropods.Our study shows that the last common ancestor of insects (Hexapoda) already possessed a substantial number of IBPs. Specifically, of the known twelve insect IBPs, at least three (i.e., CP190, Su(Hw), and CTCF) already existed prior to the evolution of insects. Furthermore we found GAF orthologs in early branching insect orders, including Zygentoma (silverfish and firebrats) and Diplura (two-pronged bristletails). Mod(mdg4) is most likely a derived feature of Neoptera, while Pita is likely an evolutionary novelty of holometabolous insects. Zw5 appears to be restricted to schizophoran flies, whereas BEAF-32, ZIPIC and the Elba complex, are probably unique to the genus Drosophila. Selection models indicate that insect IBPs evolved under neutral or purifying selection.Our results suggest that a substantial number of IBPs either pre-date the evolution of insects or evolved early during insect evolution. This suggests an evolutionary history of insulator binding proteins in insects different to that previously thought. Moreover, our study demonstrates the versatility of the 1KITE transcriptomic data for comparative analyses in insects and other arthropods.
PubMed | Centers for Disease Control and Prevention, South China Agricultural University, Shenzhen Third Peoples Hospital, University of Chinese Academy of Sciences and 13 more.
Type: Journal Article | Journal: Cell host & microbe | Year: 2016
Constant surveillance of live poultry markets (LPMs) is currently the best way to predict and identify emerging avian influenza viruses (AIVs) that pose a potential threat to public health. Through surveillance of LPMs from 16 provinces and municipalities in China during 2014-2016, we identified 3,174 AIV-positive samples and isolated and sequenced 1,135 AIVs covering 31 subtypes. Our analysis shows that H5N6 has replaced H5N1 as one of the dominant AIV subtypes in southern China, especially in ducks. Phylogenetic analysis reveals that H5N6 arose from reassortments of H5 and H6N6 viruses, with the hemagglutinin and neuraminidase combinations being strongly lineage specific. H5N6 viruses constitute atleast 34 distinct genotypes derived from various evolutionary pathways. Notably, genotype G1.2 virus, with internal genes from the chicken H9N2/H7N9 gene pool, was responsible for at least five human H5N6 infections. Our findings highlight H5N6 AIVs as potential threats to public health and agriculture.
Chen L.,University of Chinese Academy of Sciences |
Tang L.,Wuhan University |
Xiang H.,University of Chinese Academy of Sciences |
Jin L.,China National Genebank Shenzhen |
And 5 more authors.
GigaScience | Year: 2014
Genetic modification has long provided an approach for " reverse genetics" , analyzing gene function and linking DNA sequence to phenotype. However, traditional genome editing technologies have not kept pace with the soaring progress of the genome sequencing era, as a result of their inefficiency, time-consuming and labor-intensive methods. Recently, invented genome modification technologies, such as ZFN (Zinc Finger Nuclease), TALEN (Transcription Activator-Like Effector Nuclease), and CRISPR/Cas9 nuclease (Clustered Regularly Interspaced Short Palindromic Repeats/Cas9 nuclease) can initiate genome editing easily, precisely and with no limitations by organism. These new tools have also offered intriguing possibilities for conducting functional large-scale experiments. In this review, we begin with a brief introduction of ZFN, TALEN, and CRISPR/Cas9 technologies, then generate an extensive prediction of effective TALEN and CRISPR/Cas9 target sites in the genomes of a broad range of taxonomic species. Based on the evidence, we highlight the potential and practicalities of TALEN and CRISPR/Cas9 editing in non-model organisms, and also compare the technologies and test interesting issues such as the functions of candidate domesticated, as well as candidate genes in life-environment interactions. When accompanied with a high-throughput sequencing platform, we forecast their potential revolutionary impacts on evolutionary and ecological research, which may offer an exciting prospect for connecting the gap between DNA sequence and phenotype in the near future. © 2014 Chen et al.; licensee BioMed Central Ltd.
PubMed | University of Chinese Academy of Sciences, Wuhan University, Kunming University of Science and Technology and China National Genebank Shenzhen
Type: | Journal: GigaScience | Year: 2014
Genetic modification has long provided an approach for reverse genetics, analyzing gene function and linking DNA sequence to phenotype. However, traditional genome editing technologies have not kept pace with the soaring progress of the genome sequencing era, as a result of their inefficiency, time-consuming and labor-intensive methods. Recently, invented genome modification technologies, such as ZFN (Zinc Finger Nuclease), TALEN (Transcription Activator-Like Effector Nuclease), and CRISPR/Cas9 nuclease (Clustered Regularly Interspaced Short Palindromic Repeats/Cas9 nuclease) can initiate genome editing easily, precisely and with no limitations by organism. These new tools have also offered intriguing possibilities for conducting functional large-scale experiments. In this review, we begin with a brief introduction of ZFN, TALEN, and CRISPR/Cas9 technologies, then generate an extensive prediction of effective TALEN and CRISPR/Cas9 target sites in the genomes of a broad range of taxonomic species. Based on the evidence, we highlight the potential and practicalities of TALEN and CRISPR/Cas9 editing in non-model organisms, and also compare the technologies and test interesting issues such as the functions of candidate domesticated, as well as candidate genes in life-environment interactions. When accompanied with a high-throughput sequencing platform, we forecast their potential revolutionary impacts on evolutionary and ecological research, which may offer an exciting prospect for connecting the gap between DNA sequence and phenotype in the near future.
PubMed | CAS Kunming Institute of Zoology, Puer Institute of Pu er Tea, China National GeneBank Shenzhen, Agri Biotech Laboratory and 4 more.
Type: Journal Article | Journal: Molecular plant | Year: 2015
Dendrobium officinale Kimura et Migo is a traditional Chinese orchid herb that has both ornamental value and a broad range of therapeutic effects. Here, we report the first de novo assembled 1.35 Gb genome sequences for D.officinale by combining the second-generation Illumina Hiseq 2000 and third-generation PacBio sequencing technologies. We found that orchids have a complete inflorescence gene set and have some specific inflorescence genes. We observed gene expansion in gene families related to fungus symbiosis and drought resistance. We analyzed biosynthesis pathways of medicinal components of D.officinale and found extensive duplication of SPS and SuSy genes, which are related to polysaccharide generation, and that the pathway of D.officinale alkaloid synthesis could be extended to generate 16-epivellosimine. The D.officinale genome assembly demonstrates a new approach to deciphering large complex genomes and, as an important orchid species and a traditional Chinese medicine, the D.officinale genome will facilitate future research on the evolution of orchid plants, as well as the study of medicinal components and potential genetic breeding of the dendrobe.
PubMed | China National Genebank Shenzhen, University of New South Wales, U.S. Center for Disease Control and Prevention, Chinese National Institute for Viral Disease Control and Prevention and CAS Institute of Microbiology
Type: Journal Article | Journal: PLoS pathogens | Year: 2016
The emergence of severe acute respiratory syndrome coronavirus (SARS-CoV) in 2002 and Middle East respiratory syndrome coronavirus (MERS-CoV) in 2012 has generated enormous interest in the biodiversity, genomics and cross-species transmission potential of coronaviruses, especially those from bats, the second most speciose order of mammals. Herein, we identified a novel coronavirus, provisionally designated Rousettus bat coronavirus GCCDC1 (Ro-BatCoV GCCDC1), in the rectal swab samples of Rousettus leschenaulti bats by using pan-coronavirus RT-PCR and next-generation sequencing. Although the virus is similar to Rousettus bat coronavirus HKU9 (Ro-BatCoV HKU9) in genome characteristics, it is sufficiently distinct to be classified as a new species according to the criteria defined by the International Committee of Taxonomy of Viruses (ICTV). More striking was that Ro-BatCoV GCCDC1 contained a unique gene integrated into the 3-end of the genome that has no homologs in any known coronavirus, but which sequence and phylogeny analyses indicated most likely originated from the p10 gene of a bat orthoreovirus. Subgenomic mRNA and cellular-level observations demonstrated that the p10 gene is functional and induces the formation of cell syncytia. Therefore, here we report a putative heterologous inter-family recombination event between a single-stranded, positive-sense RNA virus and a double-stranded segmented RNA virus, providing insights into the fundamental mechanisms of viral evolution.
Parks M.,Griffith University |
Subramanian S.,Griffith University |
Baroni C.,University of Pisa |
Salvatore M.C.,University of Pisa |
And 4 more authors.
Philosophical Transactions of the Royal Society B: Biological Sciences | Year: 2015
Recently, the study of ancient DNA (aDNA) has been greatly enhanced by the development of second-generation DNA sequencing technologies and targeted enrichment strategies. These developments have allowed the recovery of several complete ancient genomes, a result that would have been considered virtually impossible only a decade ago. Prior to these developments, aDNA research was largely focused on the recovery of short DNA sequences and their use in the study of phylogenetic relationships, molecular rates, species identification and population structure. However, it is now possible to sequence a large number of modern and ancient complete genomes from a single species and thereby study the genomic patterns of evolutionary change over time. Such a study would herald the beginnings of ancient population genomics and its use in the study of evolution. Species that are amenable to such large-scale studies warrant increased research effort. We report here progress on a population genomic study of the Adélie penguin (Pygoscelis adeliae). This species is ideally suited to ancient population genomic research because both modern and ancient samples are abundant in the permafrost conditions of Antarctica. This species will enable us to directly address many of the fundamental questions in ecology and evolution. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
PubMed | University of Toronto, China National Genebank Shenzhen, CAS Kunming Institute of Zoology, Xishuangbanna Animal Husbandry and Veterinary Station of Yunnan Province and 3 more.
Type: Journal Article | Journal: Molecular biology and evolution | Year: 2015
Much like other indigenous domesticated animals, Tibetan chickens living at high altitudes (2,200-4,100 m) show specific physiological adaptations to the extreme environmental conditions of the Tibetan Plateau, but the genetic bases of these adaptations are not well characterized. Here, we assembled a de novo genome of a Tibetan chicken and resequenced whole genomes of 32 additional chickens, including Tibetan chickens, village chickens, game fowl, and Red Junglefowl, and found that the Tibetan chickens could broadly be placed into two groups. Further analyses revealed that several candidate genes in the calcium-signaling pathway are possibly involved in adaptation to the hypoxia experienced by these chickens, as these genes appear to have experienced directional selection in the two Tibetan chicken populations, suggesting a potential genetic mechanism underlying high altitude adaptation in Tibetan chickens. The candidate selected genes identified in this study, and their variants, may be useful targets for clarifying our understanding of the domestication of chickens in Tibet, and might be useful in current breeding efforts to develop improved breeds for the highlands.
PubMed | China National GeneBank Shenzhen and Huahan Gene Research Institute HHGene
Type: | Journal: Advances in experimental medicine and biology | Year: 2016
Stem cells are highly promising resources for application in cell therapy, regenerative medicine, drug discovery, toxicology and developmental biology research. Stem cell banks have been increasingly established all over the world in order to preserve their cellular characteristics, prevent contamination and deterioration, and facilitate their effective use in basic and translational research, as well as current and future clinical application. Standardization and quality control during banking procedures are essential to allow researchers from different labs to compare their results and to develop safe and effective new therapies. Furthermore, many stem cells come from once-in-a-life time tissues. Cord blood for example, thrown away in the past, can be used to treat many diseases such as blood cancers nowadays. Meanwhile, these cells stored and often banked for long periods can be immediately available for treatment when needed and early treatment can minimize disease progression. This paper provides an overview of the fundamental principles of stem cell banking, including: (i) a general introduction of the construction and architecture commonly used for stem cell banks; (ii) a detailed section on current quality management practices; (iii) a summary of questions we should consider for long-term storage, such as how long stem cells can be stored stably, how to prevent contamination during long term storage, etc.; (iv) the prospects for stem cell banking.