gil, South Korea
gil, South Korea

Time filter

Source Type

PubMed | National Science Foundation, Korea University, The Taejin Genome Institute, Qingdao Agricultural University and Seoul National University
Type: | Journal: Genomics data | Year: 2016

Sorghum (Sorghum bicolor), also known as great millet, is one of the most popular cultivated grass species in the world. Sorghum is frequently consumed as food for humans and animals as well as used for ethanol production. In this study, we conducted de novo transcriptome assembly for sorghum variety Taejin by next-generation sequencing, obtaining 8.748GB of raw data. The raw data in this study can be available in NCBI SRA database with accession number of SRX1715644. Using the Trinity program, we identified 222,161 transcripts from sorghum variety Taejin. We further predicted coding regions within the assembled transcripts by the TransDecoder program, resulting in a total of 148,531 proteins. We carried out BLASTP against the Swiss-Prot protein sequence database to annotate the functions of the identified proteins. To our knowledge, this is the first transcriptome data for a sorghum variety derived from Korea, and it can be usefully applied to the generation of genetic markers.


PubMed | National Science Foundation, The Taejin Genome Institute, Qingdao Agricultural University and Seoul National University
Type: | Journal: Genomics data | Year: 2016

The adzuki bean (Vigna angularis), a member of the family Fabaceae, is widely grown in Asia, from East Asia to the Himalayas. The adzuki bean is known as an ingredient that adds sweetness to diverse desserts made in Eastern Asian countries. Libraries prepared from two V. angularis varieties referred to as Taejin Black and Taejin Red were paired-end sequenced using the Illumina HiSeq 2000 system. The raw data in this study can be available in NCBI SRA database with accession numbers of SRR3406660 and SRR3406553. After de novo transcriptome assembly using Trinity, we obtained 324,219 and 280,056 transcripts from Taejin Black and Taejin Red, respectively. We predicted a total of 238,321 proteins and 179,519 proteins for Taejin Black and Taejin Red, respectively, by the TransDecoder program. We carried out BLASTP on the predicted proteins against the Swiss-Prot protein sequence database to predict the putative functions of identified proteins. Taken together, we provide transcriptomes of two adzuki bean varieties by RNA-Seq, which might be usefully applied to generate molecular markers.


PubMed | National Science Foundation, Korea University, The Taejin Genome Institute, Qingdao Agricultural University and Seoul National University
Type: | Journal: Genomics data | Year: 2016

Foxtail millet (Setaria italica) belonging to the family Poaceae is an important millet that is widely cultivated in East Asia. Of the cultivated millets, the foxtail millet has the longest history and is one of the main food crops in South India and China. Moreover, foxtail millet is a model plant system for biofuel generation utilizing the C4 photosynthetic pathway. In this study, we carried out de novo transcriptome assembly for the foxtail millet variety Taejin collected from Korea using next-generation sequencing. We obtained a total of 8.676GB raw data by paired-end sequencing. The raw data in this study can be available in NCBI SRA database with accession number of SRR3406552. The Trinity program was used to de novo assemble 145,332 transcripts. Using the TransDecoder program, we predicted 82,925 putative proteins. BLASTP was performed against the Swiss-Prot protein sequence database to annotate the functions of identified proteins, resulting in 20,555 potentially novel proteins. Taken together, this study provides transcriptome data for the foxtail millet variety Taejin by RNA-Seq.


Jo Y.,Seoul National University | Chu H.,Korea Basic Science Institute | Kim H.,Seoul National University | Cho J.K.,The Taejin Genome Institute | And 7 more authors.
European Journal of Plant Pathology | Year: 2016

Hop stunt viroid (HSVd) infects a wide range of plants. In this study, we assembled de novo two HSVd genomes from the transcriptome of an apricot species. To examine HSVd variation in a single tree, we sequenced 70 HSVd genomes from unique apricot and plum trees and identified 11 HSVd variants. In addition, we analysed all known 572 HSVd sequences and identified 382 non-redundant HSVd variants. Phylogenetic analysis identified five groups of the 382 HSVd variants. Furthermore, we generated a consensus HSVd sequence by averaging across all 382 sequences. Comparative sequence analysis identified several regions showing sequence variation, while the terminal left region of the rod-like structure was highly conserved. © 2016 Koninklijke Nederlandse Planteziektenkundige Vereniging


Jo Y.,Seoul National University | Choi H.,Seoul National University | Kim S.-M.,South Korean National Institute of Crop Science | Kim S.-L.,South Korean National Institute of Crop Science | And 3 more authors.
BMC Genomics | Year: 2016

Background: Next-generation sequencing (NGS) provides many possibilities for plant virology research. In this study, we performed integrated analyses using plant transcriptome data for plant virus identification using Apple stem grooving virus (ASGV) as an exemplar virus. We used 15 publicly available transcriptome libraries from three different studies, two mRNA-Seq studies and a small RNA-Seq study. Results: We de novo assembled nearly complete genomes of ASGV isolates Fuji and Cuiguan from apple and pear transcriptomes, respectively, and identified single nucleotide variations (SNVs) of ASGV within the transcriptomes. We demonstrated the application of NGS raw data to confirm viral infections in the plant transcriptomes. In addition, we compared the usability of two de novo assemblers, Trinity and Velvet, for virus identification and genome assembly. A phylogenetic tree revealed that ASGV and Citrus tatter leaf virus (CTLV) are the same virus, which was divided into two clades. Recombination analyses identified six recombination events from 21 viral genomes. Conclusions: Taken together, our in silico analyses using NGS data provide a successful application of plant transcriptomes to reveal extensive information associated with viral genome assembly, SNVs, phylogenetic relationships, and genetic recombination. © 2016 The Author(s).


Jo Y.,Seoul National University | Lian S.,Qingdao Agricultural University | Cho J.K.,The Taejin Genome Institute | Choi H.,Seoul National University | And 5 more authors.
Genomics Data | Year: 2016

The adzuki bean (Vigna angularis), a member of the family Fabaceae, is widely grown in Asia, from East Asia to the Himalayas. The adzuki bean is known as an ingredient that adds sweetness to diverse desserts made in Eastern Asian countries. Libraries prepared from two V. angularis varieties referred to as Taejin Black and Taejin Red were paired-end sequenced using the Illumina HiSeq 2000 system. The raw data in this study can be available in NCBI SRA database with accession numbers of SRR3406660 and SRR3406553. After de novo transcriptome assembly using Trinity, we obtained 324,219 and 280,056 transcripts from Taejin Black and Taejin Red, respectively. We predicted a total of 238,321 proteins and 179,519 proteins for Taejin Black and Taejin Red, respectively, by the TransDecoder program. We carried out BLASTP on the predicted proteins against the Swiss-Prot protein sequence database to predict the putative functions of identified proteins. Taken together, we provide transcriptomes of two adzuki bean varieties by RNA-Seq, which might be usefully applied to generate molecular markers. © 2016.


Jo Y.,Seoul National University | Lian S.,Qingdao Agricultural University | Cho J.K.,The Taejin Genome Institute | Choi H.,Seoul National University | And 5 more authors.
Genomics Data | Year: 2016

Sorghum (Sorghum bicolor), also known as great millet, is one of the most popular cultivated grass species in the world. Sorghum is frequently consumed as food for humans and animals as well as used for ethanol production. In this study, we conducted de novo transcriptome assembly for sorghum variety Taejin by next-generation sequencing, obtaining 8.748 GB of raw data. The raw data in this study can be available in NCBI SRA database with accession number of SRX1715644. Using the Trinity program, we identified 222,161 transcripts from sorghum variety Taejin. We further predicted coding regions within the assembled transcripts by the TransDecoder program, resulting in a total of 148,531 proteins. We carried out BLASTP against the Swiss-Prot protein sequence database to annotate the functions of the identified proteins. To our knowledge, this is the first transcriptome data for a sorghum variety derived from Korea, and it can be usefully applied to the generation of genetic markers. © 2016.


Jo Y.,Seoul National University | Lian S.,Qingdao Agricultural University | Cho J.K.,The Taejin Genome Institute | Choi H.,Seoul National University | And 5 more authors.
Genomics Data | Year: 2016

Foxtail millet (Setaria italica) belonging to the family Poaceae is an important millet that is widely cultivated in East Asia. Of the cultivated millets, the foxtail millet has the longest history and is one of the main food crops in South India and China. Moreover, foxtail millet is a model plant system for biofuel generation utilizing the C4 photosynthetic pathway. In this study, we carried out de novo transcriptome assembly for the foxtail millet variety Taejin collected from Korea using next-generation sequencing. We obtained a total of 8.676 GB raw data by paired-end sequencing. The raw data in this study can be available in NCBI SRA database with accession number of SRR3406552. The Trinity program was used to de novo assemble 145,332 transcripts. Using the TransDecoder program, we predicted 82,925 putative proteins. BLASTP was performed against the Swiss-Prot protein sequence database to annotate the functions of identified proteins, resulting in 20,555 potentially novel proteins. Taken together, this study provides transcriptome data for the foxtail millet variety Taejin by RNA-Seq. © 2016.

Loading The Taejin Genome Institute collaborators
Loading The Taejin Genome Institute collaborators