Development of an expressed gene catalogue and molecular markers from the de novo assembly of short sequence reads of the lentil (Lens culinaris Medik.) transcriptome

Publication Overview
TitleDevelopment of an expressed gene catalogue and molecular markers from the de novo assembly of short sequence reads of the lentil (Lens culinaris Medik.) transcriptome
AuthorsVerma P, Shah N, Bhatia S
TypeJournal Article
Journal NamePlant biotechnology journal
Volume11
Issue7
Year2013
Page(s)894-905
CitationVerma P, Shah N, Bhatia S. Development of an expressed gene catalogue and molecular markers from the de novo assembly of short sequence reads of the lentil (Lens culinaris Medik.) transcriptome. Plant biotechnology journal. 2013 Sep; 11(7):894-905.

Abstract

Genomic resources such as ESTs, molecular markers and linkage maps are essential for crop improvement. However, these resources are still limited in important legumes such as lentil (Lens culinaris Medik.), which is valued world wide as a rich source of dietary protein. In this study, the de novo transcriptome assembly of 119,855,798 short reads, generated by Illumina paired-end sequencing, was performed using various assembly programs. This resulted in 42,196 nonredundant high-quality transcripts of average length 810 bases, N50 value of 1,432 and an average expression per transcript of 26.21 rpkm reads per kilobase per million(RPKM). Similarity search with the unigenes and protein sequences of other plants resulted in maximum similarity with soybean. A total of 20,009 nonredundant transcripts showed similarity with the UniProtKB database and of these, 18,064 transcripts were grouped into three main GO categories, that is, biological process (15,126), molecular function (15,505) and cellular component (9,434). Annotated transcripts were mapped to 289 predicted Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and 8,893 transcripts were classified into 24 functional categories based on Cluster of Orthologous Groups (COG) of proteins. Mining the data set for the presence of SSRs resulted in 8,722 SSRs with a frequency occurrence of one SSR per 3.92 kb. From these, 5,673 SSR primer pairs were designed, and a subset of these were utilized for diversity analysis. This study, which provides a large data set of annotated transcripts and gene-based SSR markers, would serve as a foundation for various applications in lentil breeding and genetics.

Features
This publication contains information about 54 features:
Feature NameUniquenameType
LcESSR49LcESSR49genetic_marker
LcESSR5LcESSR5genetic_marker
LcESSR50LcESSR50genetic_marker
LcESSR51LcESSR51genetic_marker
LcESSR52LcESSR52genetic_marker
LcESSR53LcESSR53genetic_marker
LcESSR54LcESSR54genetic_marker
LcESSR57LcESSR57genetic_marker
LcESSR58LcESSR58genetic_marker
LcESSR60LcESSR60genetic_marker
LcESSR63LcESSR63genetic_marker
LcESSR64LcESSR64genetic_marker
LcESSR65LcESSR65genetic_marker
LcESSR66LcESSR66genetic_marker
LcESSR67LcESSR67genetic_marker
LcESSR7LcESSR7genetic_marker
LcESSR70LcESSR70genetic_marker
LcESSR74LcESSR74genetic_marker
LcESSR78LcESSR78genetic_marker
LcESSR79LcESSR79genetic_marker
LcESSR8LcESSR8genetic_marker
LcESSR80LcESSR80genetic_marker
LcESSR81LcESSR81genetic_marker
LcESSR83LcESSR83genetic_marker
LcESSR84LcESSR84genetic_marker

Pages

Properties
Additional details for this publication include:
Property NameValue
Journal CountryEngland
DOI10.1111/pbi.12082
Elocation10.1111/pbi.12082
Publication ModelPrint-Electronic
ISSN1467-7652
eISSN1467-7652
Publication Date2013 Sep
Journal AbbreviationPlant Biotechnol. J.
LanguageEnglish
Language Abbreng
Publication TypeJournal Article
Copyright© 2013 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Publication TypeResearch Support, Non-U.S. Gov't