C. arietinum CSFL RefTrans V2

Overview
Analysis NameC. arietinum CSFL RefTrans V2
Methodreftrans (2.2.31+)
Sourcec.arietinum_csfl_reftransV2
Date performed2017-09-25

Materials & Methods

CSFL Cicer arietinum RefTrans V2 combines peer-reviewed published RNA-Seq and EST data sets to create a reference transcriptome (RefTrans, 56,176 sequences) for Cicer arietinum and provides putative gene function identified by homology to known proteins.

In  Cicer arietinum RefTrans V2, 1.0 billion RNA-Seq reads from publicly available, peer-reviewed Cicer arietinum RNA-Seq data sets (Jain et al, 2015 [SRP062857], Garg et al, 2016 [SRP059919, SRP059951], Gupta et al, 2017 [SRP041784], Pandey et al, 2017 [SRP078184]), and 52,788 ESTs, were downloaded from the NCBI Short Read Archive database, the EBI database and the NCBI dbEST database, respectively. The RNA-Seq reads and ESTs were assembled by using the Mainlab RefTrans pipeline (manuscript in preparation – details of pipeline provided ahead of publication on request). The RefTran sequences were functionally characterized by pairwise comparison using the BLASTX algorithm against the Swiss-Prot (UniProtKB/Swiss-Prot Release 2017_06) and TrEMBL (UniProtKB/TrEMBL Release 2017_06) protein databases.  Information on the top 10 matches with an expectation (E) value of ≤ 1E-06 were recorded and stored in CSFL together with the RefTrans sequences. InterPro domains and Gene Ontology assignments were made to Cicer arietinum RefTrans V2 using InterProScan at the EBI through Blast2GO.  The transcriptome and associated annotation are available to download, search by name, keyword (functional description), or mapped location, and view on the genome through JBrowse.

Properties
Additional information about this analysis:
Property NameValue
Analysis Typereftrans
Download

 

Assembly

 
RefTrans in FASTA format  (56,176 sequences) Cicer arietinum  RefTrans v2 FASTA format 

Homology Analysis 

Homology was determined using the BLASTx algorithm with an e-value cutoff of 1.0 e-6 for the Cicer arietinum RefTrans V2 vs. the Swiss-Prot (UniProtKB/Swiss-Prot Release 2017_06), TrEMBL(UniProtKB/TrEMBL Release 2017_06).  Only the best match was kept. 

UniProtKB/Swiss-Prot

BLAST of refTrans to Swiss-Prot EXCEL format (73% refTrans with homologies) Cicer arietinum  RefTrans V2 vs Swissprot
RefTrans with homologies  FASTA format Cicer arietinum RefTrans V2 vs Swissprot_hit
RefTrans without homologies FASTA format Cicer arietinum  RefTrans V2 vs Swissprot_noHit

UniProtKB/TrEMBL

BLAST of refTrans to TrEMBL EXCEL format (90% refTrans with homologies) Cicer arietinum   RefTrans V2 vs TrEMBL
RefTrans with homologies  FASTA format Cicer arietinum  RefTrans V2 vs TrEMBL_hit
RefTrans without homologies FASTA format Cicer arietinum RefTrans V2 vs TrEMBL_noHit

 

InterProscan Analysis

InterPro domains and Gene Ontology assignments were made to Cicer arietinum  RefTrans V2 using InterProScan at the EBI through Blast2GO.
Gene Ontology annotations by RefTrans EXCEL format  Cicer arietinum  RefTrans V2 Gene Ontology annotations
InterPro annotations by RefTrans EXCEL format Cicer arietinum  RefTrans V2 InterPro annotations

 

KEGG Analysis

KEGG pathway and ortholog assignments were made to Cicer arietinum  RefTrans V2 using the KEGG / KASS server at http://www.genome.jp/kaas-bin/kaas_main. 
KEGG pathway annotations by RefTrans EXCEL format Cicer arietinum  RefTrans V2 KEGG pathways
KEGG ortholog annotations by RefTrans EXCEL format Cicer arietinum  RefTrans V2 KEGG orthologs

 

Alignments

The alignment tool 'BLAT' was used to map Cicer arietinum  RefTrans V2 to the Cicer arietinum CDC Frontier genome v1 Alignments with an alignment length of 95% and 90% identify were preserved. 
BLAT of refTrans to Cicer arietinum CDC Frontier genome v1.0 EXCEL format Cicer arietinum  RefTrans V2_Cicer arietinum CDC Frontier genome v1.0