Cajanus cajan cv. Asha genome v2.0

Genome Overview
Analysis NameCajanus cajan cv. Asha genome v2.0
MethodHiSeqXTen and NextGen500 (Assembly with 3D-DNA, Juicebox Assembly Tools)
SourceLegumepedia
Date performed2024-06-26

The genome was sequenced and assembled as outlined in Garg V, Dudchenko O, Wang J, Khan AW, Gupta S, Kaur P, Han K, Saxena RK, Kale SM, Pham M, Yu J. Chromosome-length genome assemblies of six legume species provide insights into genome organization, evolution, and agronomic traits for crop improvement. Journal of advanced research. 2022 Dec 1;42:315-29.

About the assembly:

Number of scaffolds 35,727
Total size  595 Mb
N50 53,895,704 bp
Assembly BUSCO score (embryophyta_odb10) 97.0%
Annotation BUSCO score (embryophyta_odb10) 94.7%
Functional Analysis

Functional annotation files for the Cajanus cajan cv. Asha genome v2.0 are available for download below. The C. cajan cv. Asha genome v2.0 proteins were analyzed using InterProScan in order to assign InterPro domains and Gene Ontology (GO) terms. Pathways analysis was performed using the KEGG Automatic Annotation Server (KAAS).

Downloads

GO assignments from InterProScan Cc_Asha_v2_genes2GO.xlsx.gz
IPR assignments from InterProScan Cc_Asha_v2_genes2IPR.xlsx.gz
Proteins mapped to KEGG Orthologs Cc_Asha_v2_KEGG-orthologis.xlsx.gz
Proteins mapped to KEGG Pathways Cc_Asha_v2_KEGG-pathways.xlsx.gz
Assembly

The Cajanus cajan cv. Asha genome v2.0 assembly file is available in FASTA format.

Downloads

Chromosomes and scaffolds (FASTA file) Cc_Asha_v2.0.fasta.gz
Gene Prediction

The Cajanus cajan cv. Asha genome v2.0 gene prediction files are available in GFF3 and FASTA format.

Downloads

Protein sequences  (FASTA file) Cc_Asha_v2.0.proteins.fasta.gz
CDS  (FASTA file) Cc_Asha_v2.0.cds.fasta.gz
Genes (GFF3 file) Cc_Asha_v2.0.genes.gff3.gz
Homology

Homology of the Cajanus cajan cv. Asha genome v2.0 proteins was determined by pairwise sequence comparison using the blastp algorithm against various protein databases. An expectation value cutoff less than 1e-6  for the Arabidoposis proteins (Araport11, 2022-09), UniProtKB/SwissProt (Release 2024-03), and UniProtKB/TrEMBL (Release 2024-03) databases. The best hit reports are available for download in Excel format. 

Protein Homologs

V. angularis cv. Asha genome v2.0 proteins with arabidopsis (Araport11) homologs (EXCEL file) Cc_Asha_v2_vs_tair.xlsx.gz
V. angularis cv. Asha genome v2.0 proteins with arabidopsis (Araport11) (FASTA file) Cc_Asha_v2_vs_tair_hit.fasta.gz
V. angularis cv. Asha genome v2.0 proteins without arabidopsis (Araport11) (FASTA file) Cc_Asha_v2_vs_tair_noHit.fasta.gz
V. angularis cv. Asha genome v2.0 proteins with SwissProt homologs (EXCEL file) Cc_Asha_v2_vs_swissprot.xlsx.gz
V. angularis cv. Asha genome v2.0 proteins with SwissProt (FASTA file) Cc_Asha_v2_vs_swissprot_hit.fasta.gz
V. angularis cv. Asha genome v2.0 proteins without SwissProt (FASTA file) Cc_Asha_v2_vs_swissprot_noHit.fasta.gz
V. angularis cv. Asha genome v2.0 proteins with TrEMBL homologs (EXCEL file) Cc_Asha_v2_vs_trembl.xlsx.gz
V. angularis cv. Asha genome v2.0 proteins with TrEMBL (FASTA file) Cc_Asha_v2_vs_trembl_hit.fasta.gz
V. angularis cv. Asha genome v2.0 proteins without TrEMBL (FASTA file) Cc_Asha_v2_vs_trembl_noHit.fasta.gz