logo
Statistics of Sesamum indicum Genomics Data
 ♣Summary of Sesame genome assembly and annotation
 
Assembly   Number N50 (size/number) N90 (size/number) Total length
Contigs All 26,277 52.17Kb/1,545 11.40 Kb / 5,534 270 Mb
Scaffolds All   2.10 Mb / 42 268.23 Kb / 169 274 Mb
Anchored on chromosomes 150 -- -- 234 Mb
Anchored on chromosomes and oriented 117 -- -- 207 Mb
Annotation   Number Total length Percentage of the assembly
Protein coding genes All 27,148 86.08 Mb 31.46
Transposable elements All -- 78.86 Mb 28.46
LTR-Retroelements -- 48.03 Mb 17.56
non-LTR Retrotransposons* -- 11.70 Mb 4.28
DNA transposons -- 10.88Mb 3.98
Unknown -- 14.64 Mb 5.35
Noncoding RNAs rRNAs 386 89.66Kb < 0.04
tRNAs 870 65.31Kb < 0.03
miRNAs 207 25.41Kb < 0.01
snRNAs268 33.93Kb< 0.02
 
 ♣Statistic information of the scaffolds anchored on each sesame linkage group
 
Linkage group Number of markers Number of scaffolds (all) Number of scaffolds (oriented) Total length (bp, with NNs) Total length (bp,withoutNNs)
LG1 32 10 9 18,577,331 18,353,930
LG2 26 8 7 18,500,646 18,309,402
LG3 48 14 12 24,928,530 24,586,084
LG4 43 18 10 17,356,267 16,975,142
LG5 33 13 9 18,898,134 18,612,917
LG6 36 13 12 25,289,714 25,012,497
LG7 30 14 10 11,725,536 11,519,752
LG8 27 9 8 21,523,998 21,308,197
LG9 14 6 6 12,411,895 12,246,513
LG10 24 10 7 17,245,970 17,055,383
LG11 27 9 7 15,446,199 15,265,867
LG12 19 6 6 6,373,461 6,278,374
LG13 17 7 6 5,050,363 4,947,375
LG14 6 4 2 4,882,680 4,824,773
LG15 14 5 4 10,047,770 9,943,669
LG16 7 4 2 4,963,887 4,883,938
Total 403 150 117 233,222,381 230,123,813
 
 ♣Comparison of the gene structure amongasterids and rosids clades
 
  Sesame Potato Tomato Arabidopsis Soybean Poplar Grape
Genome assembly size* (Mb) 273.60 682.70 737.64 119.48 955.05 403.75 470.21
# Genes 27,148 39,031 34,763 26,637 55,787 45,033 26,346
# Exons 128,461 135,708 157,368 139,382 331,060 224,259 156,765
# Introns 101,313 96,677 122,605 112,745 275,273 179,226 130,419
Mean exon per gene 4.73 3.48 4.53 5.23 5.93 4.98 5.95
Mean exon length (bp) 249.45 266.58 228.78 237.50 206.26 231.14 191.10
Mean CDS length (bp) 1180.37 926.88 1035.65 1242.78 1224.01 1151.06 1137.11
Mean intron length (bp) 439.14 621.43 540.63 157.54 423.71 347.09 969.55
Mean transcripts length (bp) 3170.84 2936.33 3163.36 1909.57 3816.24 2916.61 6454.02
 
 ♣Non-coding genes in the sesame genome
 
Type Copy Number Average Length (bp) Total Length (bp)
miRNA 207 122.73 25,405
tRNA 870 75.06 65,305
rRNA rRNA 386 232.29 89,664
18S 197 344.24 67,815
28S 124 122.91 15,241
5.8S 33 126.88 4,187
5S 32 75.66 2,421
snRNA snRNA 268 126.60 33,930
CD-box 118 101.88 12,022
HACA-box 21 122.38 2,570
splicing 129 149.91 19,338
 
 ♣Repeat elements in the sesame genome.
 
 

RepBase TEs

TE Protiens De novo Combined TEs
  Length (bp) %in genome Length (bp) % in genome Length (bp) % in genome Length (bp) % in genome
DNA 2,820,309 1.03 2,547,265 0.93 8,079,254 2.95 10,881,659 3.98
LINE 1,192,426 0.44 7,477,236 2.73 7,701,075 2.82 11,571,539 4.23
LTR 10,197,999 3.73 17,262,796 6.31 39,149933 14.31 48,030,533 17.56
SINE 25,695 0.01 0 0 101,023 0.04 124,172 0.05
Other 4,036 0 0 0 0 0 4,036 0
Unknown 15,738 0.01 14,589 0.01 14,614,303 5.34 14,643,856 5.35
Total 14,006,771 5.12 27,290,716 9.98 63,724,637 23.29 77,856,077 28.46

 

 

2013 @ All copyright are reserved by Department of Genomics and Molecular Biology at Oil Crops Research Institute.
For suggestions, please contact web Administrator
IE 5.5 & 1024×768 Resolution Suggested