Statistics on modern & ancestral genomes
Genomicus 16.03 contains 937643 genes from 26 extant species.
This data was analysed with a new method called AGORA (Algorithm for Gene Order Reconstruction in Ancestors; Muffato et al. in preparation) to identify 900636 ancestral genes from 24 ancestral species, grouped into 19806 blocks of collinear genes.
The following tables present statistics on the data used in Genomicus, and on the reconstructed blocks in the ancestral species.
- For extant species, a "contig" means a segment of the genome sequence assembly (chromosomes, scaffolds, contig) with at least two genes.
- The average block length, the N25, N50 and N75 values do not take into account the singletons (blocks of one gene).
|
Solanum |
Age |
Genes |
Contigs |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in contigs |
Coverage (%) |
Solanum tuberosum | (extant sp.) | 39021 | 13 | 3001.6 | 3012 | 0 | 100.0% |
Solanum lycopersicum | (extant sp.) | 34675 | 13 | 2667.3 | 2543 | 0 | 100.0% |
|
Fabids |
Age |
Genes |
Contigs |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in contigs |
Coverage (%) |
Prunus persica | (extant sp.) | 27864 | 37 | 752.5 | 3162 | 22 | 99.9% |
Glycine max | (extant sp.) | 54174 | 62 | 872.4 | 2658 | 86 | 99.8% |
Fragaria vesca | (extant sp.) | 32831 | 8 | 4103.9 | 4051 | 0 | 100.0% |
Lotus japonicus | (extant sp.) | 38482 | 1475 | 16.0 | 46 | 14854 | 61.4% |
Populus trichocarpa | (extant sp.) | 41377 | 436 | 93.7 | 2145 | 540 | 98.7% |
|
Brassicaceae |
Age |
Genes |
Contigs |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in contigs |
Coverage (%) |
Arabidopsis lyrata subsp lyrata | (extant sp.) | 32667 | 273 | 119.1 | 4113 | 164 | 99.5% |
Thellungiella parvula | (extant sp.) | 27132 | 233 | 112.9 | 1710 | 815 | 97.0% |
Brassica rapa subsp pekinensis | (extant sp.) | 41018 | 147 | 278.1 | 3999 | 137 | 99.7% |
Capsella rubella | (extant sp.) | 26521 | 65 | 407.1 | 3422 | 58 | 99.8% |
Thellungiella halophila | (extant sp.) | 26351 | 42 | 627.0 | 1997 | 19 | 99.9% |
Arabidopsis thaliana | (extant sp.) | 27416 | 7 | 3916.6 | 5437 | 0 | 100.0% |
|
Other Malvids |
Age |
Genes |
Contigs |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in contigs |
Coverage (%) |
Theobroma cacao | (extant sp.) | 46140 | 11 | 4194.5 | 4217 | 0 | 100.0% |
Carica papaya | (extant sp.) | 27582 | 1252 | 19.7 | 127 | 2859 | 89.6% |
|
Other Dicotyledones |
Age |
Genes |
Contigs |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in contigs |
Coverage (%) |
Vitis vinifera | (extant sp.) | 29927 | 33 | 906.9 | 1450 | 0 | 100.0% |
|
Poaceae |
Age |
Genes |
Contigs |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in contigs |
Coverage (%) |
Sorghum bicolor | (extant sp.) | 34496 | 60 | 572.3 | 3714 | 157 | 99.5% |
Setaria italica | (extant sp.) | 35471 | 31 | 1142.8 | 4216 | 45 | 99.9% |
Oryza sativa Indica Group | (extant sp.) | 40745 | 608 | 64.8 | 3163 | 1371 | 96.6% |
Oryza glaberrima | (extant sp.) | 33164 | 410 | 79.7 | 2508 | 494 | 98.5% |
Brachypodium distachyon | (extant sp.) | 26552 | 11 | 2413.0 | 5990 | 9 | 100.0% |
Hordeum vulgare subsp vulgare | (extant sp.) | 24211 | 66 | 334.4 | 1982 | 2138 | 91.2% |
Oryza sativa Japonica Group | (extant sp.) | 57939 | 16 | 3621.2 | 4703 | 0 | 100.0% |
Zea mays | (extant sp.) | 63331 | 13 | 4871.6 | 6764 | 0 | 100.0% |
Oryza brachyantha | (extant sp.) | 32037 | 130 | 245.8 | 2613 | 80 | 99.8% |
|
Other Monocotyledones |
Age |
Genes |
Contigs |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in contigs |
Coverage (%) |
Musa acuminata subsp malaccensis | (extant sp.) | 36519 | 12 | 3043.2 | 3106 | 0 | 100.0% |
|
Ancestors in Monocotyledones |
Age |
Genes |
Blocks |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in blocks |
Coverage (%) |
Poaceae | ~41 My | 41267 | 567 | 34.2 | 1331 | 21889 | 47.0% |
Oryza | ~40 My | 46230 | 1297 | 23.5 | 1437 | 15766 | 65.9% |
BEP clade | ~40 My | 36190 | 440 | 46.0 | 1489 | 15962 | 55.9% |
Panicoideae | | 35750 | 504 | 42.2 | 1759 | 14485 | 59.5% |
Andropogoneae | ~9 My | 35948 | 815 | 26.6 | 1486 | 14298 | 60.2% |
Pooideae | | 26194 | 365 | 45.7 | 196 | 9519 | 63.7% |
Commelinids | ~100 My | 30826 | 1817 | 2.7 | 2 | 25941 | 15.8% |
Oryza sativa | | 44603 | 413 | 80.6 | 2558 | 11316 | 74.6% |
|
Ancestors in Dicotyledones |
Age |
Genes |
Blocks |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in blocks |
Coverage (%) |
eurosids | ~107 My | 55547 | 1228 | 14.9 | 565 | 37287 | 32.9% |
Camelineae | | 30968 | 279 | 81.7 | 2943 | 8169 | 73.6% |
Anc arath cacao | ~90 My | 40373 | 1063 | 15.8 | 244 | 23595 | 41.6% |
Leguminosae | ~97 My | 39504 | 970 | 16.5 | 321 | 23461 | 40.6% |
Brassicaceae | ~16 My | 36892 | 452 | 47.4 | 2713 | 15477 | 58.0% |
Solanum | ~3 My | 30062 | 448 | 45.3 | 787 | 9774 | 67.5% |
Papilionoideae | ~60 My | 34726 | 1405 | 10.5 | 24 | 19948 | 42.6% |
fabids | ~100 My | 44100 | 1079 | 16.5 | 437 | 26343 | 40.3% |
Arabidopsis | ~5 My | 28735 | 151 | 157.0 | 2710 | 5025 | 82.5% |
Brassiceae | | 37227 | 930 | 25.2 | 2231 | 13833 | 62.8% |
Dicotyledones | ~110 My | 47270 | 1183 | 9.6 | 28 | 35959 | 23.9% |
Eutremeae | | 29109 | 672 | 32.4 | 1885 | 7338 | 74.8% |
rosids | ~110 My | 48801 | 1091 | 14.2 | 229 | 33298 | 31.8% |
Brassicales | | 31249 | 1059 | 12.6 | 31 | 17957 | 42.5% |
Rosaceae | | 26250 | 266 | 58.3 | 577 | 10755 | 59.0% |
|
Other Ancestors |
Age |
Genes |
Blocks |
Average size (nb genes) |
N50 size (nb genes) |
Genes not in blocks |
Coverage (%) |
Magnoliophyta | ~163 My | 42815 | 1312 | 2.7 | 2 | 39337 | 8.1% |