Tapir is currently indexing what is a suffiently representative sample of reference known and annotated DNA for our research and usage of sequencing data. Although we are increasing the size of that reference set relatively quickly (it has been multiplied by 2 every 2-3 months), do not hesitate to contact us if you would like us to immediately add specific reference DNA of interest, or if you would like general information about the site.
We are working on a comprehensive search interface to look up if an organism of interest is already indexed. In the meantime, the following table gives an overview of the sources of reference DNA.
|Name||# references||# DNA bases|
|Bacterial genomes (NCBI)||4,693||8,584,324,670|
|Viral genomes (NCBI)||1,750||60,637,755|
|Human Microbiome sequences||1,653,700||1,490,442,185|
|Homo sapiens (Hg19)||3,134||2,844,000,504|