Ensembl Tree Shrew

 

Explore the Tupaia belangeri genome

Example Data Points

This release of Tupaia belangeri data is assembled into scaffolds, so there are no chromosomes available to browse.

About the Tupaia belangeri genome

Assembly

This is the first release of the low-coverage 2X assembly of the northern treeshrew (Tupaia belangeri). The genome sequencing and assembly were provided by the Broad Institute.

The N50 size is the length such that 50% of the assembled genome lies in blocks of that size or longer. The N50 is 88.86 kb for supercontigs and 2.97 kb for contigs. The total number of bases is 3.66 Gb in supercontigs and 2.14 Gb in contigs.
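The N50 definition above can be illustrated with a minimal Python sketch. The function name `n50` and the toy block lengths are assumptions made for illustration only; they are not taken from the tupBel1 assembly or from Ensembl code.

```python
def n50(lengths):
    """Return the N50 of a collection of block lengths (contigs or supercontigs).

    The N50 is the largest length L such that blocks of length >= L
    together contain at least half of all assembled bases.
    """
    if not lengths:
        raise ValueError("at least one block length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest block downwards until half the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy example (made-up lengths): total = 54, so half = 27.
# 10 + 9 + 8 = 27 reaches the halfway point, so the N50 is 8.
toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
toy_n50 = n50(toy_lengths)
```

Sorting in descending order and accumulating until half of the total is reached follows the definition directly; for a real assembly the input would be the list of all contig or supercontig lengths.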

Annotation

Owing to the fragmentary nature of this preliminary assembly, it was necessary to arrange some scaffolds into "gene-scaffold" super-structures in order to present complete genes. There are 6153 such gene-scaffolds, which together comprise 1.58 Gb and have identifiers of the form "GeneScaffold_1".

Mammalian Genome Project

Tupaia belangeri is one of 16 mammals that will be sequenced as part of the Mammalian Genome Project, funded by the National Institutes of Health (NIH). The species were chosen to maximise the branch length of the evolutionary tree while representing the diversity of mammalian species. Low-coverage 2X assemblies will be produced for these mammals and used in alignments for cross-species comparison. The aim is to increase our understanding of functional elements, especially in the human genome.

What's New in Ensembl 49

Tupaia belangeri News

  • Minor updates

    A number of minor changes have been made to the core databases.

General News

  • Release schedule

    Ensembl release 50 will occur in July (rather than in April as originally scheduled).

  • API changes - regulatory features
    Regulatory feature support has been moved from the core API to the functional genomics API.
  • Removal of viral genes
    We have removed a total of about 1200 viral genes from a number of species.
  • Mart updates
    RGD and SGD Symbol+ID combinations have been introduced, similar to HGNC, MGI and ZFIN. The issue with subsets of homologs being returned has been resolved.

Statistics

Assembly: tupBel1, Jun 2006
Genebuild: Ensembl, Oct 2006
Database version: 49.1d
Known protein-coding genes: 12
Projected protein-coding genes: 13,098
Novel protein-coding genes: 2,348
Pseudogenes: 2,313
RNA genes: 2,057
Genscan gene predictions: 101,619
Gene exons: 0
Gene transcripts: 0
Base Pairs: 2,137,225,476
Golden Path Length: 3,670,324,638
Most common InterPro domains: Top 40 Top 500
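As a rough worked calculation, the two length figures above can be compared to estimate how much of the assembly is gap sequence. This sketch assumes that "Base Pairs" counts sequenced bases in contigs and that "Golden Path Length" is the total span of the top-level scaffolds including gaps, which is consistent with the contig and supercontig totals quoted in the Assembly section. The variable names are illustrative only.

```python
# Figures copied from the Statistics list.
base_pairs = 2_137_225_476          # sequenced bases (contigs)
golden_path_length = 3_670_324_638  # scaffold span, assumed to include gaps

# Bases in the scaffold span that are not covered by sequenced contigs.
gap_bases = golden_path_length - base_pairs

# Fraction of the scaffold span that is gap, under the assumption above.
gap_fraction = gap_bases / golden_path_length

print(f"Gap bases: {gap_bases:,}")
print(f"Gap fraction: {gap_fraction:.1%}")
```

Under these assumptions, a little over two fifths of the scaffold span is gap, which is in keeping with a low-coverage 2X assembly.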


© 2008 WTSI / EBI. Ensembl is available to download for public use - please see the code licence for details.

                
Ensembl release 49 - Mar 2008