Schema for Vega Pseudogenes - Vega Annotated Pseudogenes and Immunoglobulin Segments
Database: hg19    Primary Table: vegaPseudoGene    Row Count: 11,590
Format description: A gene prediction with some additional info.
fieldexampleSQL type info description
bin 585smallint(5) unsigned range Indexing field to speed chromosome range queries.
name OTTHUMT00000362751varchar(255) values Name of gene (usually transcript_id from GTF)
chrom chr1varchar(255) values Reference sequence chromosome or scaffold
strand +char(1) values + or - for strand
txStart 11868int(10) unsigned range Transcription start position
txEnd 14409int(10) unsigned range Transcription end position
cdsStart 14409int(10) unsigned range Coding region start
cdsEnd 14409int(10) unsigned range Coding region end
exonCount 3int(10) unsigned range Number of exons
exonStarts 11868,12612,13220,longblob   Exon start positions
exonEnds 12227,12721,14409,longblob   Exon end positions
score 0int(11) range  
name2 RP11-34P13.1varchar(255) values Alternate name (e.g. gene_id from GTF)
cdsStartStat noneenum('none','unk','incmpl','cmpl') values enum('none','unk','incmpl','cmpl')
cdsEndStat noneenum('none','unk','incmpl','cmpl') values enum('none','unk','incmpl','cmpl')
exonFrames -1,-1,-1,longblob   Exon frame {0,1,2}, or -1 if no frame for exon
Connected Tables and Joining Fields
      hg19.vegaGene.name (via vegaPseudoGene.name)
      hg19.vegaGtp.transcript (via vegaPseudoGene.name)
      hg19.vegaPep.name (via vegaPseudoGene.name)
Sample Rows
binnamechromstrandtxStarttxEndcdsStartcdsEndexonCountexonStartsexonEndsscorename2cdsStartStatcdsEndStatexonFrames
585OTTHUMT00000362751chr1+11868144091440914409311868,12612,13220,12227,12721,14409,0RP11-34P13.1nonenone-1,-1,-1,
585OTTHUMT00000002844chr1+12009136701367013670612009,12178,12612,12974,13220,13452,12057,12227,12697,13052,13374,13670,0RP11-34P13.1nonenone-1,-1,-1,-1,-1,-1,
585OTTHUMT00000002839chr1-144032957029570295701114403,15004,15795,16606,16857,17232,17605,17914,18267,24737,29533,14501,15038,15947,16765,17055,17368,17742,18061,18366,24891,29570,0RP11-34P13.2nonenone-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,-1,
585OTTHUMT00000003224chr1+62947638876388763887162947,63887,0OR4G11Pnonenone-1,
586OTTHUMT00000003691chr1+1311031339231339231339231131103,133923,0RP11-34P13.10nonenone-1,
586OTTHUMT00000007034chr1-1352461380391380391380392135246,137568,136006,138039,0RP11-34P13.11nonenone-1,-1,
586OTTHUMT00000007241chr1-2283182287752287752287751228318,228775,0AP006222.1nonenone-1,
587OTTHUMT00000008000chr1+3260953281123281123281122326095,327347,326569,328112,0RP4-669L17.8nonenone-1,-1,
587OTTHUMT00000127609chr1-3294253322433322433322431329425,332243,0RP4-669L17.9nonenone-1,
587OTTHUMT00000007996chr1+3297833342713342713342712329783,334128,329976,334271,0RP4-669L17.4nonenone-1,-1,

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Vega Pseudogenes (vegaPseudoGene) Track Description

Description and Methods

This track shows gene annotations from the Vertebrate Genome Annotation (Vega) database. Annotations are divided into two subtracks from the Vega Human Genome Annotation project:

The following information is an excerpt from the Vertebrate Genome Annotation home page:

"The Vega database is designed to be a central repository for high-quality, frequently updated manual annotation of different vertebrate finished genome sequence. Vega attempts to present consistent high-quality curation of the published chromosome sequences. Finished genomic sequence is analysed on a clone-by-clone basis using a combination of similarity searches against DNA and protein databases as well as a series of ab initio gene predictions (GENSCAN, Fgenes). The annotation is based on supporting evidence only."

"In addition, comparative analysis using vertebrate datasets such as the Riken mouse cDNAs and Genoscope Tetraodon nigroviridis Ecores (Evolutionary Conserved Regions) are used for novel gene discovery."

Display Conventions and Configuration

This track follows the display conventions for gene prediction tracks. Transcript type (and other details) may be found by clicking on the transcript identifier which forms the outside link to the Vega transcript details page. Further information on the gene and transcript classification may be found here.

Credits

Thanks to Steve Trevanion at the Wellcome Trust Sanger Institute for providing the GTF and FASTA files for the Vega annotations. Vega acknowledgements and publications are listed here.