Schema for kgTxInfo
Database: hg19    Primary Table: kgTxInfo    Row Count: 77,614
Format description: Various bits of information about a transcript from the txGraph/txCds system (aka KG3)
fieldexampleSQL type info description
name uc001aaa.3varchar(255) values Name of transcript
category nearCodingvarchar(255) values coding/nearCoding/noncoding for now
sourceAcc BC032353.1varchar(255) values Accession of genbank transcript patterned on (may be refSeq)
isRefSeq 0tinyint(3) unsigned range Is a refSeq
sourceSize 1673int(11) range Number of bases in source, excluding poly-A tail.
aliCoverage 0.979677double range Fraction of bases in source aligning.
aliIdRatio 0.995119double range matching/total bases in alignment
genoMapCount 2int(11) range Number of times source aligns in genome.
exonCount 3int(11) range Number of exons (excludes gaps from frame shift/stops)
orfSize 0int(11) range Size of ORF
cdsScore 345.5double range Score of best CDS according to txCdsPredict
startComplete 0tinyint(3) unsigned range Starts with ATG
endComplete 0tinyint(3) unsigned range Ends with stop codon
nonsenseMediatedDecay 0tinyint(3) unsigned range If true, is a nonsense mediated decay candidate.
retainedIntron 0tinyint(3) unsigned range True if has a retained intron compared to overlapping transcripts
bleedIntoIntron 0int(11) range If nonzero number of bases start or end of tx bleeds into intron
strangeSplice 0int(11) range Count of splice sites not gt/ag, gc/ag, or at/ac
atacIntrons 0int(11) range Count of number of at/ac introns
cdsSingleInIntron 0tinyint(3) unsigned range True if CDS is single exon and in intron of other transcript.
cdsSingleInUtr3 0tinyint(3) unsigned range True if CDS is single exon and in 3' UTR of other transcript.
selenocysteine 0tinyint(3) unsigned range If true TGA codes for selenocysteine
genomicFrameShift 0tinyint(3) unsigned range True if genomic version has frame shift we cut out
genomicStop 0tinyint(3) unsigned range True if genomic version has stop codon we cut out
Connected Tables and Joining Fields
      hg19.bioCycPathway.kgID (via kgTxInfo.name)
      hg19.ccdsKgMap.geneId (via kgTxInfo.name)
      hg19.ceBlastTab.query (via kgTxInfo.name)
      hg19.dmBlastTab.query (via kgTxInfo.name)
      hg19.drBlastTab.query (via kgTxInfo.name)
      hg19.foldUtr3.name (via kgTxInfo.name)
      hg19.foldUtr5.name (via kgTxInfo.name)
      hg19.gnfAtlas2Distance.query (via kgTxInfo.name)
      hg19.gnfAtlas2Distance.target (via kgTxInfo.name)
      hg19.humanHprdP2P.query (via kgTxInfo.name)
      hg19.humanHprdP2P.target (via kgTxInfo.name)
      hg19.humanVidalP2P.query (via kgTxInfo.name)
      hg19.humanVidalP2P.target (via kgTxInfo.name)
      hg19.humanWankerP2P.query (via kgTxInfo.name)
      hg19.humanWankerP2P.target (via kgTxInfo.name)
      hg19.keggPathway.kgID (via kgTxInfo.name)
      hg19.kg4ToKg5.newId (via kgTxInfo.name)
      hg19.kgAlias.kgID (via kgTxInfo.name)
      hg19.kgColor.kgID (via kgTxInfo.name)
      hg19.kgProtAlias.kgID (via kgTxInfo.name)
      hg19.kgProtMap2.qName (via kgTxInfo.name)
      hg19.kgSpAlias.kgID (via kgTxInfo.name)
      hg19.kgTargetAli.qName (via kgTxInfo.name)
      hg19.kgXref.kgID (via kgTxInfo.name)
      hg19.knownBlastTab.query (via kgTxInfo.name)
      hg19.knownBlastTab.target (via kgTxInfo.name)
      hg19.knownCanonical.protein (via kgTxInfo.name)
      hg19.knownCanonical.transcript (via kgTxInfo.name)
      hg19.knownGene.name (via kgTxInfo.name)
      hg19.knownGeneMrna.name (via kgTxInfo.name)
      hg19.knownGenePep.name (via kgTxInfo.name)
      hg19.knownIsoforms.transcript (via kgTxInfo.name)
      hg19.knownToAllenBrain.name (via kgTxInfo.name)
      hg19.knownToEnsembl.name (via kgTxInfo.name)
      hg19.knownToGnfAtlas2.name (via kgTxInfo.name)
      hg19.knownToHprd.name (via kgTxInfo.name)
      hg19.knownToLocusLink.name (via kgTxInfo.name)
      hg19.knownToPfam.name (via kgTxInfo.name)
      hg19.knownToRefSeq.name (via kgTxInfo.name)
      hg19.knownToSuper.gene (via kgTxInfo.name)
      hg19.knownToTreefam.name (via kgTxInfo.name)
      hg19.knownToU133.name (via kgTxInfo.name)
      hg19.knownToVisiGene.name (via kgTxInfo.name)
      hg19.mmBlastTab.query (via kgTxInfo.name)
      hg19.rnBlastTab.query (via kgTxInfo.name)
      hg19.scBlastTab.query (via kgTxInfo.name)
Sample Rows
namecategorysourceAccisRefSeqsourceSizealiCoveragealiIdRatiogenoMapCountexonCountorfSizecdsScorestartCompleteendCompletenonsenseMediatedDecayretainedIntronbleedIntoIntronstrangeSpliceatacIntronscdsSingleInIntroncdsSingleInUtr3selenocysteinegenomicFrameShiftgenomicStop
uc001aaa.3nearCodingBC032353.1016730.9796770.995119230345.5000000000000
uc010nxq.1codingAM992880.10148810.99798423402671.833110000000000
uc010nxr.1nearCodingAM992878.10154510.992233230270.5000000000000
uc001aal.1codingNM_001005484.1191811119181577110000000000
uc001aaq.1noncodingDQ597235.10270.92592611210151000000000000
uc001aar.1noncodingDQ599768.10320.7513210142.5000000000000
uc001aav.3nearCodingNR_028327.1042730.9997660.996255340766111000000000
uc001aau.2codingNR_028325.1043700.9997710.9990842311701166110116420000000
uc009vjk.2codingAK293878.1014460.9965420.9923664311701358110000000000
uc010nxu.1codingNM_001005277.119390.997871319361710100000000000

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.