SNPs (130) Track Settings

Simple Nucleotide Polymorphisms (dbSNP build 130 - Provisional Mapping to GRCh37)

Display mode:   
Include Chimp state and observed human alleles in name:  
(If enabled, chimp allele is displayed first, then '>', then human alleles).  

On details page, show function and coding differences relative to:
Vega Protein Genes Vega Pseudogenes UCSC Genes CCDS
RefSeq Genes Other RefSeq Ensembl Genes N-SCAN
SGP Genes Geneid Genes Exoniphy

Minimum Average Heterozygosity: 
Maximum Weight:  SNPs with higher weights are less reliable


Filter by Attribute
Check the boxes below to include SNPs with those attributes. In order to be displayed, a SNP must pass the filter for each category. Some assemblies may not contain any SNPs that have some of the listed attributes.

Class:   
Unknown Single Nucleotide Polymorphism In/Del Heterozygous
Microsatellite Named No Variation Mixed
Mnp Insertion Deletion
Validation:   
Unknown By Cluster By Frequency By Submitter
By 2 Hit / 2 Allele By HapMap
Function:   
Unknown Locus Coding - Synonymous Coding - Non-Synonymous
Untranslated Intron Splice Site Reference (coding)
Molecule Type:   
Unknown Genomic cDNA


SNP Feature for Color Specification:  

The selected feature above has the following values below. For each value, a selection of colors is available. If a SNP has more than one of these properties, resulting in more than one color, then the stronger color will override the weaker color. In order from strongest to weakest, the colors are red, green, blue, gray, black.

Unknown    Locus    Coding - Synonymous    Coding - Non-Synonymous    
Untranslated    Intron    Splice Site    Reference (coding)    

View table schema

Data last updated: 2011-12-22

Description

This track contains information about single nucleotide polymorphisms and small insertions and deletions (indels) — collectively Simple Nucleotide Polymorphisms — from a provisional release of dbSNP build 130, available from ftp.ncbi.nih.gov/snp. dbSNP build 130 was officially released with genomic coordinates for the NCBI build 36 human genome; dbSNP provided a provisional release of genomic coordinates mapped from NCBI build 36 to GRCh37 (see ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606/misc/exchange/README.txt).

Interpreting and Configuring the Graphical Display

Variants are shown as single tick marks at most zoom levels. When viewing the track at or near base-level resolution, the displayed width of the SNP corresponds to the width of the variant in the reference sequence. Insertions are indicated by a single tick mark displayed between two nucleotides, single nucleotide polymorphisms are displayed as the width of a single base, and multiple nucleotide variants are represented by a block that spans two or more bases.

The configuration categories reflect the following definitions (not all categories apply to this assembly):

You can configure this track such that the details page displays the function and coding differences relative to particular gene sets. Choose the gene sets from the list on the SNP configuration page displayed beneath this heading: On details page, show function and coding differences relative to. When one or more gene tracks are selected, the SNP details page lists all genes that the SNP hits (or is close to), with the same keywords used in the function category. The function usually agrees with NCBI's function, but can sometimes give a bit more detail (e.g. more detail about how close a near-gene SNP is to a nearby gene).

Insertions/Deletions

dbSNP uses a class called 'in-del'. We compare the length of the reference allele to the length(s) of observed alleles; if the reference allele is shorter than all other observed alleles, we change 'in-del' to 'insertion'. Likewise, if the reference allele is longer than all other observed alleles, we change 'in-del' to 'deletion'.

UCSC Annotations

UCSC checks for several unusual conditions that may indicate a problem with the mapping, and reports them in the Annotations section if found:

Another condition, which does not necessarily imply any problem, is noted:

UCSC Re-alignment of flanking sequences

dbSNP determines the genomic locations of SNPs by aligning their flanking sequences to the genome. UCSC displays SNPs in the locations determined by dbSNP, but does not have access to the alignments on which dbSNP based its mappings. Instead, UCSC re-aligns the flanking sequences to the neighboring genomic sequence for display on SNP details pages. While the recomputed alignments may differ from dbSNP's alignments, they often are informative when UCSC has annotated an unusual condition.

Non-repetitive genomic sequence is shown in upper case like the flanking sequence, and a "|" indicates each match between genomic and flanking bases. Repetitive genomic sequence (annotated by RepeatMasker and/or the Tandem Repeats Finder with period <= 12) is shown in lower case, and matching bases are indicated by a "+".

Data Sources

The data that comprise this track were extracted from database dump files and headers of fasta files downloaded from NCBI. The database dump files were downloaded from ftp://ftp.ncbi.nih.gov/snp/organisms/ organism_tax_id/database/ (e.g. for Human, organism_tax_id = human_9606). The fasta files were downloaded from ftp://ftp.ncbi.nih.gov/snp/organisms/ organism_tax_id/rs_fasta/

Orthologous Alleles (human assemblies only)

Beginning with the March 2006 human assembly, we provide a related table that contains orthologous alleles in the chimpanzee and rhesus macaque assemblies. Beginning with dbSNP build 129, the orangutan assembly is also included. We use our liftOver utility to identify the orthologous alleles. The candidate human SNPs are a filtered list that meet the criteria:

In some cases the orthologous allele is unknown; these are set to 'N'. If a lift was not possible, we set the orthologous allele to '?' and the orthologous start and end position to 0 (zero).

Masked FASTA Files (human assemblies only)

FASTA files that have been modified to use IUPAC ambiguous nucleotide characters at each base covered by a single-base substitution are available for download here. Note that only single-base substitutions (no insertions or deletions) were used to mask the sequence, and these were filtered to exlcude problematic SNPs.

References

Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2001 Jan 1;29(1):308-11.