SNP or sequencing error?

Single nucleotide polymorphisms (SNPs) are responsible for much of the genetic variation within our species. This tutorial demonstrates how to find information about SNPs in different databases and some points to consider when deciding whether a discrepant base might be a SNP or an experimental error.

Programs and databases shown in this tutorial are from the National Center of Biotechnology Information (NCBI), the National Cancer Institute (NCI), and Geospiza, Inc.

The first section of this tutorial shows how sequence analysis programs, such as phred and phrap, can be used to measure sequence quality and distinguish potential SNPs from potential errors. The second part shows three ways to determine if a SNP has already been found. Lastly, example sequences are provided from contigs that can be used for SNP hunting.

1. Identity potential SNPs by using information about sequence quality. In this example, trace files were loaded into Geospiza's Finch®-Server, analyzed by Phred, and assembled with Phrap as part of the Finch-Server pipeline.

2. Determine if a SNP has been previously identified by using blastn to compare a sequence to one of three different databases.

A. GenBank (the non-redundant nucleotide sequences)

B. dbSNP (NCBI's SNP database)

C. The NCI CGAP SNP index
3. Sequences to test:

Gene sequence Clinical information (links to OMIM) Phrap Data
COMT Catechol-o-methyltransferase COMT-DQV plot
F13A1 Coagulation Factor VIII, A1 peptide F13A1-DQV plot
HBB Hemoglobin beta chain HBB-DQV plot
LIPC Hepatic Lipase LIPC-DQV plot

4. Web sites used here:
  • BLAST (http://www.ncbi.nlm.nih.gov/BLAST)
  • BLAST dbSNP(http://www.ncbi.nlm.nih.gov/SNP/snpblastByChr.html)
  • CGAP SNP index (http://lpgws.nci.nih.gov:82/perl/snpbr)
  • Additional SNP information can be found at The SNP Consortium site at Cold Spring Harbor

Before you begin, here are some helpful hints:

  • These pages will only work with Netscape versions 4.0 or later and Internet Explorer versions 4.0 and later.

  • In most slides the green arrows point to the link that you should select when doing this activity on your own. Click the green arrows (or purple buttons) to move to the next slide.

  • To see information repeated, reload (or refresh) the page, or return to a previous slide by using the "Back" button on your browser.

  • Most of the links shown on the following pages are not active.

  • It's helpful to have two web browser windows open at the same time so that you can view the tutorial in one window and view the live site in the other. If you're not sure how to do this, use the links below to see a demonstration.

    This series is not meant as a comprehensive description of all available software, merely a quick guide to help you get started.

    Click on the green arrow to begin

    Copyright Geospiza, Inc.


  • FinchTV
    View trace and quality data from chromatograms
    Get FinchTV!
    Exploring DNA Structure Learn about DNA in 3-D.
    Exploring DNA Structure book and CD-ROM
    Molecular Apparel
    DNA T-shirts and more!
    Cool T shirts with molecular structures of DNA, RNA, and proteins

        

        

    Funding for this project was provided by the National Science Foundation's Course, Curriculum, and Laboratory Improvement Program under grants DUE-0088153 and DUE-0127599


    Home  Copyright © Geospiza, Inc. All rights reserved.