Bioinformatics Online Home

Table 6.5. Databases available on BLAST Web server
Database/Description
A. Peptide Sequence Databases
nr
All non-redundant GenBank CDS translations+RefSeq Proteins+PDB+SwissProt+PIR+PRF
swissprot
Last major release of the SwissProt protein sequence database (no updates)
pat
Proteins from the Patent division of GenPept
yeast
Yeast (Saccharomyces cerevisiae) genomic CDS translations
ecoli
Escherichia coli genomic CDS translations
pdb
Sequences derived from three-dimensional structures in the Brookhaven Protein Data Bank
Drosophila genome
Drosophila genome proteins provided by Celera and Berkeley Drosophila Genome Project (BDGP)
month
All new or revised GenBank CDS translations+PDB+SwissProt+PIR+PRF released in the last 30 days
B. Nucleotide Sequence Databases
nr
All GenBank+RefSeq Nucleotides+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, or phase 0, 1, or 2 HTGS sequences); no longer "non-redundant"
est
Database of GenBank+EMBL+DDBJ sequences from EST Divisions
est_human
Human subset of GenBank+EMBL+DDBJ sequences from EST Divisions
est_mouse
Mouse subset of GenBank+EMBL+DDBJ sequences from EST Divisions
est_others
Non-mouse, non-human subset of GenBank+EMBL+DDBJ sequences from EST Divisions
gss
Genome survey sequences, including single-pass genomic data, exon-trapped sequences, and Alu PCR sequences
htgs
Unfinished high-throughput genomic sequences: phase 0, 1, and 2 (finished, phase 3 HTG sequences are in nr)
pat
Nucleotides from the Patent division of GenBank
yeast
Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences
mito
Database of mitochondrial sequences
vector
Vector subset of GenBank(R), NCBI, in ftp://ftp.ncbi.nih.gov/blast/db
E. coli
Escherichia coli genomic nucleotide sequences
pdb
Sequences derived from three-dimensional structures in the Brookhaven Protein Data Bank
Drosophila genome
Drosophila genome provided by Celera and Berkeley Drosophila Genome Project (BDGP)
month
All new or revised GenBank+EMBL+DDBJ+PDB sequences released in the last 30 days
alu
Select Alu repeats from REPBASE, suitable for masking Alu repeats from query sequences. It is available by anonymous FTP from ftp.ncbi.nih.gov (under the /pub/jmc/alu directory). See "Alu alert" by Claverie and Makalowski (1994).
dbsts
Database of GenBank+EMBL+DDBJ sequences from STS Divisions
chromosome
Searches complete genomes, complete chromosomes, or contigs from the NCBI Reference Sequence project
C. Human Genome Blast Databases
genome
Human genomic contig sequences with NT_#### accessions
mrna
Human RefSeq mRNA with NM_#### or XM_#### accessions
protein
Human RefSeq proteins with NP_#### or XP_#### accessions
gscan mrna
Predicted mRNA sequences generated by running GenomeScan program on human genomic contigs
gscan protein
CDS translations from gscan mrna set
D. CDD Search
        Compares protein sequences to the Conserved Domain Database (CDD). The CDD is a collection of functional and/or structural domains derived from two popular collections, SMART and Pfam, plus contributions from colleagues at NCBI. For more information, see the CDD homepage.
        Source: http://www.ncbi.nlm.nih.gov/blast/html/blastcgihelp.html#protein_databases
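
Several rows of the table imply simple selection rules: the Human Genome BLAST databases are distinguished by accession prefix (NT_, NM_/XM_, NP_/XP_), the EST database is split into human, mouse, and other subsets, and HTG sequences are placed in htgs or nr according to phase. The sketch below encodes only those rules as stated in the table. The function and variable names are hypothetical, the organism spellings it accepts are an assumption, and nothing here calls BLAST or any NCBI service.

```python
# Illustrative sketch only: helper names are hypothetical and are not part
# of BLAST or any NCBI tool. The rules themselves come from Table 6.5.

# Section C: accession prefixes used by the Human Genome BLAST databases.
REFSEQ_PREFIX_TO_DB = {
    "NT_": "genome",   # human genomic contig sequences
    "NM_": "mrna",     # human RefSeq mRNA
    "XM_": "mrna",     # human RefSeq mRNA
    "NP_": "protein",  # human RefSeq proteins
    "XP_": "protein",  # human RefSeq proteins
}


def human_genome_db_for(accession):
    """Return the section C database whose entries use this accession
    prefix, or None if the prefix is not listed in the table."""
    prefix = accession.strip().upper()[:3]
    return REFSEQ_PREFIX_TO_DB.get(prefix)


def est_database_for(organism):
    """Pick the EST subset for an organism (section B).

    Assumption: the organism is given as a common or scientific name.
    Anything that is not human or mouse falls into est_others.
    """
    name = organism.strip().lower()
    if name in ("human", "homo sapiens"):
        return "est_human"
    if name in ("mouse", "mus musculus"):
        return "est_mouse"
    return "est_others"


def nucleotide_db_for_htg_phase(phase):
    """Return the nucleotide database holding HTG sequences of a phase.

    Per the table, unfinished phase 0, 1, and 2 sequences are in htgs,
    while finished phase 3 sequences are in nr.
    """
    if phase in (0, 1, 2):
        return "htgs"
    if phase == 3:
        return "nr"
    raise ValueError("HTG phase must be 0, 1, 2, or 3")


if __name__ == "__main__":
    # The accession below is a made-up placeholder, not a real record.
    print(human_genome_db_for("XM_0000"))
    print(est_database_for("mouse"))
    print(nucleotide_db_for_htg_phase(3))
```

Note that names such as nr, pat, pdb, and month appear in both the peptide and nucleotide lists, so any real lookup would also need to record which sequence type is being searched.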

 

© 2004 by Cold Spring Harbor Laboratory Press. All rights reserved.
No part of these pages, either text or image, may be used for any purpose other than personal use. Therefore, reproduction, modification, storage in a retrieval system, or retransmission, in any form or by any means, electronic, mechanical, or otherwise, for reasons other than personal use, is strictly prohibited without prior written permission.

 

 