#############################################################
README for
ftp://ncbi.nlm.nih.gov/refseq/release/

Last updated: February 2, 2015
#############################################################

National Center for Biotechnology Information (NCBI)
National Library of Medicine
National Institutes of Health
8600 Rockville Pike
Bethesda, MD 20894, USA
tel: (301) 496-2475
fax: (301) 480-9241
e-mail: info@ncbi.nlm.nih.gov
_________________________________________________________________________

Updates to this document:

February 2, 2015    updated list of directories (removed microbial,
                    added archaea and bacteria)
July 24, 2003       fixed typos [last updated date; ftp URL]
November 4, 2003    Release 2; new archive subdirectories added for
                    release notes, statistics, and the catalog
January 26, 2004    Release 3 installed to the FTP site;
                    new files provided documenting release files
                    (sequences) and records removed since previous release
                      /release/catalog/release3.removed.records
                      /release/catalog/release3.files.installed
April 2, 2004       removed references to specific release numbers and dates
July 12, 2004       RefSeq release 6 available for FTP
                    Release 6 includes data available on July 5, 2004
September 16, 2004  RefSeq release 7 available for FTP
                    Release 7 includes data available on September 12, 2004
_________________________________________________________________________

Background:
-----------
The NCBI RefSeq project is an ongoing effort to provide a curated,
non-redundant collection of reference sequences, representative of the
central dogma, for each major organism.

This first release includes all of the sequence data that we have
collected at this time. Although the RefSeq collection is not yet
complete, its value as a non-redundant dataset has reached a level that
justifies providing full releases. The full release incorporates genomic,
transcript, and protein data available at the time of each release.

RefSeq Release:
---------------
The RefSeq Release is available by anonymous FTP at:

   ftp://ftp.ncbi.nih.gov/refseq/release/

Release notes documenting the content, scope, organization, size, and
format of the release are provided at:

   ftp://ftp.ncbi.nih.gov/refseq/release/release-notes/

Sequence data is available in the following directories:

   ftp://ftp.ncbi.nih.gov/refseq/release/archaea/
   ftp://ftp.ncbi.nih.gov/refseq/release/bacteria/
   ftp://ftp.ncbi.nih.gov/refseq/release/complete/
   ftp://ftp.ncbi.nih.gov/refseq/release/fungi/
   ftp://ftp.ncbi.nih.gov/refseq/release/invertebrate/
   ftp://ftp.ncbi.nih.gov/refseq/release/mitochondrion/
   ftp://ftp.ncbi.nih.gov/refseq/release/plant/
   ftp://ftp.ncbi.nih.gov/refseq/release/plasmid/
   ftp://ftp.ncbi.nih.gov/refseq/release/plastid/
   ftp://ftp.ncbi.nih.gov/refseq/release/protozoa/
   ftp://ftp.ncbi.nih.gov/refseq/release/vertebrate_mammalian/
   ftp://ftp.ncbi.nih.gov/refseq/release/vertebrate_other/
   ftp://ftp.ncbi.nih.gov/refseq/release/viral/

Documentation on the contents of the release is available at:

   ftp://ftp.ncbi.nih.gov/refseq/release/release-catalog/

and includes:

   listing of all sequence files installed for FTP, and the md5checksum
   catalog of all accessions included in the release
   catalog of accessions that were removed since the last release
   information on added or updated taxons
   accession-GeneID mapping file
   Nonredundant (autonomous) protein-Nucleotide and -Taxon mapping files

Release statistics are available at:

   ftp://ftp.ncbi.nih.gov/refseq/release/release-statistics/

Archival data for prior releases is maintained in subdirectories
named 'archive'; archival data maintenance is limited to:

   /release-catalog/archive/
   /release-statistics/archive/
   /release-notes/archive/

Future Release plans:
---------------------
We plan to provide future RefSeq releases in odd-numbered months,
namely: January, March, May, July, September, November.

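Example: checking downloaded files against the md5checksum listing
-------------------------------------------------------------------
The release-catalog documentation described above includes an md5checksum
listing for the installed sequence files. The Python sketch below shows one
way such a listing might be used to check files after download. It is
illustrative only: this README does not specify the layout of the checksum
file, so the code ASSUMES one "<md5> <filename>" pair per line (the layout
produced by the common md5sum tool). The function names are hypothetical
and are not part of any NCBI tool.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_checksum_listing(text):
    """Parse '<md5> <filename>' lines into {filename: md5}.

    ASSUMPTION: the listing uses the md5sum layout. Lines that do not
    split into exactly two fields are skipped.
    """
    expected = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue
        checksum, name = parts
        # md5sum may prefix names with '*'; keep only the base file name.
        expected[Path(name.lstrip("*")).name] = checksum.lower()
    return expected


def verify_directory(directory, listing_text):
    """Return {filename: 'ok' | 'mismatch' | 'missing'} for each listed file."""
    results = {}
    for name, checksum in parse_checksum_listing(listing_text).items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == checksum:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

A caller would read the downloaded checksum listing as text and pass it,
together with the local download directory, to verify_directory(); any
file reported as 'mismatch' or 'missing' is a candidate for re-download.
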
Additional Information:
-----------------------
Additional information about the RefSeq project is available at:

1. The NCBI RefSeq Web Site:
   http://www.ncbi.nlm.nih.gov/RefSeq/

2. The NCBI Handbook
   The Reference Sequence (RefSeq) Project.
   Available from: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Books

Please send questions, comments, and suggestions concerning the
RefSeq release or the RefSeq project to:

   info@ncbi.nlm.nih.gov