NCBI Datasets BETA

NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. Find and download gene, transcript, protein and genome sequences, annotation and metadata.

What's new

NCBI Insights July 14, 2021

Introducing the new NCBI Datasets Genomes page

The updated NCBI Datasets Genomes page now has genome data for all domains of life, including …

NCBI Insights June 22, 2021

June 30 Webinar: Using NCBI Datasets to download sequence and annotation for genomes and genes

Join us on June 30, 2021 at 12 PM Eastern Time to learn how to use the …

NCBI Insights April 20, 2021

New NCBI Datasets home and documentation pages provide easier access

NCBI Datasets, the new set of services for downloading genome assembly and annotation data (previous Datasets …

Genomes

Quickstart

Browse and download genome data using our Genome page. Genome data is also available using our command-line tool and API. Genome data includes genome, transcript and protein sequences, genome annotation and metadata.

Genes

Quickstart

Create a customized Gene table to view and download gene data. Gene data is also available through our command-line tool and API. Gene data includes gene, transcript and protein sequences organized by gene.

Viruses

SARS-CoV-2 genomes

Quickstart

Download SARS-CoV-2 and other coronavirus genome and protein sequences on the web or through our command-line tool and API. Filter by host and release date.

SARS-CoV-2 proteins

Quickstart

Download specific SARS-CoV-2 protein sequences on the web or through our command-line tool and API.

Command-line tools

Quickstart

Retrieve gene, genome and coronavirus data from the command line. The Datasets and Dataformat command-line tools are available for Windows, Mac and Linux systems.
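
As a rough illustration of scripting the command-line tool, the sketch below builds the argument list for a single genome download and runs it only if a `datasets` executable is found on the PATH. The subcommand layout (`download genome accession`) and the `--filename` option are assumptions about the installed version, so check `datasets --help` before relying on them. The helper names `build_genome_download_command` and `run_if_installed` are hypothetical and not part of any NCBI package.

```python
import shutil
import subprocess
from typing import List, Optional


def build_genome_download_command(
    accession: str, filename: str = "ncbi_dataset.zip"
) -> List[str]:
    """Return the argv list for downloading one genome assembly by accession.

    Assumption: the installed tool accepts
    `datasets download genome accession <ACC> --filename <FILE>`.
    """
    accession = accession.strip()
    if not accession:
        raise ValueError("accession must be a non-empty string")
    return [
        "datasets", "download", "genome", "accession",
        accession, "--filename", filename,
    ]


def run_if_installed(command: List[str]) -> Optional[int]:
    """Run the command only when its executable is on PATH.

    Returns the process exit code, or None when the tool is not installed
    (in which case nothing is executed).
    """
    if shutil.which(command[0]) is None:
        return None
    return subprocess.run(command, check=False).returncode


if __name__ == "__main__":
    # GCF_000001405.40 is used purely as an example accession.
    cmd = build_genome_download_command("GCF_000001405.40")
    print(" ".join(cmd))
    exit_code = run_if_installed(cmd)
    if exit_code is None:
        print("datasets tool not found on PATH; command was not run")
```

Building the argument list as a Python list, rather than a single shell string, avoids quoting problems when accessions or file names come from user input.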
