AACR Project GENIE: Data

The first set of cancer genomic data aggregated through AACR Project Genomics Evidence Neoplasia Information Exchange (GENIE) was made available to the global community in January 2017. The eighth data set, GENIE 8.0-public, was released in July 2020. A patch to the GENIE 8.0-public release, GENIE 8.1-public, was subsequently released on November 6, 2020. The combined data set now includes more than 95,900 de-identified genomic records collected from patients who were treated at the consortium’s participating institutions, making it among the largest fully public cancer genomic data sets released to date. These data will be released to the public every six months. The public release of the ninth data set, GENIE 9.0-public, will take place in January 2021.

The combined data set now includes data from 100 major cancer types, including data from more than 14,000 patients with lung cancer, nearly 12,000 patients with breast cancer, and nearly 9,500 patients with colorectal cancer.

For more details about the data, analyses, and summaries of the data attributes from this release, GENIE 8.0-public, consult the data guide. Newly added: Notes regarding our new data release can be found here.

Users can access the data directly via cBioPortal, or download the data directly from Sage Bionetworks. This handy cBioPortal tutorial will help you navigate through the site. Users will need to create an account for either site and agree to the terms of access.

For frequently asked questions, visit our FAQ page.

Date Updated: 11/09/2020