AACR Project GENIE: Data

The first set of cancer genomic data aggregated through AACR Project Genomics Evidence Neoplasia Information Exchange (GENIE) was available to the global community in January 2017.  The sixth data set, GENIE 6.0-public, was released on July 2019. A patch to the GENIE 6.0-public release, GENIE 6.1-public, was subsequently released on July 13, 2019.  The combined data set now includes nearly 70,000 de-identified genomic records collected from patients who were treated at each of the consortium's participating institutions, making it among the largest fully public cancer genomic data sets released to date.

These data will be released to the public every six months. The public release of the seventh data set, GENIE 7.0-public, will take place in January, 2020.    

The combined data set now includes data for nearly 80 major cancer types, including data from nearly 11,000 patients with lung cancer, greater than 9,700 patients with breast cancer, and nearly 7,000 patients with colorectal cancer. 

For the use of the data, consult the data guide.  

Users can access the data directly via cbioportal, or download the data directly from Sage Bionetworks. Users will need to create an account for either site and agree to the terms of access.

For frequently asked questions, visit our FAQ page

Date Updated: 7/13/19