​​​​​​​​​​​​​​​​AACR Project GENIE: Data

The first set of cancer genomic data aggregated through AACR Project Genomics Evidence Neoplasia Information Exchange (GENIE) was available to the global community in January 2017.  The third data set, GENIE 3.0, was released in January 2018 adding more than 7,500 records to the database. The combined data set now includes more than 39,000 de-identified genomic records collected from patients who were treated at each of the consortium's participating institutions, making it among the largest fully public cancer genomic data sets released to date.  These data will be released to the public every six months. The public release of the fourth data set, GENIE 4.0-public, will take place in July, 2018.    

The combined data set now includes data for over 60 major cancer types, including data from greater than 6,000 patients with lung cancer, nearly 4,500 patients with breast cancer, and more than 4,300 patients with colorectal cancer . 

For more details about the data, analyses and summaries of the data attibutes can be visualized here. For the use of the data,  consult the data guide

Users can access the data directly via cbioportal, or download the data directly from Sage Bionetworks. Users will need to create an account for either site and agree to the terms of access.

For frequently asked questions, visit our FAQ page

Date Updated: 4/17/18