​​​​​​​​​​​​​AACR Project GENIE: Data

The first set of cancer genomic data aggregated through AACR Project Genomics Evidence Neoplasia Information Exchange (GENIE) was available to the global community in January 2017. The second data set was released in November 2017 adding more than 13,000 records to the database. The combined data set now includes more than 32,000 de-identified genomic records collected from patients who were treated at each of the consortium's participating ​institutions, making it among the largest fully public cancer genomic data sets released to date.  These data will be continuously updated on a quarterly basis.  

The combined data set now includes data for 59 major cancer types, including data on nearly 5,000 patients with lung cancer, nearly ​4,000 patients with breast cancer, and more than 3,500 patients with colorectal cancer​. 

For more details about the data, and how to use it, consult the data ​guide​​.  Look here to a summary of GENIE data​.

Users can access the data directly via cbioportal, or download the data directly from Sage Bionetworks. Users will need to create an account for either site and agree to the terms of access.

For frequently asked questions, visit our FAQ page.

Date Updated: 11/28/17