AACR Project GENIE: Data

The first set of cancer genomic data aggregated through AACR Project Genomics Evidence Neoplasia Information Exchange (GENIE) was available to the global community in January 2017.  The fifth data set, GENIE 5.0-public, was released in January 2019 adding more than 11,000 records to the database. The combined data set now includes nearly 60,000 de-identified genomic records collected from patients who were treated at each of the consortium's participating institutions, making it among the largest fully public cancer genomic datasets released to date.  These data will be released to the public every six months. The public release of the sixth data set, GENIE 6.0-public, will take place in July 2019.    

The combined data set now includes data for over 80 major cancer types, including data from greater than 9,000 patients with lung cancer, nearly 8,700 patients with breast cancer, and more than 6,000 patients with colorectal cancer.

For more details about the data, analyses and summaries of the data attributes from our previous release, GENIE 4.0-public, can be visualized here. consult the data guide

Users can access the data directly via cbioportal, or download the data directly from Sage Bionetworks. Users will need to create an account for either site and agree to the terms of access.

For frequently asked questions, visit our FAQ page

Date Updated: 3/19/19