In This Section

 

BPC PANC 1.0 – Public 

The GENIE BPC PANC v1.0-public dataset contains 1,109 PANC patients from 4 institutions: MSK, DFCI, UHN and VICC.  

Data Access: 

  • A subset of the data is available through cBioPortal Both the genomic and clinical (phenomic) data can be evaluated in cBioPortal with opportunities for data exploration and visualization using a user-friendly interface.  
  • The complete, post-processed data are available on Synapse.   

See an overview of the data set.

What is included in GENIE BPC data?  

  • Genomic Data: Clinical-grade next-generation sequencing data for each patient from the GENIE Registry. Genomic profiling was performed between 2013 and 2018; patients were aged 24-88 at the time of genomic sequencing. 
  • Cancer Diagnosis: Pancreatic cancer diagnosis is considered the index tumor for this patient cohort. There are data about other cancer diagnoses antecedent and subsequent to the pancreatic cancer. 
  • Pancreatic cancer-specific fields: includes tumor resectability status (Resectable, Borderline Resectable, Unresectable/Locally Advanced or Metastatic) at the time of clinical staging prior to any cancer-directed treatment, were also collected. 
  • Pathologic Information: Each pathology specimen from diagnosis through death or last follow-up is curated with specimen type, site, and histology.  
  • Treatment Histories: All anti-neoplastic systemic therapies–intravenous and oral chemotherapies–are included in the data set. Dates are provided as intervals from diagnosis to start and stop of each drug. Investigational drugs are masked; no dosing information is included.  
  • Imaging Information: Each CT, MRI, PET-CT scan from diagnosis through death or last follow-up is curated for the presence or absence of cancer and an evaluation of whether the cancer was stable, responding, or progressing. These data are used to compute progression-free survival-imaging (PFS-I). Sites of tumor involvement are also recorded.   
  • Medical Oncologist’s Evaluations: Medical oncology notes (1/month) have been curated to ascertain the presence or absence of cancer and whether the cancer was stable, responding, or progressing. These data are used to compute progression-free survival-medonc (PFS-M) from diagnosis through death or date of last follow-up. 
  • Overall Survival: Overall survival is based on death, with censoring at the date last known alive. Ascertainment of death varies by institution.  
  • Additional Relevant Biomarkers: Information about select biomarkers not included on the NGS panels, including PD-L1, and MMR are also curated.  
  • Patient-Reported Outcomes: No patient-reported outcomes are available in this dataset. 
  • Date Masking: Exact dates are masked to preserve confidentiality; however, date intervals are available, allowing calculation of event times such as diagnosis, treatment start, treatment end, PFS-I, PFS-M, and OS 
  • Analytical Data Guide: A more comprehensive overview of the data can be found in the data guide, and a description and location of the variables collected can be found in the variable synopsis spreadsheet.
  • Other Resources: There is a dedicated project wiki that describes each of the files. 
  • Training Videos:  
  • Demo of GENIE Data on the Synapse and cBioPortal Platforms: here 
  • BPC- specific cBioPortal video training playlist: here