CoderData Cancer Omics and Drug Experiment Response Data (`coderdata`) Python Package

Introduction

CoderData is a cancer benchmark data package developed in Python and R. There are two aspects of this package, the backend build section and the user facing python package. The build section is a github workflow that generates four cancer datasets in a format that is easy for users and algorithms to ingest. The python package allows users to easily download the data, load it into python and reformat it as desired.

Broad Sanger Summary

The Broad Sanger datasets were collected from numerous resources such as the LINCS project, DepMap, and the Sanger Institute. This data will allow scientists to explore the drugs response for thousands of drugs across hundreds of cell lines.

Dataset Unique_Entrez_IDs Unique_Sample_IDs
Transcriptomics 19335 1697
Proteomics 12777 1008
Copy_number 19210 1790
Mutations 20394 1729

Visualization

Broad Sanger Figure
Broad Sanger Circos