CoderData Cancer Omics and Drug Experiment Response Data (`coderdata`) Python Package

Introduction

CoderData is a cancer benchmark data package developed in Python and R. There are two aspects of this package, the backend build section and the user facing python package. The build section is a github workflow that generates four cancer datasets in a format that is easy for users and algorithms to ingest. The python package allows users to easily download the data, load it into python and reformat it as desired.

CPTAC Summary

The Clinical Proteomic Tumor Analysis Consortium (CPTAC) project is a collaborative network funded by the National Cancer Institute (NCI) focused on improving our understanding of cancer biology through the integration of transcriptomic, proteomic, and genomic data.

Dataset Unique_Entrez_IDs Unique_Sample_IDs
Transcriptomics 19354 1113
Proteomics 15015 1086
Copy_number 19359 1024
Mutations 18866 833

Visualization

CPTAC Figure
Cell Line Circos