CoderData¶

Usage
API Reference
Datasets
Tutorials
Contributing

Introduction¶
There is a recent explosion of deep learning algorithms that exist to tackle the computational problem of predicting drug treatment outcome from baseline molecular measurements. To support this, we have built a benchmark dataset that harmonizes diverse datasets to better assess algorithm performance. Cancer Omics and Drug Experiment Response Data (CoderData) is a comprehensive package designed for handling cancer benchmark data in Python. This package collects diverse sets of paired molecular datasets with corresponding drug sensitivity data. All data here is reprocessed and standardized so it can be easily used as a benchmark dataset for AI learning algorithms.
CoderData provides functionality to manage datasets in a flexible way. Users can download datasets, load them into computing environments, and reformat them according to their specific needs. It gathers and restructures 16 datasets from multiple databases, harmonizing dataset structure and format, therefore making the dataset accessible for both users and algorithms. Curation of molecular datasets with drug sensitivity data that is AI ready allows researchers to develop advanced predictive models. By simplifying data utility and ensuring readiness for AI applications, CoderData minimizes the barriers with benchmarking models for cancer treatment prediction.
Install¶
CoderData is a Python package that can be installed in any Python environment and utilized in any Integrated Development Environment (IDE) for data processing and manipulation.
coderdata
requires python>=3.9
to be installed.
$ python --version
Python 3.13.1
If a Python version older than 3.9 is installed please refer to the instruction at python.org on how to install / update Python.
The preferred way to install coderdata
is via pip
.
Executing the command below will install the most recent published version of coderdata
including all required dependencies.
$ pip install coderdata
To check if the package has been successfully installed open an interactive python terminal and import the package.
>>> import coderdata as cd
>>> cd.__version__
'2.1.0'