The goal of this python package is to provide quick, easy access to breast cancer gene expression data from OSF for analysis.
You can view this python package on its PyPI Site.
Since this is a python package, you must have python installed on your computer. This package calls for Python 3.8 or greater.
To install this package, run the following command in your terminal:
pip install ccbcge
This package is used just like any other Python package. Import the package and then call any of its functions.
import ccbcge as cc
cc.functionCall()
There are four major functions that you can call.
ChooseData()- This allows you see the list of data sets you can access and choose one to download.
- It returns two data sets: the gene expression data, then the meta data for that data set.
ShowDataSets()- This shows the list of data sets you can access.
importData(dataSet)- This takes as input the name of the data set you want to access.
- It returns the gene expression data of that name as a
pandasdata frame.
importMeta(dataSet)- This takes as input the name of the data set you want to access.
- It returns the meta data for that data set as a
pandasdata frame.
The idea behind these functions is that you can access the data frame easily by name if you already know which data set you want to access.
For those who do not know which data sets they want to access, however, they have the option of seeing the data sets and choosing from them the one they want to work with.
If you want to use all of the data sets in a larger analysis, you could write a simple for loop as follows:
for dataSet in dataSets:
geneData = importData(dataSet)
metaData = importMeta(dataSet)
# analyze the data here
After you have downloaded the data into the data frames, you can work with them like any other pandas data frame.
Furthermore, you can write the data to files to keep them for later use without needing to redownload them.
This package requires use of the following packages:
requestpandas
If you do not have them installed already, they will be installed for you.
Help for how to update this package is found in a Developers File
This project is licensed under the Apache 2.0 License - see the LICENSE file for more information.
The idea for this project came from a similar package from the Payne Lab at Brigham Young University, called cptac.
To find out more about that package, visit the CPTAC Github repository
or the CPTAC PyPI package.