- `read_data.py` reads in the data.
- `getcox.py` calculates the Cox score of each gene for a given dataset and reports a histogram of the genes' Cox scores.
- `f.py` produces the Cox regression result on the test dataset. The PCA model is built on the train dataset using only the qualified genes, that is, genes whose Cox score is greater than or equal to the given threshold.
- `change_col_names` creates a dictionary that uses the V column names as keys and the probe names as values.
- `sampling_the_data.py` produces 5 balanced train/test folds based on the key features: MRD day 29, age, and WBC at diagnosis.
- `1_gene_expression.md` is a test of R Markdown.
- `project_notebook.ipynb` is a Jupyter notebook showing the read-in dataframe and the Cox score histogram based on the 207 patients.
- `get_5_testfolds.ipynb` is a Jupyter notebook showing how to do stratified resampling to get 5 balanced train/test datasets.
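
The stratified resampling idea behind the balanced folds can be sketched in plain Python. This is a minimal illustration, not the code in `sampling_the_data.py`: the function names `stratified_folds` and `train_test_splits` are hypothetical, and it assumes each patient has already been given one stratum label (for example a tuple of binned MRD day 29, age, and WBC values; how the features are binned is not specified here).

```python
import random
from collections import defaultdict


def stratified_folds(strata, n_folds=5, seed=0):
    """Assign every sample to a fold so that each stratum is spread evenly.

    strata: one hashable label per sample (assumed to be precomputed).
    Returns a list holding the fold id (0..n_folds-1) of each sample.
    """
    rng = random.Random(seed)

    # Group sample indices by their stratum label.
    by_stratum = defaultdict(list)
    for idx, label in enumerate(strata):
        by_stratum[label].append(idx)

    fold_of = [None] * len(strata)
    offset = 0  # carried across strata so overall fold sizes stay balanced
    for label in sorted(by_stratum, key=repr):
        members = by_stratum[label]
        rng.shuffle(members)  # random order within the stratum
        for pos, idx in enumerate(members):
            # Deal the shuffled members into the folds round-robin.
            fold_of[idx] = (offset + pos) % n_folds
        offset += len(members)
    return fold_of


def train_test_splits(fold_of, n_folds=5):
    """Yield (train_indices, test_indices), using each fold once as test."""
    for k in range(n_folds):
        test = [i for i, f in enumerate(fold_of) if f == k]
        train = [i for i, f in enumerate(fold_of) if f != k]
        yield train, test
```

Because the members of a stratum are dealt out round-robin, the number of samples a stratum contributes to any two folds differs by at most one, which is what keeps the train/test datasets balanced on the chosen features.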
ourteam2017/Bio_programmingI-