-
Notifications
You must be signed in to change notification settings - Fork 0
Upload dendrogram + PCA clustering script #4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
| python3 tcga_fpkm_dendrogram.py <filename> | ||
| ``` | ||
|
|
||
| A folder called `results` will be created which will contain the following output files: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you list the generated output files and give a brief description of them here?
| @@ -0,0 +1,53 @@ | |||
| # TCGA_FPKM_dendrogram | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In general, we want this script to be a generic dendogram + PCA clustering script, not necessarily focused on the TCGA data set. So it's best not to mention TCGA in the documentation or in the code.
| Once you have activated the `virtualenv`, use the script as follows: | ||
|
|
||
| ```bash | ||
| python3 tcga_fpkm_dendrogram.py <filename> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add a brief description of the how the input file should look like? Maybe a small example as well?
| def plot_dendro_interactive(data, labels): | ||
| data_array = tcga_data.to_numpy() | ||
| fig = ff.create_dendrogram(data_array, orientation='left', labels=labels) | ||
| fig.update_layout(width=1600, height=800) | ||
| fig.write_image('./results/TCGA-FPKM_dendrogram-intr.png') | ||
| fig.show() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Might as well remove this function since it's no longer used.
| @@ -0,0 +1,96 @@ | |||
| from sklearn.cluster import AgglomerativeClustering | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remember to remove all mentions of TCGA.
| sys.exit() | ||
|
|
||
| # Create directory for results | ||
| path = './results' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add a parameter for the user to specify the name of the output directory?
No description provided.