It would be nice to refactor the script into a notebook, so users can run the whole set of tools in Google Colab.