This tool retrieves data from GitHub for our repositories and aggregates data on pull requests.
- Add a line to `gradle.properties` with a valid GitHub access token that has permission to read the repositories' data: `githubAccessToken=1234567890asdfgh`
- Set the list of repositories to scan in `gradle.properties`
- Run `./gradlew runDefault`
- The data will be generated in the `data` folder

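Before running the Gradle task, it can help to confirm that the token line is actually present. The following is a minimal sketch, not part of the tool: it assumes `gradle.properties` sits in the current directory and uses plain `key=value` lines, and the helper names `read_properties` and `has_access_token` are hypothetical.

```python
from pathlib import Path


def read_properties(text: str) -> dict[str, str]:
    """Parse simple key=value lines, skipping blank lines and # or ! comments."""
    props: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith(("#", "!")):
            continue
        key, sep, value = line.partition("=")
        if sep:  # ignore lines that have no '=' separator
            props[key.strip()] = value.strip()
    return props


def has_access_token(text: str) -> bool:
    """True when githubAccessToken is present and non-empty."""
    return bool(read_properties(text).get("githubAccessToken"))


if __name__ == "__main__":
    # Assumption: the script is run from the project root.
    path = Path("gradle.properties")
    if path.exists() and has_access_token(path.read_text(encoding="utf-8")):
        print("githubAccessToken is set")
    else:
        print("githubAccessToken is missing from gradle.properties")
```

This only checks that the key exists; it does not verify that GitHub accepts the token or that it has the required permissions.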
There are three levels of data collected or aggregated:

`_all.csv` contains the data aggregated per repository over the entire observation period of 11 weeks

In each repository folder, `repository_name.csv` contains the data aggregated per week

In each repository folder, `repository_name_week.csv` contains the raw data points, with the stats of each specific pull request
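
The three levels can be told apart from the file names alone. The sketch below is illustrative only: it assumes `_all.csv` sits directly in the `data` folder and that per-week files are named after the repository followed by an underscore and a week identifier. The function `classify` is hypothetical and not part of the tool.

```python
from pathlib import PurePosixPath


def classify(relative_path: str) -> str:
    """Classify a path relative to data/ as 'all', 'weekly', 'pull_requests' or 'unknown'."""
    path = PurePosixPath(relative_path)
    if path.suffix != ".csv":
        return "unknown"
    if len(path.parts) == 1:
        # Assumed location: data/_all.csv
        return "all" if path.name == "_all.csv" else "unknown"
    if len(path.parts) == 2:
        repo, stem = path.parts[0], path.stem
        if stem == repo:
            # <repo>/<repo>.csv: data aggregated per week
            return "weekly"
        if stem.startswith(repo + "_"):
            # <repo>/<repo>_<week>.csv: raw pull request data points
            return "pull_requests"
    return "unknown"


if __name__ == "__main__":
    for name in ["_all.csv", "my-repo/my-repo.csv", "my-repo/my-repo_12.csv"]:
        print(name, "->", classify(name))
```

The repository name `my-repo` and the week identifier `12` are made-up examples; adjust the checks if the generated names differ.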

All times are expressed in 24-hour days, so 0.5 means 12 hours. Weekends have already been removed from every time measurement, but night hours are still included.
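
To make the unit concrete, here is a small sketch of the conversion and of one possible way to exclude weekends. It is an assumption about the approach, not the tool's actual implementation: it treats Saturday and Sunday as fully excluded, counts every hour of a weekday (nights included), and expects naive `datetime` values in a single time zone. Both function names are hypothetical.

```python
from datetime import datetime, timedelta


def days_to_hours(days: float) -> float:
    """Convert a duration in 24-hour days to hours."""
    return days * 24.0


def elapsed_days_excluding_weekends(start: datetime, end: datetime) -> float:
    """Elapsed time in 24-hour days, skipping Saturdays and Sundays entirely."""
    if end <= start:
        return 0.0
    total = timedelta()
    cursor = start
    while cursor < end:
        # Walk one calendar day at a time, stopping at midnight or at `end`.
        next_midnight = datetime.combine(
            cursor.date() + timedelta(days=1), datetime.min.time()
        )
        segment_end = min(next_midnight, end)
        if cursor.weekday() < 5:  # Monday=0 ... Friday=4
            total += segment_end - cursor
        cursor = segment_end
    return total / timedelta(days=1)


if __name__ == "__main__":
    print(days_to_hours(0.5))
    # Friday noon to Monday noon: the weekend is skipped.
    print(elapsed_days_excluding_weekends(
        datetime(2024, 1, 5, 12, 0), datetime(2024, 1, 8, 12, 0)
    ))
```

Because night hours are kept, a pull request opened on a weekday evening and merged the next morning still accumulates the overnight time.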