-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
infrastructureDependency upgrades, refactors, etc.Dependency upgrades, refactors, etc.
Description
A few things we can do
- Perform the upsert queries as we reach the requisite batch size while reading JSON files, not all at the end (which leads to significant memory consumption for a lot of opinion data)
- Download CL tar files to disk rather than holding them in memory so as to consume less memory
- Parallelize downloads but not tar extraction (faster downloads but don't kill RAM usage)
Metadata
Metadata
Assignees
Labels
infrastructureDependency upgrades, refactors, etc.Dependency upgrades, refactors, etc.