This repository contains the complete workflow developed to solve Problem Set 1 for the course Big Data and Machine Learning (2026–10). The project is organized into modular directories that cover each stage of the process, from data scraping and preprocessing to analysis, modeling, and results documentation.
Please run the code contained in the script 00_rundirectory.R to replicate the results.
- 00_data: funciones de scraping para extraer la información. Saving intermediate data.
- 01_code: Code building for clean, regress and prediccion.
- 02_output: These files extract the results from the scripts, such as figures and tables.
- 03_regression: estimación de modelos econométricos.
- 99_other: Files usefull for the results.