README.md 720 Bytes
Newer Older
Laura Masa's avatar
Laura Masa committed
1 2
The scripts in this folder are organized as follows:

Laura Masa's avatar
Laura Masa committed
3
* **download_files.py**: Contains functions for downloading and decompressing the files needed for analysis.
Laura Masa's avatar
Laura Masa committed
4 5 6 7 8
*  **download_files.ipynb**: Jupyter notebook that applies the download functions to retrieve and decompress the files used for analysis.
*  **preprocess_functions.py**: Contains functions for preprocessing and filtering the data from the files.
*  **insert_tables.py**: Contains functions for inserting tables into the MySQL database.
*  **preprocess_upload_data.ipynb**:  Jupyter notebook that preprocesses annotations and inserts all tables into the MySQL database.
*  **gpl_main.ipynb**: Jupyter notebook used to insert GPL data into the MySQL database.