Laura Masa committed
The scripts in this folder are organized as follows:

*  download_files.py: Contains functions for downloading and decompressing the files needed for analysis.
*  download_files.ipynb: Jupyter notebook that applies the download functions to retrieve and decompress the files used for analysis.
*  preprocess_functions.py: Contains functions for preprocessing and filtering the data from the files.
*  insert_tables.py: Contains functions for inserting tables into the MySQL database.
*  preprocess_upload_data.ipynb: Jupyter notebook that preprocesses annotations and inserts all tables into the MySQL database.
*  gpl_main.ipynb: Jupyter notebook used to insert GPL data into the MySQL database.
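
As a rough illustration of the download-and-decompress step described for download_files.py, here is a minimal sketch. The function names `download_file` and `decompress_gz` are hypothetical and not taken from the repository, and gzip is only an assumed compression format; the actual script may handle other formats or be organized differently.

```python
import gzip
import shutil
import urllib.request
from pathlib import Path


def download_file(url: str, dest_dir: Path) -> Path:
    """Download `url` into `dest_dir` and return the local path.

    Hypothetical helper: the real download_files.py may differ.
    """
    dest_dir.mkdir(parents=True, exist_ok=True)
    # Use the last path component of the URL as the local file name.
    target = dest_dir / url.rsplit("/", 1)[-1]
    with urllib.request.urlopen(url) as response, open(target, "wb") as out:
        shutil.copyfileobj(response, out)
    return target


def decompress_gz(archive: Path) -> Path:
    """Decompress a .gz archive next to itself and return the new path.

    Assumes gzip compression, which the README does not specify.
    """
    if archive.suffix != ".gz":
        raise ValueError(f"expected a .gz file, got {archive.name}")
    # "data.txt.gz" -> "data.txt"
    target = archive.with_suffix("")
    with gzip.open(archive, "rb") as src, open(target, "wb") as out:
        shutil.copyfileobj(src, out)
    return target
```

In a notebook such as download_files.ipynb, the two helpers would typically be chained: call `download_file` with the source URL and a data directory, then pass the returned path to `decompress_gz`.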