# Data

This directory contains the node and link data used in this research.

## Nodes

| DATA | DESCRIPTION | IDENTIFIER | TOTAL | SOURCE | ACCESSED DATE |
|------|-------------|------------|-------|--------|---------------|
| Diseases (dis.tsv) | Data regarding diseases, including their name and identifier | Unified Medical Language System (UMLS) Concept Unique Identifiers (CUI) | 30,731 | [UMLS](https://www.nlm.nih.gov/research/umls/knowledge_sources/metathesaurus/index.html) | May 2020 |
| Genes (gen.tsv) | Data relating to genes, including their symbol and identifier | National Center for Biotechnology Information (NCBI) Identifiers | 20,610 | [NCBI](https://www.ncbi.nlm.nih.gov/) | May 2020 |
| Proteins (prot.tsv) | Data relating to proteins, including their identifier | Accession number in UniProt | 18,521 | [UniProt](https://www.uniprot.org/) | May 2020 |
| Drugs (dru.tsv) | Data relating to drugs, including their name and identifier | ChEMBL identifier | 3,944 | [ChEMBL](https://www.ebi.ac.uk/chembl/) | May 2020 |

## Links

| DATA | DESCRIPTION | IDENTIFIER | TOTAL | SOURCE | ACCESSED DATE |
|------|-------------|------------|-------|--------|---------------|
| Disease – Drug (dis_dru_the.tsv) | Associations between diseases and drugs used for their treatment | UMLS CUI – ChEMBL identifier | 52,179 | [Comparative Toxicogenomics Database (CTD)](http://ctdbase.org/) | May 2020 |
| Disease – Gene (dis_gen.tsv) | Associations between diseases and genes whose mutation triggers the disease | UMLS CUI – NCBI identifier | 358,209 | [DisGeNET](https://www.disgenet.org/) | May 2020 |
| Disease – Protein (dis_prot.tsv) | Associations between diseases and proteins produced from their pathological genes | UMLS CUI – Accession number in UniProt | 361,325 | [DisGeNET](https://www.disgenet.org/) | May 2020 |
| Gene – Protein (gen_pro.tsv) | Associations between genes and the proteins produced from them | NCBI identifier – Accession number in UniProt | 15,770 | [DisGeNET](https://www.disgenet.org/) | May 2020 |
| Protein – Protein (pro_pro.tsv) | Associations between proteins that physically interact with each other | Accession number in UniProt – Accession number in UniProt | 439,863 | [DisGeNET](https://www.disgenet.org/) | May 2020 |
| Drug – Protein (dru_pro.tsv) | Associations between drugs and the target proteins they affect | ChEMBL identifier – Accession number in UniProt | 5,946 | [ChEMBL](https://www.ebi.ac.uk/chembl/) and [DrugBank](https://go.drugbank.com/) | May and December 2020 |
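
The totals listed in the tables above can serve as a quick sanity check after downloading the files. The following is a minimal sketch using only the Python standard library, not part of the original pipeline. It assumes that each file is tab-separated, that every row is one node or link, and that the first row is a header; none of these are confirmed here, so pass `has_header=False` if the files have no header. The names `EXPECTED_TOTALS`, `count_rows`, and `check_totals` are hypothetical helpers introduced for illustration.

```python
import csv
from pathlib import Path

# Totals copied from the Nodes and Links tables in this README.
EXPECTED_TOTALS = {
    "dis.tsv": 30731,
    "gen.tsv": 20610,
    "prot.tsv": 18521,
    "dru.tsv": 3944,
    "dis_dru_the.tsv": 52179,
    "dis_gen.tsv": 358209,
    "dis_prot.tsv": 361325,
    "gen_pro.tsv": 15770,
    "pro_pro.tsv": 439863,
    "dru_pro.tsv": 5946,
}


def count_rows(path, has_header=True):
    """Count non-empty data rows in a tab-separated file.

    Assumption: one row per node or link; the header (if any) is excluded.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle, delimiter="\t")
        rows = sum(1 for row in reader if row)
    if has_header and rows > 0:
        return rows - 1
    return rows


def check_totals(data_dir, has_header=True):
    """Return {filename: (expected, found)} for every file that differs.

    A missing file is reported with found set to None.
    """
    mismatches = {}
    for name, expected in EXPECTED_TOTALS.items():
        path = Path(data_dir) / name
        found = count_rows(path, has_header) if path.exists() else None
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches
```

Calling `check_totals(".")` from this directory returns an empty dictionary when every file matches its listed total. A mismatch does not necessarily mean the data is wrong: the totals may count unique entities rather than rows, in which case the row-count assumption above would need adjusting.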