# Data
This directory contains the node and link data used in this research.

## Nodes

| DATA                    | DESCRIPTION                                                      | IDENTIFIER                                                               | TOTAL  | SOURCE                                                                                      | ACCESSED DATE        |
|-------------------------|------------------------------------------------------------------|--------------------------------------------------------------------------|--------|--------------------------------------------------------------------------------------------|----------------------|
| Diseases (dis.tsv)      | Data regarding diseases, including their name and identifier     | Unified Medical Language System (UMLS) Concept Unique Identifiers (CUI)  | 30,731 | [UMLS](https://www.nlm.nih.gov/research/umls/knowledge_sources/metathesaurus/index.html)    |    May 2020          |
| Genes (gen.tsv)         | Data relating to genes, including their symbol and identifier    | National Center for Biotechnology Information (NCBI) Identifiers         | 20,610 | [NCBI](https://www.ncbi.nlm.nih.gov/)                                                       |    May 2020          |
| Proteins (prot.tsv)     | Data relating to proteins, including their identifier            | Accession number in UniProt                                              | 18,521 | [UniProt](https://www.uniprot.org/)                                                         |    May 2020          |
| Drugs (dru.tsv)         | Data relating to drugs, including their name and identifier      |  ChEMBL Identifier                                                       | 3,944  | [ChEMBL](https://www.ebi.ac.uk/chembl/)                                                     |    May 2020          |
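
The totals in the table above can be used as a quick sanity check after downloading the files. The sketch below counts the data rows in each node file and pairs the result with the expected total. It assumes the files are plain tab-separated text with one record per line and a single header row; the header is an assumption, so pass `has_header=False` if the files have none. The helper names `count_records` and `check_node_totals` are hypothetical and not part of this repository.

```python
import csv
from pathlib import Path

# Expected row counts, copied from the TOTAL column of the Nodes table.
EXPECTED_NODE_TOTALS = {
    "dis.tsv": 30731,
    "gen.tsv": 20610,
    "prot.tsv": 18521,
    "dru.tsv": 3944,
}


def count_records(path, has_header=True):
    """Count non-empty data rows in a tab-separated file.

    has_header is an assumption about the file layout; set it to False
    if the file turns out to have no header row.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        # csv.reader yields an empty list for blank lines, which we skip.
        rows = [row for row in csv.reader(handle, delimiter="\t") if row]
    if has_header and rows:
        return len(rows) - 1
    return len(rows)


def check_node_totals(data_dir, has_header=True):
    """Return {filename: (found, expected)} for each node file present in data_dir."""
    report = {}
    for name, expected in EXPECTED_NODE_TOTALS.items():
        path = Path(data_dir) / name
        if path.exists():
            report[name] = (count_records(path, has_header), expected)
    return report
```

Calling `check_node_totals(".")` from this directory returns one `(found, expected)` pair per file, so any mismatch is easy to spot.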


## Links

| DATA                              | DESCRIPTION                                                                        | IDENTIFIER                                                     | TOTAL  | SOURCE                                                                            | ACCESSED DATE              |
|-----------------------------------|------------------------------------------------------------------------------------|----------------------------------------------------------------|--------|-----------------------------------------------------------------------------------|----------------------------|
| Disease – Drug (dis_dru_the.tsv)  | Associations between diseases and drugs used for their treatment                   | UMLS CUI – ChEMBL Identifier                                   | 52,179 | [Comparative Toxicogenomics Database (CTD)](http://ctdbase.org/)                  |   May 2020                 |
| Disease – Gene (dis_gen.tsv)      | Associations between diseases and genes whose mutation triggers the disease        | UMLS CUI – NCBI Identifier                                     | 358,209| [DisGeNET](https://www.disgenet.org/)                                             |   May 2020                 |
| Disease – Protein (dis_prot.tsv)  | Associations between diseases and proteins produced from their pathological genes  | UMLS CUI – Accession number in UniProt                         | 361,325| [DisGeNET](https://www.disgenet.org/)                                             |   May 2020                 |
| Gene – Protein (gen_pro.tsv)      | Associations between genes and proteins produced from the gene                     | NCBI Identifier – Accession number in UniProt                  | 15,770 | [DisGeNET](https://www.disgenet.org/)                                             |   May 2020                 |
| Protein – Protein (pro_pro.tsv)   | Associations between proteins that physically interact with each other             | Accession number in UniProt – Accession number in UniProt      | 439,863| [DisGeNET](https://www.disgenet.org/)                                             |   May 2020                 |
| Drug – Protein (dru_pro.tsv)      | Associations between drugs and the target proteins they affect                     | ChEMBL Identifier – Accession number in UniProt                | 5,946  | [ChEMBL](https://www.ebi.ac.uk/chembl/) and [DrugBank](https://go.drugbank.com/) |   May and December 2020    |
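
Each link file pairs two identifiers, as listed in the IDENTIFIER column above, so it can be read into a simple adjacency map. The sketch below is one possible way to do that. It assumes the files are tab-separated, that the first two columns hold the two identifiers in the order given in the table, and that there is a header row; none of this is confirmed by this README, so adjust as needed. The function names `read_links` and `build_adjacency` are hypothetical.

```python
import csv
from collections import defaultdict


def read_links(handle, has_header=True):
    """Yield (source_id, target_id) pairs from an open tab-separated file.

    Assumes the first two columns hold the two identifiers; rows with
    fewer than two columns (including blank lines) are skipped.
    """
    reader = csv.reader(handle, delimiter="\t")
    if has_header:
        next(reader, None)  # discard the assumed header row
    for row in reader:
        if len(row) >= 2:
            yield row[0], row[1]


def build_adjacency(links):
    """Group target identifiers by source identifier."""
    adjacency = defaultdict(set)
    for source, target in links:
        adjacency[source].add(target)
    return dict(adjacency)
```

For example, `build_adjacency(read_links(open("dru_pro.tsv", newline="", encoding="utf-8")))` would map each ChEMBL Identifier to the set of UniProt accession numbers it targets, under the column-order assumption stated above.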