Elena Abad - 02 August 2024
Knowledge Graphs for Drug Discovery: Transforming Target Identification and Biomedical Data Integration
Leveraging Relationships in Knowledge Graphs to Enhance Drug Discovery and Target Identification
Relational databases for scientific data management
Currently, the search for targets in drug repositioning and new targets for disease treatment requires integrating complex biological and medical data sets. These include multi-scale omics data sets with genomic, proteomic, or metabolomic information (big data) and clinical data that need proper integration. To store and integrate heterogeneous datasets within biomedical research, researchers have habitually and successfully used relational databases. In these databases, information is organized in structured thematic tables with identifiers connecting these tables. This model allows for large volume storage, easy expansion, secure access, and well-defined usage rules [1]. It has been a standard paradigm in computer data management since its inception in the 1970s by Edgar Frank Codd at IBM Laboratories. Moreover, it has been used across various scientific, technological, and industrial fields.
The emergence of networks: inferring knowledge from interactions
Advances in computing and the alliance between biology and mathematics have led to a revolution in data management, structuring, and analysis. Many biological and biomedical challenges inspire new mathematics, offering solutions and theoretical approaches to quantify and infer knowledge that otherwise would remain hidden [2]. A prime example is the use of networks in biology. A network or graph is a data set composed of vertices or nodes (concepts) and edges (relationships between nodes). Systems biology, a highly interdisciplinary branch that has developed remarkably over more than 20 years, employs networks as a primary method in its holistic approach to biological processes. Biological processes interact internally and externally within cells, tissues, or organs. Therefore, studying interrelationships and connectivity is a hallmark of biological systems.
Biological applications of networks
Today, biological networks are a common representation model, providing insights into the interactive and dynamic complexity of biological processes. Well-known examples include gene regulation and coexpression networks (comprising transcription factors, genes, and associated proteins), intracellular and extracellular signaling networks, metabolic networks (relationships between chemical compounds), and protein-protein interaction networks (with protein entities as nodes and their interactions as links). These methodologies have enabled the identification of gene clusters underlying pathological mechanisms and important disease-associated mutations [3-7].
A paradigm shift in scientific data management: the knowledge graphs
The benefits of graphs in fields where entity relationships are prominent led to the use of graphs as databases. This resulted in the emergence of knowledge graphs (KGs). A KG is a database model based on graph architecture, incorporating information from different entities, their relationships, and associated metadata. KGs facilitate network analysis, enabling the identification of operational modules, the measurement of element relevance, detection of variations in the modeled knowledge scheme, and derivation of data insights from previously unknown connections [8-11]. Today, KGs are favored for their flexible structure, fast query retrieval, intuitive nature, and ease of use, as demonstrated by Google's incorporation of KG into its search engine tools.
At ZeClinics, we work with public and proprietary biomedical datasets, employing the appropriate computational technology to construct a company knowledge graph. This enhances our analytical strategy for identifying new protein and/or gene targets and drug discovery. We believe the benefits of knowledge graphs as databases significantly surpass those of relational databases for targeting purposes.
Unlock the Future of Drug Discovery
Ready to revolutionize your drug discovery process with knowledge graphs?
REFERENCES
[1] Rice M, Gladstone W, Weir M. Relational databases: a transparent framework for encouraging biology students to think informatically. Cell Biol Educ. 2004 Winter;3(4):241-52. doi: 10.1187/cbe.03-09-0012.
[2] Cohen JE. Mathematics is biology's next microscope, only better; biology is mathematics' next physics, only better. PLoS Biol. 2004 Dec;2(12):e439. doi: 10.1371/journal.pbio.0020439.
[3] Charitou T, Bryan K, Lynn DJ. Using biological networks to integrate, visualize and analyze genomics data. Genet Sel Evol. 2016 Mar 31;48:27. doi: 10.1186/s12711-016-0205-1.
[4] Zhou G, Li S, Xia J. Network-Based Approaches for Multi-omics Integration. Methods Mol Biol. 2020;2104:469-487. doi: 10.1007/978-1-0716-0239-3_23.
[5] Joshi A, Rienks M, Theofilatos K, Mayr M. Systems biology in cardiovascular disease: a multiomics approach. Nat Rev Cardiol. 2021 May;18(5):313-330. doi: 10.1038/s41569-020-00477-1.
[6] Watts DJ, Strogatz SH. Collective dynamics of 'small-world' networks. Nature. 1998 Jun 4;393(6684):440-2. doi: 10.1038/30918.
[7] Barabási AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011 Jan;12(1):56-68. doi: 10.1038/nrg2918.
[8] MacLean F. Knowledge graphs and their applications in drug discovery. Expert Opin Drug Discov. 2021 Sep;16(9):1057-1069. doi: 10.1080/17460441.2021.1910673.
[9] Wang H, Zu Q, Lu M, Chen R, Yang Z, Gao Y, Ding J. Application of Medical Knowledge Graphs in Cardiology and Cardiovascular Medicine: A Brief Literature Review. Adv Ther. 2022 Sep;39(9):4052-4060. doi: 10.1007/s12325-022-02254-7.
[10] Unni DR, Moxon SAT, Bada M, Brush M, Bruskiewich R, Caufield JH, Clemons PA, Dancik V, Dumontier M, Fecho K, Glusman G, Hadlock JJ, Harris NL, Joshi A, Putman T, Qin G, Ramsey SA, Shefchek KA, Solbrig H, Soman K, Thessen AE, Haendel MA, Bizon C, Mungall CJ; Biomedical Data Translator Consortium. Biolink Model: A universal schema for knowledge graphs in clinical, biomedical, and translational science. Clin Transl Sci. 2022 Aug;15(8):1848-1855. doi: 10.1111/cts.13302.
[11] Ratajczak F, Joblin M, Ringsquandl M, Hildebrandt M. Task-driven knowledge graph filtering improves prioritizing drugs for repurposing. BMC Bioinformatics. 2022 Mar 4;23(1):84. doi: 10.1186/s12859-022-04608-y.