Molecular Regulatory Pathways (MRPs) are key to understanding biological functions. Knowledge Graphs (KGs) help organize and analyze MRPs by structuring complex interactions. Current methods for extracting KGs from biomedical literature struggle with hierarchical relationships and context. Large Language Models (LLMs) like GPT-4 show promise in addressing these issues but remain underexplored for end-to-end KG construction. We present reguloGPT, a novel GPT-4 based in-context learning prompt designed for the end-to-end extraction of a regulatory graph and context from a sentence that describes regulatory interactions. reguloGPT employs a context-aware relational graph to capture MRPs' hierarchical structure and resolves semantic inconsistencies by embedding context directly within the relational edges. We created a benchmark dataset comprising four hundred annotated PubMed titles on N6 -methyladenosine (m6 A) regulations. Rigorous evaluations of reguloGPT on the benchmark dataset showed marked improvements over existing algorithms and other LLMs. We further developed a novel G-Eval scheme, leveraging GPT-4 for annotation-free performance evaluation that demonstrated agreement with evaluations on the benchmark dataset. Lastly, we constructed m6 A-KG by applying reguloGPT to 1,396 m6 A-related titles and demonstrated its utility in elucidating m6 A's reg-ulatory mechanisms of cancer phenotypes across various cancers. These results underscore reguloGPT's potential for advancing biological knowledge extraction. All reguloGPT works including source code, benchmark datasets, and m6 A-KG are available at https://github.com/Huang-AI4Medicine-Lab/reguloGPt.
ReguloGPT: Harnessing GPT for End-to-End Knowledge Graph Construction of Molecular Regulatory Pathways
Xidong Wu,Sumin Jo,Yiming Zeng,Arun Das,Tinghe Zhang,Parth Patel,Yuanjing Wei,Lei Li,Shou-Jiang Gao,Jianqiu Zhang,D. Pratt,Yu-Chiao Chiu,Yufei Huang
Published 2024 in 2024 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI)
ABSTRACT
PUBLICATION RECORD
- Publication year
2024
- Venue
2024 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI)
- Publication date
2024-11-10
- Fields of study
Biology, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-36 of 36 references · Page 1 of 1
CITED BY
Showing 1-8 of 8 citing papers · Page 1 of 1