Wikidata as a FAIR knowledge graph for the life sciences
Abstract
Wikidata is a community-maintained knowledge base that epitomizes the FAIR principles of Findability, Accessibility, Interoperability, and Reusability. Here, we describe the breadth and depth of biomedical knowledge contained within Wikidata, assembled from primary knowledge repositories on genomics, proteomics, genetic variants, pathways, chemical compounds, and diseases. We built a collection of open-source tools that simplify the addition and synchronization of Wikidata with source databases. We furthermore demonstrate several use cases of how the continuously updated, crowd-contributed knowledge in Wikidata can be mined. These use cases cover a diverse cross section of biomedical analyses, from crowdsourced curation of biomedical ontologies, to phenotype-based diagnosis of disease, to drug repurposing.
Subject Area
- Biochemistry (11736)
- Bioengineering (8746)
- Bioinformatics (29186)
- Biophysics (14964)
- Cancer Biology (12084)
- Cell Biology (17401)
- Clinical Trials (138)
- Developmental Biology (9418)
- Ecology (14176)
- Epidemiology (2067)
- Evolutionary Biology (18299)
- Genetics (12235)
- Genomics (16793)
- Immunology (11863)
- Microbiology (28066)
- Molecular Biology (11580)
- Neuroscience (60925)
- Paleontology (451)
- Pathology (1870)
- Pharmacology and Toxicology (3238)
- Physiology (4956)
- Plant Biology (10422)
- Synthetic Biology (2883)
- Systems Biology (7338)
- Zoology (1650)