JUHE API Marketplace
DATASET
Open Source Community

ibm/otter_primekg

The Otter PrimeKG dataset contains 12,757,257 triples covering proteins, drugs, and diseases, and includes protein sequences, SMILES strings, and textual descriptions. Built on PrimeKG—a precision‑medicine knowledge graph integrating 20 biomedical resources—it describes 17,080 diseases and 4 million relations. PrimeKG includes nodes for 29,786 genes/proteins and 7,957 drugs. The multimodal knowledge graph (MKG) derived from PrimeKG comprises 13 modalities and 12,757,300 edges (154,130 data‑property edges and 12,603,170 object‑property edges), featuring 642,150 protein‑protein interaction edges, 25,653 drug‑protein interaction edges, and 2,672,628 drug‑drug interaction edges.

Updated 6/26/2023
hugging_face

Description

Otter PrimeKG Dataset Overview

Dataset Description

  • Name: Otter PrimeKG
  • Content: 12,757,257 triples covering proteins, drugs, and diseases. Includes protein sequences, SMILES strings, and textual information.

Dataset Details

  • PrimeKG: Integrates 20 biomedical resources, describing 17,080 diseases and 4 million relations. Nodes include 29,786 genes/proteins and 7,957 drugs.
  • Multimodal Knowledge Graph (MKG): Built from PrimeKG, contains 13 modalities, 12,757,300 edges (154,130 data‑property edges and 12,603,170 object‑property edges), among which are 642,150 protein‑protein interaction edges, 25,653 drug‑protein interaction edges, and 2,672,628 drug‑drug interaction edges.

Original Dataset Information

License

  • Type: MIT

Related Models

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Biomedical Knowledge Graph
Precision Medicine

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.