Dataset asset · Open Source Community · Natural Language Processing · Biomedical Text Mining

bigbio/mlee

MLEE is a corpus of event extraction annotations for angiogenesis paper abstracts. It includes manually annotated entities, relations, events, and coreference information covering processes at the molecular, cellular, tissue, and organ levels.

  • Source: hugging_face
  • Created: Nov 28, 2025
  • Updated: Dec 22, 2022
  • Signals: 179 views
  • Availability: Linked source ready

Dataset Overview

Basic Information

  • Language: English
  • License: CC BY-NC-SA 3.0
  • Multilinguality: Monolingual
  • Dataset Name: MLEE
  • Homepage: http://www.nactem.ac.uk/MLEE/
  • Publicly Available: Yes
  • Accessible via PubMed: Yes
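
Since this card lists Hugging Face as the source and `bigbio/mlee` as the identifier, loading the corpus might look like the sketch below. It assumes the `datasets` library is installed. The configuration name `mlee_bigbio_kb`, the `events` field, and its `type` key are assumptions that are not confirmed by this card, so check the repository before relying on them. Both function names are hypothetical helpers.

```python
from collections import Counter


def load_mlee(config_name="mlee_bigbio_kb"):
    """Download MLEE from the Hugging Face Hub (network access required)."""
    # Imported lazily so the helper below works without the library installed.
    from datasets import load_dataset

    # "bigbio/mlee" is the identifier shown on this card; config_name is an
    # assumption and may differ in the actual repository.
    return load_dataset("bigbio/mlee", name=config_name)


def count_event_types(records):
    """Count event types across records.

    Assumes each record is a dict with an "events" list whose items carry
    a "type" key (an assumed schema, not confirmed by this card).
    """
    counts = Counter()
    for record in records:
        for event in record.get("events", []):
            counts[event["type"]] += 1
    return counts
```

If the download succeeds, passing one split of the returned object to `count_event_types` would give a quick view of how the event annotations are distributed. The split names depend on the repository and are not stated here.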

Task Types

  • Event Extraction (EVENT_EXTRACTION)
  • Named Entity Recognition (NAMED_ENTITY_RECOGNITION)
  • Relation Extraction (RELATION_EXTRACTION)
  • Coreference Resolution (COREFERENCE_RESOLUTION)

Dataset Description

MLEE is an event‑extraction corpus containing manual annotations of entities, relations, events, and coreference for abstracts of angiogenesis papers. Annotations span molecular, cellular, tissue, and organ‑level biological processes.
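
To make the four annotation layers concrete, here is a minimal sketch of how entities, events, relations, and coreference could be held together for one document. The class names, the sentence, and every type label are invented for illustration; they are not taken from MLEE and have not been checked against its type inventory or file format.

```python
from dataclasses import dataclass, field


@dataclass
class Span:
    """A typed text span: an entity mention or an event trigger."""
    id: str
    type: str
    start: int
    end: int


@dataclass
class Event:
    """An event anchored on a trigger span, with role-labelled arguments."""
    id: str
    type: str
    trigger_id: str
    arguments: list = field(default_factory=list)  # (role, referenced id) pairs


@dataclass
class Document:
    text: str
    spans: dict = field(default_factory=dict)
    events: dict = field(default_factory=dict)
    relations: list = field(default_factory=list)     # (type, head id, tail id)
    coreferences: list = field(default_factory=list)  # groups of span ids

    def add_span(self, span_id, span_type, surface):
        """Register the first occurrence of `surface` in the text as a span."""
        start = self.text.index(surface)
        self.spans[span_id] = Span(span_id, span_type, start, start + len(surface))

    def surface(self, span_id):
        """Return the text covered by a span."""
        span = self.spans[span_id]
        return self.text[span.start:span.end]


# Invented sentence and labels, for illustration only (not taken from MLEE).
doc = Document("VEGF promotes angiogenesis. It is overexpressed in tumors.")
doc.add_span("T1", "Gene_or_gene_product", "VEGF")
doc.add_span("T2", "Trigger", "angiogenesis")
doc.add_span("T3", "Trigger", "promotes")
doc.add_span("T4", "Pronoun", "It")
doc.events["E1"] = Event("E1", "Blood_vessel_development", "T2")
# An event may take another event as an argument, which gives nested structures.
doc.events["E2"] = Event("E2", "Positive_regulation", "T3",
                         [("Cause", "T1"), ("Theme", "E1")])
doc.coreferences.append(["T4", "T1"])

for event in doc.events.values():
    print(event.type, "->", doc.surface(event.trigger_id), event.arguments)
```

Storing arguments as references to ids rather than to text lets an event point at either an entity or another event, which is one way to express processes that span several levels of biological organization.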

Citation

@article{pyysalo2012event,
  title={Event extraction across multiple levels of biological organization},
  author={Pyysalo, Sampo and Ohta, Tomoko and Miwa, Makoto and Cho, Han-Cheol and Tsujii, Jun'ichi and Ananiadou, Sophia},
  journal={Bioinformatics},
  volume={28},
  number={18},
  pages={i575--i581},
  year={2012},
  publisher={Oxford University Press}
}
