Back to datasets
Dataset assetOpen Source CommunityFraud DetectionBlockchain

Ethereum Transaction Fraud Detection (ETFD)

ETFD is a comprehensive, high‑quality dataset designed to advance research and development in fraud transaction detection on the Ethereum blockchain. Generated by ETDG, it addresses common challenges in public Ethereum fraud detection datasets such as single‑cardinality, high‑cardinality, missing values, and data encoding issues, thereby reducing model over‑fitting risk and improving performance.

Source
github
Created
Aug 7, 2024
Updated
Aug 8, 2024
Signals
157 views
Availability
Linked source ready
Overview

Dataset description and usage context

Ethereum Transaction Data Generator (ETDG) and Ethereum Transaction Fraud Detection (ETFD) Datasets

Overview

This repository contains the Ethereum Transaction Data Generator (ETDG) and the Ethereum Transaction Fraud Detection (ETFD) datasets.

ETDG Dataset

ETDG is a tool for generating high‑quality transaction datasets suitable for classification tasks. It employs graph traversal, genetic algorithms, and a novel fitness function for effective feature extraction. This approach mitigates complexities in Ethereum transaction data related to cardinality, data encoding, and data staleness.

ETFD Dataset

ETFD is a comprehensive, high‑quality dataset aimed at promoting research and development of fraud transaction detection on the Ethereum blockchain. The dataset was generated using ETDG and resolves typical issues found in public Ethereum fraud detection datasets—single‑cardinality, high‑cardinality, missing values, and data encoding problems—thus reducing the risk of model over‑fitting and enhancing model performance.

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio