Explore high-quality datasets for your AI and machine learning projects.
ETFD is a high-quality dataset designed to advance research and development in fraudulent-transaction detection on the Ethereum blockchain. Generated by ETDG, it addresses problems common in public Ethereum fraud detection datasets, namely single-cardinality features, high-cardinality features, missing values, and data encoding issues, which reduces the risk of model overfitting and improves performance.
This eight-class fraud dataset is intended solely for academic and research use by universities and research institutes; commercial use is prohibited.
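To make the data-quality issues named above concrete, here is a minimal sketch of how one might screen any tabular fraud dataset for constant (single-cardinality) columns, near-unique (high-cardinality) columns, and missing values. It is not the ETDG pipeline: the function name `profile_columns`, the 0.9 uniqueness threshold, and the toy column names and labels are all assumptions made for illustration.

```python
def profile_columns(rows, high_card_ratio=0.9):
    """Flag columns that are constant, near-unique, or contain missing values.

    `rows` is a list of dicts (one per transaction). The 0.9 threshold is an
    arbitrary illustrative choice, not a value taken from ETFD or ETDG.
    """
    columns = sorted({key for row in rows for key in row})
    report = {}
    for col in columns:
        values = [row.get(col) for row in rows]
        # Treat None and empty strings as missing.
        present = [v for v in values if v is not None and v != ""]
        distinct = len(set(present))
        report[col] = {
            "missing": len(values) - len(present),
            "single_cardinality": distinct <= 1,
            "high_cardinality": bool(present)
            and distinct / len(present) > high_card_ratio,
        }
    return report


# Toy rows with hypothetical column names and labels, not taken from ETFD.
rows = [
    {"tx_hash": "0xa1", "chain": "eth", "gas": 21000, "label": "phishing"},
    {"tx_hash": "0xb2", "chain": "eth", "gas": None, "label": "normal"},
    {"tx_hash": "0xc3", "chain": "eth", "gas": 21000, "label": "normal"},
    {"tx_hash": "0xd4", "chain": "eth", "gas": 50000, "label": "normal"},
]

report = profile_columns(rows)
for name, flags in report.items():
    print(name, flags)
```

In this toy data, `chain` is flagged as single-cardinality (it carries no signal), `tx_hash` as high-cardinality (a model can memorize it, which encourages overfitting), and `gas` as having one missing value.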
The PaySim dataset contains over 6 million records, each with 9 features, generated by PaySim, a simulator of mobile money transactions. It is used for fraud and anomaly detection: the simulated fraudulent agents aim to profit by transferring funds to another account and then cashing out of the system.