Back to datasets
Dataset assetOpen Source CommunityCybersecurityURL Analysis

Malicious URL v5

This dataset is intended for training and testing malicious URL detectors. It contains multiple URLs together with detailed attributes such as domain name, registrar, registrar address, organization, Alexa traffic rank, etc.

Source
github
Created
Jul 18, 2020
Updated
Nov 4, 2020
Signals
194 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Content

Dataset Applications

  • Function: Predict the legitimacy of URLs and detect phishing assets.
  • Data Acquisition: Collects dynamic and sensitive URL attributes such as domain, registrar, registrar address, organization, Alexa traffic rank, etc.

Phishing Webpage Examples

  • Includes screenshots of phishing webpages mimicking well‑known brands such as WHO, the UK government, Chase Bank, Netflix, Adobe, Facebook, Microsoft, PayPal, Yahoo, etc.
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio