DATASET
Open Source Community
Malicious URL v5
This dataset is intended for training and testing malicious URL detectors. It contains multiple URLs together with detailed attributes such as domain name, registrar, registrar address, organization, Alexa traffic rank, etc.
Updated 11/4/2020
github
Description
Dataset Overview
Dataset Content
- Purpose: Used for training and testing malicious URL detectors.
- Data Structure:
  - Column Information:
    - S.NO
    - URL
    - Property
    - Name
    - Organisation
    - Address
    - City
    - State
    - Zipcode
    - Country
    - E‑mails
    - Domain
    - Alexa Rank
    - Registrar
    - time
  - Example Records:
    - Example 1:
      - URL: https://www.airtelxstream.in/search
      - Property: Legitimate
      - Domain: airtelxstream.in
      - Alexa Rank: 5793
      - Registrar: GoDaddy.com LLC
    - Example 2:
      - URL: https://www.airtelxstream.in/livetv-channels/sony-sab/mwtv_livetvchannel_347
      - Property: Legitimate
      - Domain: airtelxstream.in
      - Alexa Rank: 5793
      - Registrar: GoDaddy.com LLC
    - Example 3:
      - URL: https://myjiocare.com/sony-liv-premium-account-free/
      - Property: Legitimate
      - Domain: MYJIOCARE.COM
      - Alexa Rank: 2272473
      - Registrar: BigRock Solutions Ltd
    - Example 4:
      - URL: https://www.youtube.com/watch?v=dnbkysr3hoo
      - Property: Legitimate
      - Domain: YOUTUBE.COM
      - Alexa Rank: 2
      - Registrar: MarkMonitor Inc.
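The column list and example records above suggest a tabular file. The following is a minimal sketch of reading such records in Python, assuming a CSV layout whose header strings match the listed column names (the actual file format is not stated here). The `SAMPLE` text and the `load_records` helper are hypothetical and built only from the fields shown in the example records. The sketch lowercases the Domain value because the examples mix `airtelxstream.in` with `MYJIOCARE.COM`.

```python
import csv
import io

# Hypothetical CSV excerpt built only from fields shown in the example
# records; the real file has more columns and may use a different layout.
SAMPLE = """URL,Property,Domain,Alexa Rank,Registrar
https://www.airtelxstream.in/search,Legitimate,airtelxstream.in,5793,GoDaddy.com LLC
https://myjiocare.com/sony-liv-premium-account-free/,Legitimate,MYJIOCARE.COM,2272473,BigRock Solutions Ltd
https://www.youtube.com/watch?v=dnbkysr3hoo,Legitimate,YOUTUBE.COM,2,MarkMonitor Inc.
"""


def load_records(handle):
    """Read rows into dicts, lowercasing the domain and parsing the rank."""
    records = []
    for row in csv.DictReader(handle):
        rank = row["Alexa Rank"].strip()
        records.append({
            "url": row["URL"].strip(),
            "label": row["Property"].strip(),
            # Example records mix upper- and lowercase domains, so normalize.
            "domain": row["Domain"].strip().lower(),
            # Treat a missing or non-numeric rank as unknown (assumption).
            "alexa_rank": int(rank) if rank.isdigit() else None,
            "registrar": row["Registrar"].strip(),
        })
    return records


records = load_records(io.StringIO(SAMPLE))
for record in records:
    print(record["domain"], record["alexa_rank"], record["label"])
```

To read from disk instead, pass an open file handle (opened with `newline=""`) in place of the `io.StringIO` object.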
Dataset Applications
- Function: Predict the legitimacy of URLs and detect phishing assets.
- Data Acquisition: Each URL is paired with dynamic, sensitive attributes such as domain, registrar, registrar address, organization, and Alexa traffic rank.
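As an illustration of how these attributes might feed a detector, the sketch below derives a few simple features from one record. The feature set and the `url_features` function are assumptions chosen for illustration, not the dataset authors' method. It relies only on the standard library's `urllib.parse`.

```python
from urllib.parse import parse_qsl, urlsplit


def url_features(url, domain, alexa_rank):
    """Build a small, illustrative feature dict for one dataset record."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    domain = domain.strip().lower()
    path_segments = [segment for segment in parts.path.split("/") if segment]
    return {
        "uses_https": parts.scheme == "https",
        "url_length": len(url),
        "path_depth": len(path_segments),
        "query_params": len(parse_qsl(parts.query)),
        # True when the URL host is the registered domain or a subdomain of it.
        "host_matches_domain": bool(domain)
        and (host == domain or host.endswith("." + domain)),
        "alexa_rank": alexa_rank,
    }


# Values taken from Example 4 in the records above.
features = url_features(
    "https://www.youtube.com/watch?v=dnbkysr3hoo", "YOUTUBE.COM", 2
)
print(features)
```

Features like these could be passed to any classifier along with the Property label. Which features actually help is an empirical question that this sketch does not answer.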
Phishing Webpage Examples
- Includes screenshots of phishing webpages mimicking well‑known brands such as WHO, the UK government, Chase Bank, Netflix, Adobe, Facebook, Microsoft, PayPal, Yahoo, etc.
Topics
Cybersecurity
URL Analysis
Source
Organization: github
Created: 7/18/2020