JUHE API Marketplace
DATASET
Open Source Community

CICIDS2018

The dataset comprises labeled network traffic data, encompassing various attacks (e.g., DoS, brute‑force, SQL injection, botnet) and normal traffic.

Updated 10/3/2024
github

Description

Dataset Overview

Dataset Information

Dataset Name

  • CICIDS2018 Dataset

Dataset Description

  • Description: This dataset contains labeled network traffic data covering multiple attack types (e.g., DoS, brute‑force, SQL injection, botnet) and normal traffic.
  • Link: The dataset can be downloaded here.
  • Size: Large dataset split into multiple CSV files, total size exceeds several hundred MB.

Dataset Usage

  • Training Data: dataset/train_data.csv
  • Test Data: dataset/test.csv
  • Training Data Version: artifacts/train_data.csv

Dataset Processing

Data Ingestion

  • Script: src/components/data_ingestion.py

Data Transformation

  • Script: src/components/data_transformation.py

Model Training

  • Script: src/components/model_trainer.py

Model Performance

Test Accuracy

  • Test Accuracy: 89.75%
  • Training Accuracy: 89.87%

F1 Score

  • Test F1 Score: 88.27%
  • Training F1 Score: 88.40%

Recall

  • Test Recall: 89.75%
  • Training Recall: 89.87%

Precision

  • Test Precision: 89.08%
  • Training Precision: 89.31%

Balanced Accuracy

  • Balanced Accuracy: 86.55%

ROC AUC

  • Test ROC AUC: 99.17%
  • Training ROC AUC: 99.21%

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Cybersecurity
Network Intrusion Detection

Source

Organization: github

Created: 10/2/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.