Back to datasets
Dataset assetOpen Source CommunityCybersecurityIntrusion Detection

Jetlime/NF-CSE-CIC-IDS2018-v2

The NF‑CSE‑CIC‑IDS2018‑v2 dataset is a NetFlow version derived from the original CSE‑CIC‑IDS2018 pcaps, intended for network intrusion detection systems. It includes 18,893,708 flow records, of which 2,258,141 (11.95 %) are attack samples and 16,635,567 (88.05 %) are benign. The dataset is stratified by attack type and split into training (95 %) and testing (5 %) sets. Features include source/destination IPs, ports, protocol, byte/packet counts, flow duration, and many derived statistics. ## Dataset Structure - **Classes**: Benign, BruteForce, Bot, DoS, DDoS, Infiltration, Web Attacks, etc. - **Feature List**: Includes fields such as IPV4_SRC_ADDR, IPV4_DST_ADDR, L4_SRC_PORT, PROTOCOL, IN_BYTES, OUT_BYTES, FLOW_DURATION_MILLISECONDS, TCP_FLAGS, and many others. - **Splits**: Train (≈17.9 M samples), Test (≈0.94 M samples). The dataset is publicly available for academic research; commercial use requires author permission.

Source
hugging_face
Created
Nov 28, 2025
Updated
May 24, 2024
Signals
451 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Description

NF‑CSE‑CIDS2018‑v2 is a NetFlow dataset generated from the original CSE‑CIC‑IDS2018 pcaps. It contains 18,893,708 flows, with 11.95 % attacks and 88.05 % benign traffic. The data are split 95 %/5 % into training and test sets and include features such as IP addresses, ports, protocol types, byte/packet counts, flow duration, TCP flags, and various derived statistics.

Class Distribution

ClassCount
Benign7,373,198
BruteForce287,597
Bot15,683
DoS269,361
DDoS380,096
Infiltration62,072
Web Attacks4,394

(The table continues with additional classes.)

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio