MalImg
The MalImg dataset is used for malware image classification research. By converting malware code into grayscale images, deep‑learning techniques can improve malware classification efficiency.
Description
MalwareInSight: Classifying Malware Images Using Convolutional Neural Networks
Content Overview
This repository contains all notebooks and the final report (presented as a journal paper) for the study "MalwareInSight: Classifying Malware Images Using Convolutional Neural Networks". The research investigates applying convolutional neural networks (CNN) to the malware classification problem by analyzing images, comparing custom architectures with well‑known pretrained models.
Abstract
Traditional signature‑based malware detection methods often cannot keep pace with the rapid evolution of malware. This work builds on Nataraj et al. (2011)’s innovative approach of converting malware code into grayscale images and leverages deep‑learning techniques to improve malware classification capability. Using the MalImg dataset, the study explores the effectiveness of a custom CNN architecture (named MalwareInSight) versus pretrained models such as VGG16 and VGG19. The custom CNN achieves 99.05 % accuracy, surpassing previous benchmarks. The study highlights the limitations of data augmentation and overly complex architectures on malware images. Results support malware image classification as a viable complementary tool to existing methods, with recommendations for expanding datasets, integrating texture‑based pretrained models, and exploring unsupervised learning in future work.
Keywords: malware, cybersecurity, image classification, deep learning, convolutional neural networks
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: github
Created: 6/25/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.