BullyDataset
A dataset of Sina Weibo comments collected for cyberbullying detection. A comment is labeled as bullying if it contains gender, racial, or regional slurs; unjustified abuse or insults; distortion of facts or unfounded accusations; expressions of violence; attacks on appearance or family members; repeated negative comments; calls for others to join an attack; or an unwanted or insulting nickname.
Updated 1/16/2024
Description
BullyDataset Overview
Dataset Description
- Source: Sina Weibo comments
- Purpose: Cyberbullying detection
Label Definition
- Bullying Comment: A Weibo comment that satisfies any of the following conditions:
  - Uses gender‑discriminatory, racial, or regional slurs.
  - Uses abusive or insulting language to criticize others without reasonable justification.
  - Clearly distorts facts, attempts to bias views on minority groups, or makes unfounded accusations.
  - Expresses violent tendencies toward minority groups or curses them.
  - Attacks a person’s appearance, body, or family members.
  - Repeatedly posts negative comments, or calls on others to join the attack.
  - Imposes an unwanted or insulting nickname on others.
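The "any of the following conditions" rule can be sketched in code. The following is a minimal illustration only: the class name, field names, and record layout are hypothetical and are not taken from the dataset's actual file format, which this page does not describe.

```python
from dataclasses import dataclass, fields


@dataclass
class CommentAnnotation:
    """Hypothetical per-comment annotation: one flag per listed criterion."""
    discriminatory_slur: bool = False
    unjustified_abuse: bool = False
    distortion_or_unfounded_accusation: bool = False
    violence_or_curse: bool = False
    attack_on_appearance_or_family: bool = False
    repeated_negativity_or_call_to_attack: bool = False
    insulting_nickname: bool = False

    def is_bullying(self) -> bool:
        # A comment counts as bullying if at least one criterion holds.
        return any(getattr(self, f.name) for f in fields(self))


# Usage: a comment flagged only for an insulting nickname is still bullying.
example = CommentAnnotation(insulting_nickname=True)
print(example.is_bullying())
```

Keeping one flag per criterion, rather than a single binary label, is a design choice of this sketch; it would let an annotator record why a comment was labeled as bullying.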
Citation Information
- Authors: Nijia Lu, Guohua Wu, Zhen Zhang, Yitao Zheng, Yizhi Ren, Kim‑Kwang Raymond Choo
- Year: 2019
- Paper Title: Cyberbullying Detection in Social Media Text Based on Character‑level Convolutional Neural Networks with Shortcuts
- Contact: lunijia@hdu.edu.cn
Topics
Online Bullying Detection
Social Media Analysis
Source
Organization: github
Created: 7/2/2019