DATASET
Open Source Community
mmdjiji/bert-chinese-idioms
This dataset is used to train a BERT language model for Chinese idioms. The training data is generated by the Node.js script preprocess.js.
Updated 6/28/2022
hugging_face
Description
Dataset Overview
License
- License Type: GPL-3.0
Dataset Usage
- Used to train a BERT language model for Chinese idioms
Preprocessing Tools
- Preprocessing script: preprocess.js
- Script type: Node.js
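
The listing does not show the contents of preprocess.js, so the following is only a hypothetical sketch of how a Node.js script could turn an idiom into masked-language-model training examples for BERT. The helper name `maskIdiom`, the output fields (`input`, `label`, `position`), and the one-character-at-a-time masking strategy are all assumptions for illustration, not the dataset author's actual method.

```javascript
// Hypothetical sketch: NOT the actual preprocess.js from this dataset.
// Assumption: each training example masks one character of an idiom,
// and the model is trained to predict the masked character.

const MASK = '[MASK]'; // mask token used by standard BERT vocabularies

// maskIdiom is a hypothetical helper name introduced for this example.
function maskIdiom(idiom) {
  // Array.from splits by code point, which keeps CJK characters intact.
  const chars = Array.from(idiom);
  return chars.map((ch, i) => ({
    input: chars.map((c, j) => (j === i ? MASK : c)).join(''),
    label: ch,      // the character the model should recover
    position: i,    // index of the masked character
  }));
}

// One example per character of the idiom.
const examples = maskIdiom('画蛇添足');
for (const example of examples) {
  console.log(JSON.stringify(example));
}
```

Masking a single character per example keeps each target unambiguous; a real pipeline might instead mask whole idioms or embed them in sentences, which this sketch does not attempt.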
Topics
Natural Language Processing
Chinese Idioms
Source
Organization: hugging_face
Created: Unknown