mmdjiji/bert-chinese-idioms

This dataset is used to train a BERT language model for Chinese idioms. The training data is generated by the Node.js script preprocess.js.

Updated 6/28/2022
hugging_face

Description

Dataset Overview

License

  • License Type: GPL-3.0

Dataset Usage

  • Used to train a BERT language model on Chinese idioms

Preprocessing Tools

  • Preprocessing script: preprocess.js
  • Script type: Node.js
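
The listing does not say what preprocess.js does internally. As a rough illustration only, a preprocessing step for masked-language-model training on idioms might look like the sketch below. The function name maskIdiom, the { text, label } record shape, and the one-[MASK]-per-character scheme are all assumptions made for this example; they are not taken from the dataset's actual script.

```javascript
// Hypothetical sketch: NOT the actual preprocess.js shipped with this dataset.
// Assumption: each training example hides one idiom inside a sentence so that
// a BERT-style model can learn to predict it.

const MASK = "[MASK]";

/**
 * Replace the first occurrence of `idiom` in `sentence` with one [MASK]
 * token per character. Returns null when the idiom is empty or absent.
 */
function maskIdiom(sentence, idiom) {
  if (idiom.length === 0) return null;
  const start = sentence.indexOf(idiom);
  if (start === -1) return null;

  // Count code points rather than UTF-16 units, so rare characters
  // outside the Basic Multilingual Plane still get a single mask each.
  const charCount = Array.from(idiom).length;

  const text =
    sentence.slice(0, start) +
    MASK.repeat(charCount) +
    sentence.slice(start + idiom.length);

  return { text, label: idiom };
}

// Usage: build one masked example and print it as a JSON line.
const example = maskIdiom("他做事总是画蛇添足。", "画蛇添足");
console.log(JSON.stringify(example));
```

Writing one JSON object per line keeps such output easy to stream into a training pipeline, but the real file format used by this dataset may differ.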

Topics

Natural Language Processing
Chinese Idioms

Source

Organization: hugging_face

Created: Unknown