This dataset is used to train a BERT language model on Chinese idioms. The training data is generated by the Node.js script preprocess.js.