Overview of Small Chinese Corpus Datasets

Dataset List

China Provincial‑City Latitude/Longitude Coordinates
- Path: city_location/
China Provincial Postal Code Directory
- Path: postal_provinces/
National Administrative and Urban‑Rural Division Codes (2015)
- Path: china_geo_code/
Idioms Collection
- Path: chengyu/
Chinese Personal Names, plus characters from Jin Yong novels, Romance of the Three Kingdoms, and Dream of the Red Chamber
- Path: chi_names/
Chinese Named‑Entity Recognition Sample
- Path: NER_chi/
Chinese Relation Recognition Sample
- Path: relation_multiple_chi/
Chinese Reading Comprehension Sample
- Path: reading_comprehension_chi/
Chinese Image‑Text QA data (based on MSCOCO)
- Path: Chinese_Visual_QA_pairs/