Chinese-web-novel
Text Data, Web Novels
This dataset was crawled from https://m.bqgui.cc, taking up to 25 chapters per book, and contains 12,740 entries. After three rounds of cleaning, each entry consists of a book title, a summary, and the novel text. The titles are high quality, but the summaries are of limited use. Some ads and stray symbols have been removed from the novel text, but it still contains low-quality content.
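Because the novel text still contains some leftover ads, a further filtering pass may be useful before training on it. The following is a minimal sketch only, not the cleaning procedure used to build the dataset. The field names (`title`, `summary`, `text`), the `NovelEntry` class, and the rule that any line containing a URL or domain name counts as ad residue are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class NovelEntry:
    # Hypothetical field names; the dataset's real column names may differ.
    title: str
    summary: str
    text: str


# Assumed heuristic: a line mentioning a URL or a bare domain is ad residue.
AD_PATTERN = re.compile(
    r"https?://|www\.|[A-Za-z0-9-]+\.(?:com|cc|net|org)",
    re.IGNORECASE,
)


def strip_ad_lines(text: str) -> str:
    """Drop blank lines and lines matching AD_PATTERN; keep the rest in order."""
    kept = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped and not AD_PATTERN.search(stripped):
            kept.append(stripped)
    return "\n".join(kept)


def clean_entry(entry: NovelEntry) -> NovelEntry:
    """Return a copy of the entry with trimmed fields and ad lines removed."""
    return NovelEntry(
        title=entry.title.strip(),
        summary=entry.summary.strip(),
        text=strip_ad_lines(entry.text),
    )
```

A line-level filter like this is deliberately conservative: it removes whole lines rather than trying to repair them, so it can also discard genuine prose that happens to mention a domain name. Since the summaries are described as having low usability, a downstream pipeline might ignore that field entirely and keep only the title and cleaned text.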
Source: huggingface | Updated: Oct 16, 2024 | 1,463 views | Linked