zhihu_rlhf_3k
Social QAPreference Datasets
Over 3k human preference records derived from Zhihu Q&A, each question provides a pair of answers with differing up‑vote counts.
Source githubUpdated Apr 10, 2024395 viewsLinked
Inspect dataset
Browse trusted datasets for evaluation, enrichment, and production use.