Explore high-quality datasets for your AI and machine learning projects.
TeleQnA is a comprehensive dataset designed to evaluate large language models' knowledge in the telecommunications domain. It comprises 10,000 multiple‑choice questions divided into five categories: Lexicon (500), Research Overview (2,000), Research Publications (4,500), Standards Overview (1,000) and Standards Specifications (2,000). Each question is represented in JSON format with five fields: question, options, answer, explanation, and category. Experimental code is provided to assess the performance of OpenAI models such as GPT‑3.5.