DATASET
Open Source Community
Education Dialogue Dataset
The Education Dialogue dataset comprises dialogues generated by Gemini Ultra, occurring between teachers and students. Teachers are prompted to teach specific topics, while students are prompted with their learning preferences. The dataset includes 40,000 training examples and 7,234 test examples, each consisting of a complete teacher‑student conversation with metadata on the topic and teacher/student preferences.
Updated 10/29/2024
github
Description
Education Dialogue Dataset
Data Description
- Contains 40,000 training samples and 7,234 test samples.
- Each sample is a complete teacher‑student conversation, including topic and teacher/student preference metadata.
Data Format
- The data consist of six JSON files: five for training and one for testing.
- Each dialogue entry includes the following fields:
background_info: context information containing:topic: the subject the teacher needs to teach.student_prefrences: the student's preferred learning mode, e.g., lecture‑style or hands‑on activity.teacher_prefrences: the teacher's preferred teaching mode, e.g., lecture‑style or hands‑on activity.student_reactions: the student's reaction if not taught in their preferred way, e.g., loss of interest or possible adaptation.teacher_reactions: the teacher's reaction if the student is not taught in the teacher's preferred way, e.g., feeling frustrated or possible adaptation.
conversation: a list of exchanges between teacher and student; each turn includes arolefield identifying the speaker and atextfield containing the utterance.
Data Generation
- Data were generated by prompting Gemini Ultra with the following instructions:
- Simulate a learning scenario where ... (content omitted to preserve original prompting details).
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Education Dialogue
Dialogue Generation
Source
Organization: github
Created: 10/29/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.