Back to datasets
Dataset assetOpen Source CommunityDialogue GenerationEducation Dialogue
Education Dialogue Dataset
The Education Dialogue dataset comprises dialogues generated by Gemini Ultra, occurring between teachers and students. Teachers are prompted to teach specific topics, while students are prompted with their learning preferences. The dataset includes 40,000 training examples and 7,234 test examples, each consisting of a complete teacher‑student conversation with metadata on the topic and teacher/student preferences.
Source
github
Created
Oct 29, 2024
Updated
Oct 29, 2024
Signals
509 views
Availability
Linked source ready
Overview
Dataset description and usage context
Education Dialogue Dataset
Data Description
- Contains 40,000 training samples and 7,234 test samples.
- Each sample is a complete teacher‑student conversation, including topic and teacher/student preference metadata.
Data Format
- The data consist of six JSON files: five for training and one for testing.
- Each dialogue entry includes the following fields:
background_info: context information containing:topic: the subject the teacher needs to teach.student_prefrences: the student's preferred learning mode, e.g., lecture‑style or hands‑on activity.teacher_prefrences: the teacher's preferred teaching mode, e.g., lecture‑style or hands‑on activity.student_reactions: the student's reaction if not taught in their preferred way, e.g., loss of interest or possible adaptation.teacher_reactions: the teacher's reaction if the student is not taught in the teacher's preferred way, e.g., feeling frustrated or possible adaptation.
conversation: a list of exchanges between teacher and student; each turn includes arolefield identifying the speaker and atextfield containing the utterance.
Data Generation
- Data were generated by prompting Gemini Ultra with the following instructions:
- Simulate a learning scenario where ... (content omitted to preserve original prompting details).
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.