llm-blender/mix-instruct
MixInstruct is a dataset released for the LLM‑Blender project. It contains responses from 11 currently popular instruction‑following LLMs, including Stanford Alpaca, FastChat Vicuna, Dolly V2, StableLM, Open Assistant, Koala, Baize, Flan‑T5, ChatGLM, MOSS, and Mosaic MPT. The dataset is evaluated with automatic metrics (BLEU, ROUGE, BERTScore, BARTScore) and pairwise comparisons of 4,771 test samples performed by ChatGPT. The format is JSON, with fields for instruction, input, output, and candidate responses, each accompanied by detailed scores.
Description
Dataset Overview
Basic Information
- Dataset Name: MixInstruct
- Project: LLM‑Blender
- License: MIT
- Task Category: Text Generation
- Language: English
- Dataset Size: 100K<n<1M
Data Content
- Included Models: The dataset includes 11 responses from popular instruction‑following LLMs, namely Stanford Alpaca, FastChat Vicuna, Dolly V2, StableLM, Open Assistant, Koala, Baize, Flan‑T5, ChatGLM, MOSS, and Mosaic MPT.
- Evaluation Metrics: Automatic metrics such as BLEU, ROUGE, BERTScore, BARTScore are provided, along with pairwise comparison results generated by ChatGPT.
Data Format
- Structure: JSON, each entry contains
id,instruction,input,output, andcandidatesfields. - Additional Fields:
cmp_resultsrecords model‑to‑model comparison outcomes produced by ChatGPT.
Evaluation Results
- Automatic Metrics: Detailed performance metrics for training, validation, and test splits are supplied for each model.
- ChatGPT Comparison Results: Includes BERTScore, BARTScore, BLEURT, GPT‑Rank and other scores for model pairwise comparisons.
Best Model Performance
- Top Model: Open Assistant achieves the best scores across multiple metrics.
- Oracle Model: An oracle model's performance is provided for reference and comparison with the top model.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.