
llm-blender/mix-instruct

MixInstruct is a dataset released for the LLM‑Blender project. It contains responses from 11 popular instruction‑following LLMs: Stanford Alpaca, FastChat Vicuna, Dolly V2, StableLM, Open Assistant, Koala, Baize, Flan‑T5, ChatGLM, MOSS, and Mosaic MPT. Candidate responses are scored with automatic metrics (BLEU, ROUGE, BERTScore, BARTScore), and ChatGPT performed pairwise comparisons on 4,771 test samples. Entries are stored as JSON with fields for the instruction, input, output, and candidate responses; each candidate response is accompanied by detailed scores.

Updated 6/9/2023
hugging_face

Description

Dataset Overview

Basic Information

  • Dataset Name: MixInstruct
  • Project: LLM‑Blender
  • License: MIT
  • Task Category: Text Generation
  • Language: English
  • Dataset Size: 100K<n<1M

Data Content

  • Included Models: The dataset includes responses from 11 popular instruction‑following LLMs, namely Stanford Alpaca, FastChat Vicuna, Dolly V2, StableLM, Open Assistant, Koala, Baize, Flan‑T5, ChatGLM, MOSS, and Mosaic MPT.
  • Evaluation Metrics: Automatic metrics such as BLEU, ROUGE, BERTScore, and BARTScore are provided, along with pairwise comparison results generated by ChatGPT.

Data Format

  • Structure: JSON; each entry contains the id, instruction, input, output, and candidates fields.
  • Additional Fields: cmp_results records model‑to‑model comparison outcomes produced by ChatGPT.
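
As a rough illustration of the entry layout described above, the sketch below parses one JSON entry, checks for the five documented fields, and ranks its candidates by a chosen metric. The sample entry is hypothetical, and the nested key names inside each candidate ("model", "text", "scores") as well as the metric keys ("bleu", "bertscore") are assumptions made for illustration; the listing only states that candidates come with detailed scores.

```python
import json

# Top-level fields named in the dataset description.
REQUIRED_FIELDS = ("id", "instruction", "input", "output", "candidates")

# Hypothetical entry. The values and the nested candidate keys
# ("model", "text", "scores") are illustrative assumptions.
SAMPLE_ENTRY = """
{
  "id": "example/0",
  "instruction": "Summarize the text.",
  "input": "LLM-Blender ensembles several language models.",
  "output": "LLM-Blender combines multiple LLMs.",
  "candidates": [
    {"model": "model_a", "text": "It combines models.",
     "scores": {"bleu": 0.21, "bertscore": 0.88}},
    {"model": "model_b", "text": "A blender.",
     "scores": {"bleu": 0.05, "bertscore": 0.71}}
  ]
}
"""


def parse_entry(raw):
    """Decode one JSON entry and verify the documented fields are present."""
    entry = json.loads(raw)
    missing = [field for field in REQUIRED_FIELDS if field not in entry]
    if missing:
        raise ValueError(f"entry is missing fields: {missing}")
    return entry


def rank_candidates(entry, metric):
    """Return the entry's candidates sorted from highest to lowest score."""
    return sorted(
        entry["candidates"],
        key=lambda cand: cand["scores"][metric],
        reverse=True,
    )


entry = parse_entry(SAMPLE_ENTRY)
ranked = rank_candidates(entry, "bertscore")
best_model = ranked[0]["model"]
print(best_model)
```

Sorting treats higher scores as better, which holds for the metrics listed here; a lower-is-better metric would need `reverse=False`.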

Evaluation Results

  • Automatic Metrics: Detailed performance metrics for training, validation, and test splits are supplied for each model.
  • ChatGPT Comparison Results: Includes BERTScore, BARTScore, BLEURT, GPT‑Rank, and other scores for pairwise model comparisons.

Best Model Performance

  • Top Model: Open Assistant achieves the best scores across multiple metrics.
  • Oracle Model: An oracle model's performance is provided for reference and comparison with the top model.
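
One way to read the top-model versus oracle comparison is sketched below. It assumes that "oracle" means picking the best-scoring candidate for each example separately, which the listing does not define explicitly. The entries, model names, and the "model" and "scores" keys are hypothetical placeholders rather than values from the dataset.

```python
from collections import defaultdict
from statistics import mean


def per_model_mean(entries, metric):
    """Average each model's score for `metric` over all entries."""
    by_model = defaultdict(list)
    for entry in entries:
        for cand in entry["candidates"]:
            by_model[cand["model"]].append(cand["scores"][metric])
    return {model: mean(scores) for model, scores in by_model.items()}


def oracle_mean(entries, metric):
    """Average of the best candidate score per entry (assumed oracle definition)."""
    return mean(
        max(cand["scores"][metric] for cand in entry["candidates"])
        for entry in entries
    )


# Hypothetical scores for two examples and two models.
entries = [
    {"candidates": [
        {"model": "model_a", "scores": {"bertscore": 0.9}},
        {"model": "model_b", "scores": {"bertscore": 0.7}},
    ]},
    {"candidates": [
        {"model": "model_a", "scores": {"bertscore": 0.6}},
        {"model": "model_b", "scores": {"bertscore": 0.7}},
    ]},
]

model_means = per_model_mean(entries, "bertscore")
top_model = max(model_means, key=model_means.get)
oracle = oracle_mean(entries, "bertscore")
print(top_model, model_means[top_model], oracle)
```

Under this definition the oracle average can never fall below the top model's average, so the gap between the two indicates how much a per-example selector could gain over always using the single best model.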

Topics

Natural Language Processing
Machine Learning

Source

Organization: hugging_face

Created: Unknown
