Dataset asset · Open Source Community · Machine Learning · Code Review

fasterinnerlooper/codereviewer

The dataset has three parts (generation, refinement, and quality), each with training, test, and validation splits, and it specifies the data file paths for every split.

Source: hugging_face
Created: Nov 28, 2025
Updated: Mar 13, 2024
Signals: 134 views
Availability: Linked source ready
Overview

Dataset description and usage context

Dataset Configuration Details

Generation Task

  • Training Set: generation/gen-train.jsonl
  • Test Set: generation/gen-test.jsonl
  • Validation Set: generation/gen-valid.jsonl

Refinement Task

  • Training Set: refinement/ref-train.jsonl
  • Test Set: refinement/ref-test.jsonl
  • Validation Set: refinement/ref-valid.jsonl

Quality Evaluation Task

  • Training Sets:
    • quality/cls-train-chunk-0.jsonl
    • quality/cls-train-chunk-1.jsonl
    • quality/cls-train-chunk-2.jsonl
    • quality/cls-train-chunk-3.jsonl
  • Test Set: quality/cls-test.jsonl
  • Validation Set: quality/cls-valid.jsonl
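
The file layout above can be captured as a small lookup table and read with the standard library alone. The following is a minimal sketch, not the dataset's official loader: `DATA_FILES`, `iter_records`, and the `root` directory are hypothetical names introduced here, and the sketch assumes the files have already been copied to a local folder that preserves the relative paths listed above.

```python
import json
from pathlib import Path
from typing import Iterator

# Relative paths copied from the configuration list above.
# The quality task's training data is split across four chunk files.
DATA_FILES = {
    "generation": {
        "train": ["generation/gen-train.jsonl"],
        "test": ["generation/gen-test.jsonl"],
        "validation": ["generation/gen-valid.jsonl"],
    },
    "refinement": {
        "train": ["refinement/ref-train.jsonl"],
        "test": ["refinement/ref-test.jsonl"],
        "validation": ["refinement/ref-valid.jsonl"],
    },
    "quality": {
        "train": [f"quality/cls-train-chunk-{i}.jsonl" for i in range(4)],
        "test": ["quality/cls-test.jsonl"],
        "validation": ["quality/cls-valid.jsonl"],
    },
}


def iter_records(root: Path, task: str, split: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-blank line of the chosen split.

    `root` is a hypothetical local directory holding a copy of the files.
    Chunked splits are read in the order they are listed in DATA_FILES.
    """
    for rel_path in DATA_FILES[task][split]:
        with open(root / rel_path, encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if line:  # tolerate blank lines between records
                    yield json.loads(line)
```

For example, `for record in iter_records(Path("codereviewer"), "quality", "train")` would stream all four training chunks in sequence, where `codereviewer` is an assumed local folder name. The record fields are not described on this page, so the sketch returns each line as a plain dictionary without assuming a schema.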