Dataset asset, Open Source Community, Language Codes, Internationalization

ISO Language Codes

Contains ISO 639‑1 and ISO 639‑2 language codes, together with IETF language tags. The dataset provides ISO 639‑1 two‑letter codes for 184 languages with their English names, plus the full set of ISO 639‑2 three‑letter codes with both English and French names. Special‑purpose codes (mul, und, mis, zxx) are also included.

Source: GitHub
Created: Jan 13, 2015
Updated: Jan 24, 2024

Dataset Overview

Data Sources

Data Files

data/language-codes.csv

  • Contains ISO 639‑1 (two‑letter) codes for 184 languages and their English names.
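
A minimal sketch of turning this file into a code-to-name lookup. The header names `alpha2` and `English` are assumptions about the CSV layout (the description above does not name the columns), and the inline rows stand in for the real file so the example runs on its own.

```python
import csv
import io

# Inline stand-in for data/language-codes.csv.
# Header names "alpha2" and "English" are assumed, not confirmed by the description.
SAMPLE_ALPHA2 = '"alpha2","English"\n"en","English"\n"fr","French"\n'

def load_names(fileobj, code_col="alpha2", name_col="English"):
    """Return a dict mapping two-letter codes to English names."""
    reader = csv.DictReader(fileobj)
    return {row[code_col].strip().lower(): row[name_col].strip() for row in reader}

names = load_names(io.StringIO(SAMPLE_ALPHA2))
print(names.get("fr"))
```

To read the real file instead, pass `open("data/language-codes.csv", newline="", encoding="utf-8")` and adjust the column names if the actual headers differ.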

data/language-codes-3b2.csv

  • Contains ISO 639‑2 (three‑letter) bibliographic codes, the corresponding ISO 639‑1 codes, and English names.
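
Because this file pairs each bibliographic code with its two-letter code, it can serve as a conversion table. The sketch below assumes the headers are `alpha3-b`, `alpha2`, and `English`; these names are illustrative and should be checked against the actual file. The sample rows are inlined so the example is self-contained.

```python
import csv
import io

# Inline stand-in for data/language-codes-3b2.csv (header names are assumed).
SAMPLE_3B2 = (
    '"alpha3-b","alpha2","English"\n'
    '"ger","de","German"\n'
    '"fre","fr","French"\n'
)

def build_b_to_alpha2(fileobj, col3="alpha3-b", col2="alpha2"):
    """Map three-letter bibliographic codes to two-letter codes."""
    mapping = {}
    for row in csv.DictReader(fileobj):
        # Skip rows without a two-letter code, in case any are present.
        if row[col2]:
            mapping[row[col3]] = row[col2]
    return mapping

b_to_alpha2 = build_b_to_alpha2(io.StringIO(SAMPLE_3B2))
print(b_to_alpha2["ger"])
```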

data/language-codes-full.csv

  • Includes all ISO 639‑2 (three‑letter) codes, associated ISO 639‑1 codes (if any), and English and French names for each language.
  • Two versions of three‑letter codes are present: bibliographic and terminologic. Every language has a bibliographic code; only a few have a separate terminologic code, which is designed to resemble the corresponding ISO 639‑1 two‑letter code.
  • Includes four special codes: mul (multiple languages), und (undetermined), mis (uncoded languages), and zxx (no linguistic content); and the range qaa‑qtz, which is reserved for local use.
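
The points above suggest two small helpers when working with the full file: one that prefers the terminologic code and falls back to the bibliographic one, and one that separates special and reserved codes from ordinary language codes. This is a sketch only; the header names (`alpha3-b`, `alpha3-t`, `alpha2`, `English`, `French`) and the convention that a missing terminologic code appears as an empty field are assumptions about the file, and the rows are inlined samples.

```python
import csv
import io
import re

# Inline stand-in for data/language-codes-full.csv (header names are assumed).
SAMPLE_FULL = (
    '"alpha3-b","alpha3-t","alpha2","English","French"\n'
    '"ger","deu","de","German","allemand"\n'
    '"eng","","en","English","anglais"\n'
)

SPECIAL_CODES = {"mul", "und", "mis", "zxx"}
# qaa-qtz: first letter q, second letter a..t, third letter a..z.
RESERVED_RANGE = re.compile(r"q[a-t][a-z]")

def preferred_code(row):
    """Use the terminologic code when present, else the bibliographic one."""
    return row["alpha3-t"] or row["alpha3-b"]

def classify(code):
    """Return 'special', 'reserved', or 'language' for a three-letter code."""
    if code in SPECIAL_CODES:
        return "special"
    if RESERVED_RANGE.fullmatch(code):
        return "reserved"
    return "language"

rows = list(csv.DictReader(io.StringIO(SAMPLE_FULL)))
for row in rows:
    print(row["English"], preferred_code(row), classify(row["alpha3-b"]))
```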

data/ietf-language-tags.csv

License
