Dataset asset, Open Source Community, Language Codes, Internationalization
ISO Language Codes
Contains comprehensive information on ISO 639‑1 and ISO 639‑2 language codes, as well as IETF language tags. The dataset provides codes for 184 languages along with their English names, and more detailed ISO 639‑2 entries that include both English and French names. Special language codes and IETF tags are also included.
Source: GitHub
Created: Jan 13, 2015
Updated: Jan 24, 2024
Dataset Overview
Data Sources
- Data sourced from the Library of Congress (the registration authority for ISO 639‑2) and the Unicode Common Locale Data Repository.
Data Files
data/language-codes.csv
- Contains ISO 639‑1 (two‑letter) codes for 184 languages and their English names.
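A minimal sketch of loading this file into a lookup table with Python's standard `csv` module. The column headers `alpha2` and `English` are assumptions (the description above does not state them), and the two sample rows stand in for the real file.

```python
import csv
import io

# Sample rows standing in for data/language-codes.csv.
# The header names "alpha2" and "English" are assumed, not confirmed above.
SAMPLE = """alpha2,English
en,English
fr,French
"""


def load_names(handle, code_col="alpha2", name_col="English"):
    """Return a dict mapping two-letter codes to English names."""
    reader = csv.DictReader(handle)
    return {row[code_col].lower(): row[name_col] for row in reader}


names = load_names(io.StringIO(SAMPLE))
print(names.get("fr", "unknown"))
```

To read the real file, something like `open("data/language-codes.csv", newline="", encoding="utf-8")` could be passed in place of the `io.StringIO` object, adjusting the column names if the actual header differs.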
data/language-codes-3b2.csv
- Contains ISO 639‑2 (three‑letter) bibliographic codes, the corresponding ISO 639‑1 codes, and English names.
data/language-codes-full.csv
- Includes all ISO 639‑2 (three‑letter) codes, associated ISO 639‑1 codes (if any), and English and French names for each language.
- Two kinds of three‑letter codes are present: bibliographic (ISO 639‑2/B) and terminologic (ISO 639‑2/T). Every language has a bibliographic code; only a few also have a distinct terminologic code, which is designed to resemble the corresponding ISO 639‑1 two‑letter code.
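- Includes four special codes: mul (multiple languages), und (undetermined), mis (uncoded languages), and zxx (no linguistic content); and the range qaa‑qtz, which is reserved for local use.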
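The points above suggest that a consumer may want to accept either three‑letter form and to recognise the special and reserved codes. The sketch below shows one way to do that. The header names (`alpha3-b`, `alpha3-t`, `alpha2`, `English`, `French`) are assumptions, as is the convention that an empty `alpha3-t` cell means no separate terminologic code; the sample rows are illustrative only.

```python
import csv
import io

# Sample rows standing in for data/language-codes-full.csv.
# Header names are assumed; an empty alpha3-t cell is taken to mean
# that the language has no separate terminologic code.
SAMPLE = """alpha3-b,alpha3-t,alpha2,English,French
ger,deu,de,German,allemand
eng,,en,English,anglais
mul,,,Multiple languages,multilingue
"""

SPECIAL_CODES = {"mul", "und", "mis", "zxx"}


def is_reserved(code):
    """Return True for three-letter codes in the reserved range qaa-qtz."""
    code = code.lower()
    if len(code) != 3 or not (code.isascii() and code.isalpha()):
        return False
    return "qaa" <= code <= "qtz"


def build_index(handle):
    """Map every bibliographic and terminologic code to its row."""
    index = {}
    for row in csv.DictReader(handle):
        index[row["alpha3-b"]] = row
        if row["alpha3-t"]:
            index[row["alpha3-t"]] = row
    return index


index = build_index(io.StringIO(SAMPLE))
print(index["deu"]["English"], index["ger"]["English"])
```

Indexing both forms to the same row keeps lookups symmetric, so a caller does not need to know which variant a given source uses.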
data/ietf-language-tags.csv
- Lists all IETF language tags from the official resource at http://www.iana.org/assignments/language-tag-extensions-registry, as included in the /main directory of http://www.unicode.org/Public/cldr/latest/core.zip.
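As a rough illustration of how tags from this file might be related back to the two‑letter codes, the sketch below groups tags by their primary language subtag. The tags and names are hypothetical sample values, `primary_language` is a helper introduced here, and the split handles only the leading subtag rather than full IETF tag parsing.

```python
def primary_language(tag):
    """Return the lowercase primary language subtag of a tag such as 'pt-BR'."""
    # Accept underscore-separated variants too, then take the first subtag.
    return tag.replace("_", "-").split("-")[0].lower()


# Hypothetical tags and names for illustration only.
names = {"en": "English", "pt": "Portuguese"}
tags = ["en-US", "pt-BR", "en"]

# Group each tag under its primary language subtag.
by_language = {}
for tag in tags:
    by_language.setdefault(primary_language(tag), []).append(tag)

for code, group in by_language.items():
    print(names.get(code, "unknown"), group)
```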
License
- This dataset is licensed under the Open Data Commons Public Domain Dedication and License (PDDL).
- Users should check the original sources for any specific restrictions when using the data.