A structured and enriched dataset for Japanese language learners and developers. This repository provides comprehensive information about Japanese kanji characters and vocabulary words used across JLPT levels N5 to N1.
A list of kanji used in JLPT exams, including:
kanji: The character itself.strokes: Number of strokes to write the kanji.radical_number: The index of the kanji's radical.frequency: Kanji frequency rank (lower = more common).jlpt: JLPT level (N5toN1).begins: Number of words where this kanji appears at the beginning.used_in: Total number of words containing this kanji.component_in: Number of kanji that this kanji helps compose.description: A brief English explanation.
A list of Japanese words, including:
kanji: Word in kanji.reading: Reading in kana (hiragana).pos: Part of speech (abbreviated, seedictionary-tags.json).glossary_en: English definitions and example sentences.glossary_ru: Russian translations (optional).sequence: Usage frequency (higher = more frequent; negative = rare).
A decoding file for part-of-speech (POS) abbreviations used in dictionary.json.
- Language learning apps
- Linguistic research
- JLPT preparation tools
- Kanji/vocabulary search engines
- Educational games
- Import into Elasticsearch, SQLite, or other databases
{
"kanji": "加担",
"reading": "かたん",
"pos": "n vs vi",
"glossary_en": [
"support",
"participation",
"assistance",
"complicity",
"conspiracy"
],
"glossary_ru": [
"1) помощь; поддержка; участие (в чём-л.)",
"2) сговор; соучастие"
],
"sequence": 1590450
}