nachocab / words-by-frequency
A repository of words in multiple languages sorted by their frequency
☆11Updated last year
Alternatives and similar repositories for words-by-frequency:
Users that are interested in words-by-frequency are comparing it to the libraries listed below
- Extract data from German Wiktionary XML files.☆26Updated 2 months ago
- A simple phonetic respelling for the English language☆10Updated last year
- Des exemples et des supports pour les structures de données avancées en java. www.ispm-edu.com☆9Updated 5 months ago
- A component-based CJK character search engine☆13Updated 7 months ago
- kanjidraw - handwritten kanji recognition library + gui☆24Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆98Updated last week
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆20Updated 3 years ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆44Updated 4 years ago
- List of the most common words in many languages☆168Updated this week
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆19Updated last year
- cc-kedict: Creative Commons Korean-English Dictionary☆41Updated 3 years ago
- Mirror of the Moby Project containing public-domain lexical resources; word lists, thesaurus, hyphenation, pronunciation.☆14Updated 10 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆10Updated 10 months ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆23Updated 7 years ago
- over 6_00_000 english words data set arranged with each words frequency☆15Updated 3 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆48Updated 4 months ago
- Wikitionary in accessible JSON format☆35Updated 2 years ago
- ☆72Updated 3 months ago
- This repo contains a list of the 44,998 most common Japanese words in order of frequency, as determined by the University of Leeds Corpus…☆71Updated 6 years ago
- A simple dictionary in Manchu, Chinese and English.☆11Updated 10 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- For review of draft Unihan database changes, removals, and additions by experts.☆52Updated this week
- Beautiful animated SVG or GIF kanji from KanjiVG data set.☆68Updated 8 years ago
- A CC0 blackletter font created for the Standard Ebooks project.☆9Updated 4 years ago
- Noto Mongolian☆15Updated 4 months ago
- A library for fetching and reading Tatoeba's weekly exports☆22Updated last year
- ☆28Updated 2 years ago
- Lexical data at Unicode☆69Updated 6 months ago
- Public resources for improving the Mongolian script’s text representation and shaping situation.☆17Updated last year
- A list of vocabulary lists☆21Updated 4 years ago