nachocab / words-by-frequency
A repository of words in multiple languages sorted by their frequency
☆12 Updated 2 years ago
Alternatives and similar repositories for words-by-frequency
Users who are interested in words-by-frequency are comparing it to the repositories listed below
- Extract data from German Wiktionary XML files. ☆26 Updated 10 months ago
- A simple phonetic respelling for the English language ☆10 Updated last month
- Unicode-only CJKV IDS data ☆13 Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆108 Updated last week
- A list of vocabulary lists ☆22 Updated 5 years ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat… ☆159 Updated 10 months ago
- Offline bilingual dictionaries made using data from Wiktionary ☆61 Updated 10 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆15 Updated 6 years ago
- Gather modern English word frequencies from all enwiki articles. ☆226 Updated last year
- Hand-written dictionaries from the FreeDict project ☆443 Updated 3 months ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion ☆78 Updated 6 months ago
- 🏆 • 5050 most frequent words in 109 languages ☆45 Updated 2 years ago
- Anki add-on to look up vocabulary using Wiktionary ☆22 Updated 8 months ago
- A library for fetching and reading Tatoeba's weekly exports ☆24 Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus ☆32 Updated last year
- Offline etymological dictionary based on Wiktionary data ☆21 Updated 3 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files ☆32 Updated 6 years ago
- A Python Wiktionary Parser ☆367 Updated 3 months ago
- Verb forms dictionary ☆67 Updated 8 years ago
- The Open English WordNet ☆643 Updated last month
- Wiktionary dump file parser and multilingual data extractor ☆1,030 Updated last week
- A component-based CJK character search engine ☆14 Updated last year
- This project brings the official Duolingo Stories to new languages, translated by a community effort. ☆191 Updated this week
- CSV files containing all French adjectives, adverbs, conjunctions, determiners, nouns, prepositions, pronouns, verbs and their gender, ty… ☆144 Updated last year
- Machine-readable Wiktionary ☆77 Updated last year
- Hyphenation of English words ☆13 Updated 8 years ago
- Wiktionary in accessible JSON format ☆35 Updated 2 years ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code ☆95 Updated 2 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project ☆55 Updated 2 months ago
- Collaborative data curation for Glottolog ☆176 Updated 2 weeks ago