nachocab / words-by-frequency
A repository of words in multiple languages sorted by their frequency
☆12 · Updated 2 years ago
Alternatives and similar repositories for words-by-frequency
Users who are interested in words-by-frequency are comparing it to the repositories listed below.
- A simple phonetic respelling for the English language ☆10 · Updated 3 months ago
- Gather modern English word frequencies from all enwiki articles. ☆222 · Updated last year
- Unicode-only CJKV IDS data ☆12 · Updated last year
- 🏆 • 5050 most frequent words in 109 languages ☆43 · Updated 2 years ago
- Pronunciation dictionaries for several languages, based on Wiktionary data. ☆20 · Updated 3 years ago
- The Open English WordNet ☆620 · Updated this week
- Extract data from German Wiktionary XML files. ☆26 · Updated 8 months ago
- Verb forms dictionary ☆67 · Updated 8 years ago
- universal syllabification algorithms ☆45 · Updated 2 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆15 · Updated 5 years ago
- Helsinki Finite-State Technology (library and application suite) ☆133 · Updated last week
- Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les tra… ☆16 · Updated this week
- Monolingual wordlists with pronunciation information in IPA ☆670 · Updated 3 months ago
- A list of vocabulary lists ☆22 · Updated 5 years ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆32 · Updated last year
- Public resources for improving the Mongolian script’s text representation and shaping situation. ☆18 · Updated last year
- A Python Wiktionary Parser ☆363 · Updated last month
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆24 · Updated 8 years ago
- A nicely typeset table of the 100 most common radicals in Chinese characters ☆17 · Updated 4 years ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion ☆77 · Updated 5 months ago
- Collaborative data curation for Glottolog ☆172 · Updated last month
- Etymological graphs based on Wiktionary dumps ☆23 · Updated 6 months ago
- A component-based CJK character search engine ☆14 · Updated last year
- Hieroglyphs Everywhere fonts ☆22 · Updated 3 years ago
- Compare English corpora by measuring differences between common words. ☆13 · Updated 2 years ago
- SegBo: A database of borrowed sounds in the world’s languages ☆16 · Updated last year
- Offline etymological dictionary based on Wiktionary data ☆21 · Updated 3 years ago
- Lexical data at Unicode ☆70 · Updated last year
- ThamizhiMorph: A Tamil Morphological Analyser and Generator ☆20 · Updated last year
- A simple dictionary in Manchu, Chinese and English. ☆12 · Updated 10 years ago