nachocab / words-by-frequency
A repository of words in multiple languages sorted by their frequency
☆12 Updated 2 years ago
Alternatives and similar repositories for words-by-frequency
Users who are interested in words-by-frequency are comparing it to the libraries listed below.
- A simple phonetic respelling for the English language ☆10 Updated 2 weeks ago
- Extract data from German Wiktionary XML files. ☆26 Updated this week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆108 Updated last month
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat… ☆162 Updated last year
- Gather modern English word frequencies from all enwiki articles. ☆227 Updated last year
- A list of vocabulary lists ☆22 Updated 5 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆62 Updated 10 years ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com) ☆56 Updated last month
- Unicode-only CJKV IDS data ☆13 Updated last year
- Verb forms dictionary ☆68 Updated 8 years ago
- Web front end for WikDict dictionaries ☆21 Updated 2 months ago
- A text file containing English words, along with the definition, parts of speech (noun, verb, adjective, etc.), and a link to the url where … ☆12 Updated last year
- ThamizhiMorph: A Tamil Morphological Analyser and Generator ☆19 Updated 2 years ago
- A Python Wiktionary Parser ☆369 Updated 5 months ago
- A library for fetching and reading Tatoeba's weekly exports ☆24 Updated last month
- A component-based CJK character search engine ☆14 Updated last year
- Monolingual wordlists with pronunciation information in IPA ☆707 Updated 7 months ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆35 Updated last year
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆25 Updated 8 years ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary. ☆49 Updated 2 years ago
- List of Chinese characters ordered by frequency rank (from most common to least common). Based on Jun Da's Modern Chinese Character Frequ… ☆36 Updated 2 years ago
- Global ASP - African Storybook Project for the World ☆16 Updated last month
- 🏆 • 5050 most frequent words in 109 languages ☆48 Updated 3 years ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty… ☆143 Updated last year
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆15 Updated 6 years ago
- universal syllabification algorithms ☆45 Updated 3 years ago
- SegBo: A database of borrowed sounds in the world’s languages ☆16 Updated last year
- Wiktionary parser tool for many language editions. ☆54 Updated 3 years ago
- All the words from Google Books, sorted by frequency ☆123 Updated 2 years ago
- Helsinki Finite-State Technology (library and application suite) ☆136 Updated this week