nachocab / words-by-frequencyLinks
A repository of words in multiple languages sorted by their frequency
☆12Updated 2 years ago
Alternatives and similar repositories for words-by-frequency
Users that are interested in words-by-frequency are comparing it to the libraries listed below
Sorting:
- Extract data from German Wiktionary XML files.☆26Updated 9 months ago
- A simple phonetic respelling for the English language☆10Updated 2 weeks ago
- A list of vocabulary lists☆22Updated 5 years ago
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆159Updated 9 months ago
- Offline bilingual dictionaries made using data from Wiktionary☆58Updated 10 years ago
- A Python Wiktionary Parser☆364Updated 2 months ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- Unicode-only CJKV IDS data☆12Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated 2 weeks ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- Verb forms dictionary☆67Updated 8 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆32Updated last year
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆25Updated 2 months ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆24Updated 8 years ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆49Updated 5 years ago
- Gather modern English word frequencies from all enwiki articles.☆225Updated last year
- A modern, interlingual wordnet interface for Python☆263Updated last month
- 🏆 • 5050 most frequent words in 109 languages☆44Updated 2 years ago
- This is a project that aims to make the Coptic language more learnable.☆10Updated this week
- Offline etymological dictionary based on Wiktionary data☆21Updated 3 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆73Updated 10 months ago
- Hyphenation of English words☆12Updated 8 years ago
- Sources of Collatinus software - Latin lemmatizer, morphological analyzer and scansion☆77Updated 5 months ago
- Helsinki Finite-State Technology (library and application suite)☆133Updated 3 weeks ago
- Libraries and command-line tools for metrical analysis of epic Greek hexameter☆28Updated 7 years ago
- Create PDFs (A4 format) for practicing Chinese character writing. Completely written in HTML, CSS, Javascript (with jQuery).☆36Updated 6 years ago
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆28Updated 2 years ago
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated last year
- CSV files containing all french adjectives, adverbs, conjunctions, determiners, nouns, prepositions, pronouns, verbs and their gender, ty…☆140Updated last year
- universal syllabification algorithms☆45Updated 2 years ago