nachocab / words-by-frequencyLinks
A repository of words in multiple languages sorted by their frequency
☆11Updated last year
Alternatives and similar repositories for words-by-frequency
Users that are interested in words-by-frequency are comparing it to the libraries listed below
Sorting:
- A simple phonetic respelling for the English language☆10Updated 2 weeks ago
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆21Updated last year
- Extract data from German Wiktionary XML files.☆26Updated 5 months ago
- Hyphenation of English words☆12Updated 8 years ago
- kanjidraw - handwritten kanji recognition library + gui☆25Updated 2 years ago
- A list of vocabulary lists☆21Updated 4 years ago
- Des exemples et des supports pour les structures de données avancées en java. www.ispm-edu.com☆9Updated 8 months ago
- Neural machine translation between English and toki pona using transfer learning☆24Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆31Updated 8 months ago
- List of the most common words in many languages☆172Updated last week
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆24Updated 8 years ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆20Updated 3 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆46Updated 4 years ago
- Public resources for improving the Mongolian script’s text representation and shaping situation.☆17Updated last year
- Lexical data at Unicode☆68Updated 9 months ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆14Updated 5 years ago
- A component-based CJK character search engine☆13Updated 9 months ago
- An NLP pipeline for Hebrew☆38Updated this week
- UF5 is an algset to cycle 5 edge pieces. The code/pdf/images in this repository will be extracting efficient algorithms from Cube Explore…☆9Updated this week
- A library for fetching and reading Tatoeba's weekly exports☆23Updated last year
- “Tangut Cangjie Input Method” by KAWASAKI Keigo for RIME☆13Updated 8 months ago
- Gather modern English word frequencies from all enwiki articles.☆213Updated last year
- This repo contains a list of the 44,998 most common Japanese words in order of frequency, as determined by the University of Leeds Corpus…☆73Updated 6 years ago
- Helsinki Finite-State Technology (library and application suite)☆130Updated 2 weeks ago
- Editor for aligned parallel texts (personal desktop application).☆19Updated 4 years ago
- Hieroglyphs Everywhere fonts☆20Updated 3 years ago
- This is a project that aims to make the Coptic language more learnable.☆8Updated this week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆101Updated 2 weeks ago