sts10 / common_word_list_maker
Scrapes Google Books Ngram data to create a long word list
☆13Updated last year
Alternatives and similar repositories for common_word_list_maker
Users that are interested in common_word_list_maker are comparing it to the libraries listed below
Sorting:
- Combine and clean word lists☆87Updated 2 months ago
- Offline etymological dictionary based on Wiktionary data☆21Updated 3 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Wordlists designed for generating passphrases☆33Updated 3 weeks ago
- Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.☆21Updated 7 years ago
- Unofficial Anna's Archive API written in JS.☆40Updated last year
- A repository for word lists I've generated☆30Updated 3 months ago
- Batch download books from libgen☆16Updated 9 years ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆34Updated 3 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆100Updated this week
- Generate random passphrases☆28Updated last month
- Check the "health" of passwords in a KeePass database☆25Updated last month
- Terminal and gui ebook reader☆17Updated 2 months ago
- Expand / Unshorten an exhaustive list of Shortened URL's☆18Updated last year
- Real world example to demonstrate advanced techniques to unmarshall very large xml document with very low memory footprint.☆60Updated last month
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction☆37Updated 2 months ago
- Markdown text to a novel in ePub and PDF.☆52Updated 3 years ago
- Archive a reddit user's post history. Formatted overview of a profile, JSON containing every post, and picture downloads. Uses the pushs…☆52Updated 2 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆13Updated last year
- Script and sample dataset of all urban dictionary entry names (around 1.4 million total)☆90Updated 2 years ago
- A keylogger written in Rust to run on Windows☆21Updated 6 years ago
- (CLI wrapper) Takes a list of URLs and retrieve screenshots of older versions stored on the Wayback Machine.☆38Updated 2 years ago
- Super simple CLI tool for translating words/terms using Wikipedia. https://gitlab.com/timvisee/wikitrans☆9Updated last year
- ☆14Updated last year
- Unescape strings with escape sequences written out as literal characters.☆23Updated 2 weeks ago
- losslessly convert images to pdf☆70Updated 5 years ago
- ☆24Updated 4 years ago
- Quickly look up hashes in your terminal using the HashMob API 🔥☆12Updated 2 years ago
- Easily and securely send things from one computer to another 🐊 📦☆19Updated last year
- 🏆 • 5050 most frequent words in 109 languages☆42Updated 2 years ago