sts10 / common_word_list_makerLinks
Scrapes Google Books Ngram data to create a long word list
☆13Updated last year
Alternatives and similar repositories for common_word_list_maker
Users that are interested in common_word_list_maker are comparing it to the libraries listed below
Sorting:
- Combine and clean word lists☆91Updated this week
- A sentence segmentation library with wide language support optimized for speed and utility.☆71Updated this week
- The Unicode Cookbook for Linguists☆56Updated 4 years ago
- hashgen - the blazingly fast hash generator☆38Updated 3 weeks ago
- Wordlists designed for generating passphrases☆36Updated 4 months ago
- Quickly look up hashes in your terminal using the HashMob API 🔥☆13Updated 2 years ago
- A polite and user-friendly downloader for Common Crawl data☆57Updated 2 months ago
- A repository for word lists I've generated☆33Updated 5 months ago
- Simplified version of a common crawl fetcher☆17Updated last week
- Extract Unique Word Lists From Wikipedia Database☆13Updated 5 years ago
- A collection of impressive and useful results from OpenAI's chatgpt☆74Updated 2 years ago
- Lists of most-frequently-used english words / nouns / verbs etc.☆87Updated 5 years ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 4 years ago
- An open source investigation tool to collect and analyse public VK community wall posts☆36Updated 3 years ago
- A CLI tool for getting screenshots of URLs using headless chrome☆27Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated this week
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆36Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.☆226Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Updated 2 years ago
- Python 3 library for processing historical English☆67Updated last year
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆37Updated last month
- Limier est un petit outil en CLI permettant de trouver un flux RSS quand il est planqué sur un site.☆19Updated 2 years ago
- ☆18Updated 2 years ago
- A keylogger written in Rust to run on Windows☆22Updated 6 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆61Updated 10 years ago
- This is a CLI tool to search for images with Google Reverse Image Search (goris).☆122Updated 4 months ago
- An authorship attribution project with particular emphasis on Twitter analysis☆17Updated 3 years ago
- Reducing Bias in Modeling Real-world Password Strength via Deep Learning and Dynamic Dictionaries☆20Updated last year
- 🌿 Package information to install via hysp package manager☆24Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆55Updated 4 years ago