sts10 / common_word_list_maker
Scrapes Google Books Ngram data to create a long word list
☆13Updated last year
Alternatives and similar repositories for common_word_list_maker:
Users that are interested in common_word_list_maker are comparing it to the libraries listed below
- Combine and clean word lists☆87Updated 3 weeks ago
- A tool to manipulate ePub files.☆24Updated 4 years ago
- hashgen - the blazingly fast hash generator☆30Updated this week
- A polite and user-friendly downloader for Common Crawl data☆36Updated last week
- Quickly look up hashes in your terminal using the HashMob API 🔥☆12Updated last year
- Metadata management and dissemination system for Open Access books☆52Updated last week
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Lists of most-frequently-used english words / nouns / verbs etc.☆60Updated 4 years ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction☆36Updated 3 weeks ago
- Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.☆21Updated 6 years ago
- Simplified version of a common crawl fetcher☆13Updated 2 weeks ago
- Some tools to help analyze the twitter archive☆62Updated 7 months ago
- Tools to process books in a cloud based pipeline system☆58Updated 2 weeks ago
- Download an entire book (or publication) in PDF file from Hathi Trust Digital Library without "partner login" requirement☆52Updated 6 months ago
- A repository of Juris-M style modules☆16Updated last year
- A bulk QR Code generator.☆33Updated 2 years ago
- A browser extension providing Open Access bibliographical services☆17Updated 2 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆113Updated 7 months ago
- Template file in LaTex to generate legal briefs, with line numbering and formatting commonly used by lawyers in the United States.☆38Updated 4 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆23Updated 7 years ago
- Shell Environment Swiss Army Knife☆12Updated last year
- A sentence segmentation library with wide language support optimized for speed and utility.☆59Updated 7 months ago
- A lightweight Rust library for removing Arabic diacritics☆19Updated 2 years ago
- Rule Processor Y is a next-gen Rule processor with complex multibyte character support built to support Hashcat☆32Updated 5 months ago
- A microservice for document conversion at scale☆62Updated 2 weeks ago
- Offline etymological dictionary based on Wiktionary data☆21Updated 3 years ago
- Password Transformation Tool (ptt) is a versatile utility designed for password cracking.☆27Updated 3 weeks ago
- A tool to detect whether a PDF has a bad redaction☆135Updated last month
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆32Updated last month
- Fetch all your bookmarked tweets and make them accessible through a webinterface.☆29Updated last year