LDNOOBW / List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
☆3,168 Updated last year
Alternatives and similar repositories for List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
Users who are interested in List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words are comparing it to the libraries listed below.
- Some of the hidden norms about Hacker News not otherwise covered in the Guidelines and the FAQ. ☆3,722 Updated 7 months ago
- Compact Language Detector 2 ☆871 Updated 4 years ago
- This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of th… ☆4,172 Updated 2 years ago
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons ☆1,168 Updated this week
- Acts like FuckAdBlock.js but always says that no adblock was detected ☆1,259 Updated 8 years ago
- Don't commit when you're drunk ☆1,450 Updated 9 years ago
- A very long list of English profanity. ☆272 Updated this week
- Tools to download and cleanup Common Crawl data ☆1,027 Updated 2 years ago
- Crawl BookCorpus ☆843 Updated 2 years ago
- World Factbook Country Profiles in JSON - Free Open Public Domain Data - No API Key Required ;-) ☆1,040 Updated 3 weeks ago
- Awesome Code Points ☆769 Updated last year
- The project where literally anything* goes. ☆1,971 Updated 2 weeks ago
- Archived list of domains using Cloudflare DNS at the time of the CloudBleed announcement. ☆1,920 Updated 8 years ago
- ☆837 Updated last year
- A demo of cross-origin login detection for most major web platforms ☆861 Updated 3 years ago
- A very simple Chrome Extension that displays the automated image tags that Facebook has generated for your images ☆1,487 Updated 3 years ago
- Access a database of word frequencies, in various natural languages. ☆1,537 Updated 8 months ago
- A collection of SQL queries to social media datasets. ☆1,544 Updated 5 years ago
- A Python parser for MediaWiki wikicode ☆823 Updated 2 months ago
- Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed. ☆736 Updated 2 years ago
- [ab]using Unicode to create tragedy ☆3,748 Updated last year
- Source code for https://gethttpsforfree.com/ ☆2,205 Updated 2 months ago
- ☆845 Updated 2 years ago
- Letterpress Word List ☆412 Updated 9 years ago
- For when people get too hyped up about things ☆7,295 Updated last year
- A robust Python tool for text-based AI training and generation using GPT-2. ☆1,840 Updated 2 years ago
- Generate Google Analytics tracking code with any isogrammic parameters you like ☆407 Updated 6 years ago
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts ☆3,406 Updated 2 years ago
- A tool for extracting plain text from Wikipedia dumps ☆3,907 Updated last year
- Using speech-to-text to fully check out during con calls ☆2,093 Updated 6 years ago