LDNOOBW / List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
☆3,271 Updated last year
Alternatives and similar repositories for List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
Users who are interested in List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words are comparing it to the repositories listed below.
- Download the entire Wayback Machine archive for a given URL. ☆3,124 Updated 8 months ago
- Stand-alone language identification system ☆2,452 Updated 6 years ago
- Just the facts -- web page content extraction ☆1,279 Updated 6 months ago
- Module for automatic summarization of text documents and HTML pages. ☆3,657 Updated 2 weeks ago
- Port of Google's language-detection library to Python. ☆1,867 Updated 10 months ago
- ☆860 Updated 2 years ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017 ☆830 Updated 2 years ago
- This Word Does Not Exist ☆1,020 Updated 3 years ago
- Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP) ☆5,962 Updated 2 years ago
- Compact Language Detector 2 ☆887 Updated 4 years ago
- Multilingual text (NLP) processing toolkit ☆2,360 Updated 2 years ago
- ☆1,253 Updated last year
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ☆2,205 Updated last month
- Beautiful visualizations of how language differs among document types. ☆2,327 Updated 8 months ago
- A lightning fast Finite State machine and REgular expression manipulation library. ☆1,878 Updated last year
- A statuspage generator that lets you host your statuspage for free on Github. ☆3,888 Updated 3 years ago
- Multilingual word vectors in 78 languages ☆1,199 Updated 2 years ago
- Tools to download and cleanup Common Crawl data ☆1,040 Updated 2 years ago
- A list of (almost) all headless web browsers in existence ☆6,478 Updated 3 months ago
- A linter for prose. ☆4,491 Updated 3 weeks ago
- Unicode's answer to Base64 ☆2,181 Updated last week
- 🦆 Contextually-keyed word vectors ☆1,668 Updated 8 months ago
- Conditional Transformer Language Model for Controllable Generation ☆1,883 Updated 8 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP) ☆14,341 Updated 2 months ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference" ☆1,628 Updated 2 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆779 Updated 2 years ago
- Subtle and not-so-subtle shell tweaks that will slowly drive people insane. ☆2,199 Updated 2 years ago
- A full Python Implementation of the ROUGE Metric (not a wrapper) ☆713 Updated last year
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW ☆2,848 Updated last week
- The implementation of DeBERTa ☆2,188 Updated 2 years ago