LDNOOBW / List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-WordsLinks
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
☆3,097Updated 10 months ago
Alternatives and similar repositories for List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
Users that are interested in List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words are comparing it to the libraries listed below
Sorting:
- ☆1,277Updated 2 years ago
- Winamp 2 reimplemented for the browser☆10,542Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,965Updated 2 months ago
- Crawl BookCorpus☆832Updated last year
- ☆1,573Updated 2 years ago
- Minimal keyword extraction with BERT☆3,878Updated 2 months ago
- Longformer: The Long-Document Transformer☆2,130Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,357Updated last year
- BERT score for text generation☆1,747Updated 10 months ago
- git-based selfies for software developers☆4,789Updated last week
- Single-document unsupervised keyword extraction☆1,731Updated this week
- A lightning fast Finite State machine and REgular expression manipulation library.☆1,847Updated 5 months ago
- A more maintainable, easier to share version of the infamous http://mindprod.com/jgloss/unmain.html☆10,111Updated 3 years ago
- 🛏 An HTML to Markdown converter written in JavaScript☆9,814Updated 10 months ago
- Conditional Transformer Language Model for Controllable Generation☆1,885Updated last month
- Dataset of GPT-2 outputs for research in detection, biases, and more☆1,982Updated last year
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,147Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,235Updated 10 months ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,185Updated 2 years ago
- Toolkit for creating, sharing and using natural language prompts.☆2,872Updated last year
- Language-Agnostic SEntence Representations☆3,640Updated last year
- Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts☆3,406Updated 2 years ago
- A tool for extracting plain text from Wikipedia dumps☆3,868Updated last year
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,659Updated last year
- Quantized inference code for LLaMA models☆1,049Updated 2 years ago
- Tools to download and cleanup Common Crawl data☆1,013Updated 2 years ago
- Large-scale pretraining for dialogue☆2,385Updated 2 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,116Updated 2 years ago
- The implementation of DeBERTa☆2,097Updated last year
- Transmits AM radio on computers without radio transmitting hardware.☆6,608Updated 8 months ago