snguyenthanh / better_profanity
Blazingly fast cleaning swear words (and their leetspeak) in strings
☆220Updated 11 months ago
Alternatives and similar repositories for better_profanity:
Users that are interested in better_profanity are comparing it to the libraries listed below
- A Python library for detecting and filtering profanity☆161Updated 4 years ago
- A fast, robust Python library to check for offensive language in strings.☆640Updated 8 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆151Updated last year
- A fast, robust library to check for offensive language in strings, dropdown replacement of "profanity-check".☆74Updated 2 months ago
- A Python library to check for (and clean) profanity in strings.☆63Updated 3 years ago
- A Python library for working with and comparing language codes.☆346Updated 4 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-translate☆108Updated last year
- Spelling corrector in python☆480Updated 3 months ago
- 80x faster and 95% accurate language identification with Fasttext☆152Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆114Updated last month
- Async PRAW, an abbreviation for "Asynchronous Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's A…☆117Updated last week
- Measure the readability of a given text using surface characteristics☆78Updated 2 months ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆740Updated last month
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆71Updated 2 months ago
- A Python library to access ISO country, subdivision, language, currency and script definitions and their translations.☆837Updated last week
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆70Updated last year
- Python client for Typesense: https://github.com/typesense/typesense☆196Updated 2 weeks ago
- Parse numbers written in natural language☆113Updated 6 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆257Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆314Updated last week
- ☆169Updated 3 weeks ago
- Convert number words (eg. twenty one) to numeric digits (21)☆176Updated last year
- Pilmoji is a fast and reliable emoji renderer for PIL.☆84Updated 10 months ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆70Updated 8 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆124Updated 3 months ago
- ndjson with the same interface as the builtin json module☆68Updated 2 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆118Updated last year
- Bi-directional transliterator for Python. Transliterates (unicode) strings according to the rules specified in the language packs.☆302Updated last year
- universal character encoding detector☆58Updated 7 months ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago