bsolomon1124 / demojiLinks
Accurately find/replace/remove emojis in text strings
☆163Updated last year
Alternatives and similar repositories for demoji
Users that are interested in demoji are comparing it to the libraries listed below
Sorting:
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- ☆172Updated 4 months ago
- Lightning Fast Language Prediction 🚀☆167Updated 6 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Pythonic search engine based on PyLucene.☆128Updated 8 months ago
- A Python library for working with and comparing language codes.☆345Updated 3 months ago
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆74Updated last month
- A Python implementation of Lunr.js 🌖☆198Updated 5 months ago
- Convert number words (eg. twenty one) to numeric digits (21)☆177Updated last year
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- A Python module to convert natural language numerics into ints and floats.☆229Updated 10 months ago
- 📂 Additional lookup tables and data resources for spaCy☆108Updated 2 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆67Updated 2 years ago
- Abydos NLP/IR library for Python☆188Updated 2 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Textpipe: clean and extract metadata from text☆302Updated 4 years ago
- Parse numbers written in natural language☆122Updated 9 months ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆472Updated 6 months ago
- Fixes contractions such as `you're` to `you are`☆318Updated 2 years ago
- A python package to simulate typographical errors.☆36Updated last year
- ☆70Updated 2 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last year
- ☆69Updated 3 years ago
- Python library that reads JSON files of any size.☆198Updated 2 years ago
- Extract dates from text☆64Updated 4 years ago
- ISO 639 language codes☆46Updated 5 months ago