messense / fasttext-wheel
Build and upload fastText Python wheels to PyPI
☆23 · Updated last year
Alternatives and similar repositories for fasttext-wheel:
Users interested in fasttext-wheel are comparing it to the libraries listed below:
- ☆168 · Updated 9 months ago
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 2 months ago
- Language detection using Spacy and Fasttext ☆55 · Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆150 · Updated last year
- My NER Experiments with ModernBERT ☆17 · Updated 2 months ago
- A Fast Levenshtein Distance Library for Python ☆82 · Updated 3 weeks ago
- Efficient string matching with regular expressions ☆141 · Updated last week
- ☆30 · Updated 2 years ago
- 80x faster and 95% accurate language identification with Fasttext ☆150 · Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆69 · Updated last month
- Pythonic search engine based on PyLucene. ☆125 · Updated 4 months ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity ☆69 · Updated last year
- Targeted language identifier, based on FastText and Hunspell. ☆34 · Updated last month
- Confection: the sweetest config system for Python ☆183 · Updated 9 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆112 · Updated 2 weeks ago
- A Python implementation of Lunr.js 🌖 ☆196 · Updated last week
- Measure the readability of a given text using surface characteristics ☆79 · Updated last month
- Simply, faster, sentence-transformers ☆141 · Updated 6 months ago
- Complete lxml external type annotation ☆51 · Updated this week
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆65 · Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆154 · Updated 4 months ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub ☆43 · Updated 9 months ago
- Multi-Language Identification ☆29 · Updated 7 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 · Updated 9 months ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library. ☆300 · Updated 9 months ago
- Train a model, and detect gibberish strings with it. ☆61 · Updated 3 years ago
- Lint Cython files ☆75 · Updated this week
- Parse natural language time expressions in python ☆131 · Updated 2 years ago
- A Streamlit component for annotating text by text selecting. ☆40 · Updated 9 months ago
- universal character encoding detector ☆58 · Updated 6 months ago