messense / fasttext-wheel
Build and upload fastText Python wheels to PyPI
☆23Updated last year
Alternatives and similar repositories for fasttext-wheel:
Users that are interested in fasttext-wheel are comparing it to the libraries listed below
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆69Updated last month
- ☆168Updated 9 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆65Updated last year
- Language detection using Spacy and Fasttext☆55Updated last year
- A Streamlit component for annotating text by text selecting.☆40Updated 9 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆110Updated last week
- A Fast Levenshtein Distance Library for Python☆82Updated 2 weeks ago
- A Python implementation of Lunr.js 🌖☆196Updated this week
- Pythonic search engine based on PyLucene.☆125Updated 3 months ago
- ☆68Updated 2 years ago
- Super lightweight function registries for your library☆177Updated 9 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆150Updated last year
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 10 months ago
- Confection: the sweetest config system for Python☆183Updated 9 months ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆69Updated last year
- Python bindings for cld3☆27Updated last year
- My NER Experiments with ModernBERT☆17Updated 2 months ago
- A simple client for doccano API.☆84Updated 9 months ago
- ☆42Updated last year
- Complete lxml external type annotation☆50Updated this week
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 9 months ago
- Parse numbers written in natural language☆109Updated 4 months ago
- ☆30Updated 2 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- Bag of, not words, but tricks!☆68Updated last year
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆39Updated 2 years ago
- Annotation tool on Jupyter for Named Entity Recognition tasks☆21Updated last year