messense / fasttext-wheel
Build and upload fastText Python wheels to PyPI
☆23Updated last year
Alternatives and similar repositories for fasttext-wheel:
Users that are interested in fasttext-wheel are comparing it to the libraries listed below
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆68Updated 2 weeks ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Pythonic search engine based on PyLucene.☆125Updated 3 months ago
- Confection: the sweetest config system for Python☆182Updated 8 months ago
- ☆168Updated 8 months ago
- 80x faster and 95% accurate language identification with Fasttext☆146Updated last year
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆65Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆149Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- Fast Levenshtein Distance Library for Python 3☆82Updated 2 years ago
- My NER Experiments with ModernBERT☆17Updated last month
- provides a common interface to many IR measure tools☆80Updated 2 months ago
- A file utility for accessing both local and remote files through a unified interface.☆37Updated last month
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆171Updated last week
- A Python implementation of Lunr.js 🌖☆195Updated last month
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆22Updated last year
- ISO 639 language codes☆39Updated last week
- allennlp-light is a port of AllenNLP's core modules and nn portions into a standalone package with minimum dependencies☆55Updated 2 years ago
- Parse natural language time expressions in python☆131Updated 2 years ago
- Open source library for few shot NLP☆77Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆134Updated last month
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆65Updated last year
- ☆42Updated last year
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆52Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆107Updated last month
- Unofficial faiss wheel builder☆304Updated last week
- ☆68Updated 2 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆96Updated last year
- ☆63Updated 2 months ago
- Bi-encoder entity linking architecture☆44Updated 5 months ago