MaxHalford / clavier
🔤 Measure edit distance based on keyboard layout
☆58 Updated last year
Related projects
Alternatives and complementary repositories for clavier
- Generate reports for spaCy models. ☆28 Updated 2 years ago
- A python package to simulate typographical errors. ☆31 Updated 11 months ago
- A utility for labeling clusters of text data. ☆28 Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 Updated last year
- Python package for deduplication/entity resolution using active learning ☆78 Updated 2 months ago
- ☆29 Updated 2 years ago
- Efficient BM25 with DuckDB 🦆 ☆29 Updated last month
- real time recommendation playground ☆15 Updated 2 years ago
- The simplest way to deploy a machine learning model ☆23 Updated 2 years ago
- ✨🌲 Hierarchical extreme multiclass and multi-label classification. ☆17 Updated last year
- Abydos NLP/IR library for Python ☆183 Updated 2 years ago
- A library to instantiate any Python object from configuration files. ☆24 Updated 2 years ago
- ☆67 Updated 2 years ago
- captures logs and makes cron more fun ☆71 Updated 2 months ago
- 🐍 Python bindings for the Hora Approximate Nearest Neighbor Search Algorithm library ☆68 Updated 3 years ago
- Library for fast text representation and classification. ☆28 Updated 10 months ago
- It's a cooler way to store simple linear models. ☆28 Updated 4 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆67 Updated 3 weeks ago
- Bag of, not words, but tricks! ☆68 Updated last year
- 🕊️ Radically lightweight command-line interfaces ☆103 Updated last year
- Confection: the sweetest config system for Python ☆176 Updated 5 months ago
- Vectorizers for a range of different data types ☆97 Updated this week
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags) ☆13 Updated last month
- Pipeline components that support partial_fit. ☆43 Updated 4 months ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆65 Updated 3 months ago
- A Toolbox for the Evaluation of machine learning Explanations ☆15 Updated 10 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆37 Updated 5 years ago
- Full text search in your Pandas dataframe ☆209 Updated 3 weeks ago
- The project lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques ☆29 Updated 4 years ago