MaartenGr / PolyFuzz
Fuzzy string matching, grouping, and evaluation.
☆744Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for PolyFuzz
- skweak: A software toolkit for weak supervision applied to NLP tasks☆918Updated 2 months ago
- Super Fast String Matching in Python☆364Updated 6 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆397Updated 3 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆139Updated 7 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆242Updated last year
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide…☆551Updated 2 months ago
- Natural Intelligence is still a pretty good idea.☆796Updated 3 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆469Updated last year
- 1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.☆870Updated this week
- ✔️Contextual word checker for better suggestions (not actively maintained)☆409Updated 3 weeks ago
- Textpipe: clean and extract metadata from text☆300Updated 3 years ago
- Gain clues from clustering!☆304Updated 3 months ago
- just a bunch of useful embeddings☆466Updated last month
- Doubt your data, find bad labels.☆504Updated 3 months ago
- Fuzzy matching and more functionality for spaCy.☆252Updated 4 months ago
- A Python library for calculating a large variety of metrics from text☆313Updated last month
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆253Updated this week
- A Simple Bulk Labelling Tool☆550Updated 2 months ago
- Fixes contractions such as `you're` to `you are`☆312Updated last year
- 🧹 Python package for text cleaning☆957Updated last year
- Compute Sentence Embeddings Fast!☆618Updated last year
- A Python module to convert natural language numerics into ints and floats.☆222Updated last month
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆243Updated 5 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,351Updated 5 months ago
- PYthon Automated Term Extraction☆305Updated last year
- Active Learning for Text Classification in Python☆560Updated this week
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆320Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆286Updated last year
- 👑 spaCy building blocks and visualizers for Streamlit apps☆804Updated 3 months ago