LouisTsiattalou / tfidf_matcher
TFIDF / KNN based string matching
☆49 Updated last year
Related projects
Alternatives and complementary repositories for tfidf_matcher
- Fuzzy matching and more functionality for spaCy. ☆252 Updated 4 months ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext ☆139 Updated 7 months ago
- Super Fast String Matching in Python ☆364 Updated 6 months ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum… ☆254 Updated 2 weeks ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆242 Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆115 Updated 7 months ago
- 🧪 Cutting-edge experimental spaCy components and features ☆95 Updated 6 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆139 Updated last month
- Python package for deduplication/entity resolution using active learning ☆78 Updated 2 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆118 Updated 6 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated 7 months ago
- Simplifies use of the Dedupe library via Pandas ☆136 Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 2 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks ☆154 Updated last year
- Dataframe Integration with spaCy. ☆101 Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆209 Updated 5 months ago
- Nesta's Skills Extractor Library ☆123 Updated 3 weeks ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 Updated 5 months ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆281 Updated 2 years ago
- Spacy NER annotator using ipywidgets ☆121 Updated 7 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… ☆192 Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆287 Updated last year
- Creating class-based TF-IDF matrices ☆82 Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 8 months ago
- Zero and Few shot named entity & relationships recognition ☆349 Updated 2 months ago
- ☆135 Updated this week
- Code for the paper "Deep Entity Matching with Pre-trained Language Models" ☆262 Updated 7 months ago
- Clustering sentence embeddings to extract message intent ☆167 Updated 3 years ago
- A Corpus of 475,000 Industrial Occupations ☆63 Updated 4 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆54 Updated last year