koaning / simsity
Super Simple Similarities Service
☆147Updated last month
Alternatives and similar repositories for simsity:
Users that are interested in simsity are comparing it to the libraries listed below
- Bag of, not words, but tricks!☆68Updated last year
- Pipeline components that support partial_fit.☆46Updated 8 months ago
- just a bunch of useful embeddings for scikit-learn pipelines☆495Updated 2 weeks ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores☆100Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆156Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆153Updated 10 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆309Updated last year
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Doubt your data, find bad labels.☆510Updated 8 months ago
- Just another sentiment wrapper.☆17Updated 3 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Experiments for data quality in Rasa.☆34Updated 2 years ago
- KEN: Relational Data Embeddings☆29Updated last year
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated last year
- A visual labeling system implemented in Jupyter widgets.☆150Updated 5 months ago
- Vectorizers for a range of different data types☆101Updated 2 months ago
- ☆69Updated 3 years ago
- Toolkit for developing and maintaining ML models☆154Updated 10 months ago
- Few-shot Named Entity Recognition☆123Updated 3 years ago
- Decorators that logs stats.☆110Updated last month
- Python package for deduplication/entity resolution using active learning☆78Updated 7 months ago
- Streamline scikit-learn model comparison.☆145Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated 7 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated 7 months ago
- SPEAR: Programmatically label and build training data quickly.☆106Updated 9 months ago