code-kern-ai / embedders
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
☆21 Updated last year
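To make the similarity-search use case concrete, here is a minimal sketch of the idea. It does not use the embedders API: the `embed` function is a hypothetical stand-in that builds a toy bag-of-words vector, where a real setup would call a sentence-embedding model. The names `embed`, `cosine`, and `most_similar` are assumptions made for illustration only.

```python
import math
from collections import Counter


def embed(text):
    """Toy sentence embedding: word counts (a stand-in for a real model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, corpus):
    """Return the corpus text whose embedding is closest to the query's."""
    q = embed(query)
    return max(corpus, key=lambda doc: cosine(q, embed(doc)))


corpus = [
    "the cat sat on the mat",
    "stock prices fell sharply today",
    "a dog chased the ball",
]
best = most_similar("the cat sat on the sofa", corpus)
print(best)
```

Swapping `embed` for a model-backed sentence embedder would leave the search logic unchanged, since only the vector representation differs.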
Related projects
Alternatives and complementary repositories for embedders
- With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier. ☆22 Updated 2 years ago
- ☆42 Updated last year
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated 9 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆56 Updated last year
- ☆29 Updated 2 years ago
- Explainable Zero-Shot Topic Extraction ☆61 Updated 3 months ago
- Few-shot Named Entity Recognition ☆122 Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆62 Updated 8 months ago
- ☆53 Updated 10 months ago
- Python package for deduplication/entity resolution using active learning ☆78 Updated 2 months ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆88 Updated last year
- Train huggingface models on top of Prodigy annotations ☆20 Updated 9 months ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆64 Updated 3 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆72 Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆103 Updated 6 months ago
- Sentence transformers models for SpaCy ☆105 Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ☆88 Updated 2 years ago
- CLI-based tool to automatically build ML models from training data into a servable Docker container ☆57 Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 8 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 Updated last year
- spaCy match and replace, maintaining conjugation ☆34 Updated last year
- Creating class-based TF-IDF matrices ☆82 Updated 2 years ago
- spaCy entry points for Curated Transformers ☆25 Updated last month
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech. ☆0 Updated last year
- Repository for deepdoctection tutorial notebooks ☆39 Updated 4 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 2 years ago
- A tool for quickly adding labels to unlabeled datasets ☆20 Updated 10 months ago
- Generate reports for spaCy models. ☆28 Updated 2 years ago
- Just another sentiment wrapper. ☆17 Updated 2 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated 8 months ago