code-kern-ai / embedders
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
☆21Updated last year
Alternatives and similar repositories for embedders:
Users that are interested in embedders are comparing it to the libraries listed below
- With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.☆22Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- ☆43Updated last year
- CLI-based tool to automatically build ML models from training data into a servable Docker container☆58Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ☆30Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- XAI based human-in-the-loop framework for automatic rule-learning.☆48Updated 8 months ago
- 💫 SpaCy wrapper for ConceptNet 💫☆90Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- ☆22Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆70Updated 7 months ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆77Updated 7 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 11 months ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- spaCy powered Label Studio ML backend☆29Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 3 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- A Streamlit component for annotating text by text selecting.☆40Updated 9 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Few-shot Named Entity Recognition☆123Updated 3 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆28Updated 3 months ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 3 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 3 years ago