code-kern-ai / embedders
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
☆21Updated last year
Alternatives and similar repositories for embedders:
Users that are interested in embedders are comparing it to the libraries listed below
- With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.☆22Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ☆30Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 2 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆48Updated 8 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- A Streamlit component for annotating text by text selecting.☆40Updated 9 months ago
- ☆43Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A simple library for training named entity recognition model from partially annotated data☆23Updated last year
- ☆22Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆77Updated 7 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated 7 months ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆39Updated last year
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆28Updated 2 years ago
- ☆46Updated 2 years ago
- ☆54Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 11 months ago
- spaCy entry points for Curated Transformers☆27Updated 6 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated 10 months ago
- ☆19Updated 2 years ago
- Aim-spaCy integration☆34Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆78Updated last year
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- Library for fast text representation and classification.☆28Updated last year