code-kern-ai / embeddersLinks
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
☆21Updated 2 months ago
Alternatives and similar repositories for embedders
Users that are interested in embedders are comparing it to the libraries listed below
Sorting:
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 8 months ago
- With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.☆22Updated 2 years ago
- ☆43Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ☆30Updated 3 years ago
- Simply, faster, sentence-transformers☆143Updated last year
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 6 months ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆63Updated 8 months ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Efficient BM25 with DuckDB 🦆☆58Updated 9 months ago
- ☆55Updated last year
- Super Simple Similarities Service☆154Updated 5 months ago
- ☆70Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆109Updated last year
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆73Updated last year
- Multi-task model for named-entity recognition, relation extraction, entity mention detection and coreference resolution.☆45Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆101Updated last year
- ☆69Updated 3 years ago
- Pipeline components that support partial_fit.☆46Updated last year
- Information extraction from English and German texts based on predicate logic☆138Updated 2 years ago
- Creating class-based TF-IDF matrices☆89Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 3 years ago