code-kern-ai / embedders
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
☆21 Updated last year
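The similarity-search use case mentioned above can be illustrated with a minimal, standard-library sketch. It does not use the embedders API: `toy_embed` is a hypothetical stand-in that maps a text to word counts, where a real setup would presumably produce dense sentence embeddings. Only the ranking step, cosine similarity between a query vector and each document vector, is what the sketch is meant to show.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for a sentence embedding: lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


def most_similar(query, corpus):
    """Return the corpus text whose vector is closest to the query vector."""
    query_vec = toy_embed(query)
    return max(corpus, key=lambda doc: cosine(query_vec, toy_embed(doc)))


corpus = [
    "the cat sat on the mat",
    "stock markets fell sharply today",
    "a dog chased the cat",
]
best = most_similar("the cat sat down", corpus)
print(best)
```

Swapping `toy_embed` for a function that returns real sentence embeddings would leave the ranking logic unchanged, apart from computing the dot product over dense vectors instead of sparse counts.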
Related projects:
- With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier. ☆22 Updated last year
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated 7 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆56 Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ☆53 Updated last year
- spaCy match and replace, maintaining conjugation ☆34 Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫 ☆88 Updated last year
- Vespa application making an index of the CORD-19 dataset. ☆39 Updated 2 weeks ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆61 Updated last month
- Python package for deduplication/entity resolution using active learning ☆77 Updated 3 weeks ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆99 Updated 4 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆85 Updated 2 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 6 months ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ☆87 Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated 5 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆61 Updated 6 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆71 Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆57 Updated 4 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated 6 months ago
- Aim-spaCy integration ☆34 Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 2 years ago
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech. ☆0 Updated last year
- ☄️ Parallel and distributed training with spaCy and Ray ☆54 Updated last year
- Ranking of fine-tuned HF models as base models. ☆35 Updated last year
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03. ☆16 Updated 2 months ago
- Using short models to classify long texts ☆20 Updated last year