wjbmattingly / spacy-chunks
An easy way to chunk spaCy docs.
☆20Updated 8 months ago
Alternatives and similar repositories for spacy-chunks:
Users who are interested in spacy-chunks are comparing it to the libraries listed below.
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- Generalist and Lightweight Model for Text Classification☆123Updated this week
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Pre-train Static Word Embeddings☆58Updated 3 weeks ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆30Updated 3 weeks ago
- ☆54Updated last year
- Efficient few-shot learning with cross-encoders.☆51Updated last year
- ☆17Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆17Updated 8 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Small Python package to measure OCR quality and other related metrics.☆21Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆46Updated 3 weeks ago
- A spaCy wrapper for GLiNER☆114Updated 3 months ago
- Robust and fast topic models with sentence-transformers.☆48Updated last week
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- Chunk your text more accurately using gpt4o-mini☆44Updated 9 months ago
- GLiNER model in a FastAPI microservice.☆42Updated 4 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆71Updated 9 months ago
- Python API for https://vespa.ai, the open big data serving engine☆121Updated this week
- ☆47Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 6 months ago
- Plug-and-play document processing pipelines. No training. Batteries included.☆57Updated last week
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- C++ inference engine for running GLiNER (Generalist and Lightweight Named Entity Recognition) models☆28Updated 4 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 8 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 8 months ago
- An LLM training library for instruction-tuning.☆25Updated last year