wjbmattingly / keyword-spacy
Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.
★ 11 · Updated last year
Alternatives and similar repositories for keyword-spacy:
Users who are interested in keyword-spacy are comparing it to the libraries listed below.
- Use Hugging Face text and token classification pipelines directly in spaCy ★ 63 · Updated last year
- A BERT-based application for reusable text classification at scale ★ 38 · Updated last year
- ★ 54 · Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ★ 79 · Updated last year
- An easy way to chunk spaCy docs. ★ 19 · Updated 8 months ago
- weasel: A small and easy workflow system ★ 83 · Updated 9 months ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ★ 17 · Updated 8 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ★ 161 · Updated 2 years ago
- Small Python package to measure OCR quality and other related metrics. ★ 21 · Updated last year
- Work with static vector models ★ 28 · Updated this week
- ★ 18 · Updated last year
- Layout Analysis Dataset with Segmonto (LADaS) ★ 20 · Updated 2 months ago
- NLP pipelines for Tagalog using spaCy ★ 54 · Updated 2 months ago
- Efficient few-shot learning with cross-encoders. ★ 51 · Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax. ★ 59 · Updated 11 months ago
- ★ 17 · Updated 2 years ago
- High-level library for batched embeddings generation, blazingly fast web-based RAG, and quantized index processing ★ 66 · Updated 5 months ago
- Pipeline for converting PDFs to raw text with PaddleOCR ★ 23 · Updated last year
- ★ 67 · Updated last year
- Template Haystack Search Application with Streamlit ★ 27 · Updated 3 months ago
- spaCy entry points for Curated Transformers ★ 29 · Updated 6 months ago
- Tools for interactive visual exploration of semantic embeddings. ★ 32 · Updated 7 months ago
- Pre-train Static Word Embeddings ★ 56 · Updated 2 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ★ 76 · Updated 6 months ago
- Generalist and Lightweight Model for Text Classification ★ 121 · Updated 2 weeks ago
- Universal text classifier for generative models ★ 24 · Updated 9 months ago
- Data Cleaning and Textual Data Visualization ★ 168 · Updated 10 months ago
- Fine-tune OpenAI models for text classification, question answering, and more ★ 16 · Updated last year
- Reference-free automatic summarization evaluation with potential hallucination detection ★ 100 · Updated last year
- HDBSCAN Tuning for BERTopic Models ★ 45 · Updated last year