explosion / prodigy-pdfLinks
A Prodigy plugin for PDF annotation
β33Updated 4 months ago
Alternatives and similar repositories for prodigy-pdf
Users that are interested in prodigy-pdf are comparing it to the libraries listed below
Sorting:
- 𦦠weasel: A small and easy workflow systemβ85Updated last year
- A spaCy wrapper for GliNERβ118Updated 6 months ago
- Plug-and-play document processing pipelines with zero-shot models.β85Updated this week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iβ¦β113Updated 2 weeks ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated last year
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)β229Updated last month
- Generalist and Lightweight Model for Text Classificationβ148Updated last month
- spaCy entry points for Curated Transformersβ32Updated 2 months ago
- β27Updated last year
- β77Updated 8 months ago
- Repo to experiment with Graph RAG strategies using KΓΉzuβ56Updated 8 months ago
- Robust and fast topic models with sentence-transformers.β73Updated last month
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extractionβ76Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β108Updated last year
- Docx tracked change redlines for the Python ecosystem.β74Updated last year
- Widgets to make it easy to add labelsβ29Updated this week
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β83Updated 7 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.β52Updated 9 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β79Updated last year
- Python API for https://vespa.ai, the open big data serving engineβ133Updated this week
- ποΈ Highlight text in documentsβ109Updated 3 months ago
- Lightweight Nearest Neighbors with Flexible Backendsβ296Updated 2 weeks ago
- Small python package to measure OCR quality and other related metrics.β25Updated last year
- Simple UI for debugging correlations of text embeddingsβ288Updated 2 months ago
- Query language for blending SQL and LLMs across structured + unstructured data, with type constraints.β107Updated last week
- β55Updated last year
- An easy way to chunk spaCy docs.β21Updated 11 months ago
- NLP pipelines for Tagalog using spaCyβ60Updated 2 weeks ago
- A public repo that contains integrations for Argilla and LlamaIndex.β16Updated 9 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graphβ24Updated last year