explosion / prodigy-pdf
A Prodigy plugin for PDF annotation
☆29 · Updated last month
Alternatives and similar repositories for prodigy-pdf:
Users who are interested in prodigy-pdf are comparing it to the libraries listed below:
- 🦦 weasel: A small and easy workflow system ☆75 · Updated 7 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax. ☆59 · Updated 9 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ☆79 · Updated 3 weeks ago
- Generalist and Lightweight Model for Text Classification ☆79 · Updated this week
- An easy way to chunk spaCy docs. ☆19 · Updated 6 months ago
- spaCy entry points for Curated Transformers ☆26 · Updated 4 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated 11 months ago
- 🌸 Train floret vectors ☆18 · Updated last year
- Efficient BM25 with DuckDB 🦆 ☆39 · Updated 2 months ago
- Repo to experiment with Graph RAG strategies using Kùzu ☆44 · Updated 2 months ago
- ☆54 · Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 · Updated 6 months ago
- Knowledge Graph Generator app ☆30 · Updated 10 months ago
- ☆29 · Updated 3 weeks ago
- A Prodigy plugin for evaluating spaCy pipelines ☆13 · Updated 10 months ago
- Leverage your LangChain trace data for fine tuning ☆41 · Updated 6 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆48 · Updated 4 months ago
- A spaCy wrapper for GliNER ☆108 · Updated 3 weeks ago
- A personal knowledge base that I can dump information to and help me learn ☆24 · Updated 8 months ago
- ☆58 · Updated 3 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆23 · Updated 11 months ago
- Robust and fast topic models with sentence-transformers. ☆42 · Updated this week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆22 · Updated last year
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard. ☆79 · Updated 2 months ago
- Playing with Python Bluesky SDK ☆14 · Updated 3 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆30 · Updated 5 months ago
- NLP with Rust for Python 🦀🐍 ☆61 · Updated 8 months ago
- ☆67 · Updated 11 months ago
- Reference-Free automatic summarization evaluation with potential hallucination detection ☆101 · Updated last year
- Train Hugging Face models on top of Prodigy annotations ☆21 · Updated last year