Custom recipe and utilities for document processing
☆199Jun 19, 2022Updated 3 years ago
Alternatives and similar repositories for prodigy-pdf-custom-recipe
Users that are interested in prodigy-pdf-custom-recipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Simple Bulk Labelling Tool☆597Jul 29, 2025Updated 9 months ago
- ☆16Oct 12, 2020Updated 5 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆104Feb 26, 2024Updated 2 years ago
- ☆62Nov 29, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆86Jun 13, 2022Updated 3 years ago
- ☆49Mar 30, 2023Updated 3 years ago
- A system for reading scanned documents and grouping them into high level topics☆14Aug 4, 2020Updated 5 years ago
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆65Jan 29, 2025Updated last year
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36May 18, 2023Updated 2 years ago
- Quote extraction for modular journalism (JournalismAI collab 2021)☆229Feb 2, 2022Updated 4 years ago
- Domain-specific BERT representation for Named Entity Recognition of lab protocol☆29Dec 25, 2020Updated 5 years ago
- A parameter-efficient compression model architecture for a variety of NLP tasks at BERT level performance at a fraction of the computatio…☆10Jan 25, 2026Updated 3 months ago
- RSS to Email Webapp (Python, AppEngine)☆18Jan 18, 2011Updated 15 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 💫 SpaCy wrapper for ConceptNet 💫☆96Dec 30, 2025Updated 4 months ago
- An OpenBB agent slack bot that is ready to answer any financial question☆12Feb 24, 2024Updated 2 years ago
- Active Learning for Text Classification in Python☆640Apr 17, 2026Updated 3 weeks ago
- Reusable infrastructure modules for running TICK stack on GCP☆20Nov 19, 2025Updated 5 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆6,067Updated this week
- 🧪 Cutting-edge experimental spaCy components and features☆105Apr 23, 2024Updated 2 years ago
- ☆12Apr 18, 2026Updated 3 weeks ago
- 🪐 End-to-end NLP workflows from prototype to production☆1,428Oct 15, 2024Updated last year
- ☆13Oct 1, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Text analysis with networks.☆293Jan 16, 2026Updated 3 months ago
- Zero and Few shot named entity & relationships recognition☆402Sep 17, 2025Updated 7 months ago
- 💥 Explosion Assets☆45Dec 10, 2023Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation.☆794Jul 10, 2025Updated 9 months ago
- Make PDFs easily☆324Mar 17, 2026Updated last month
- D application with imgui running in the browser☆11Dec 10, 2022Updated 3 years ago
- ☆11Feb 22, 2018Updated 8 years ago
- D programming language wrapper for InfluxDB☆11Sep 25, 2025Updated 7 months ago
- ☆11Aug 23, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Unified Toolkit for Deep Learning Based Document Image Analysis☆5,728Aug 15, 2024Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Oct 20, 2025Updated 6 months ago
- Some tools for cleaning up messy 'Excel' files to be suitable for R☆28Apr 25, 2026Updated 2 weeks ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Jul 23, 2020Updated 5 years ago
- Tidymodels for Nested/Panel Data☆13Sep 30, 2023Updated 2 years ago
- FasterAI: Prune and Distill your models with FastAI and PyTorch☆261Apr 13, 2026Updated 3 weeks ago
- Code for Fooling Contrastive Language-Image Pre-trainined Models with CLIPMasterPrints☆15Jan 25, 2026Updated 3 months ago