ljvmiranda921 / prodigy-pdf-custom-recipe
Custom recipe and utilities for document processing
☆200 · Jun 19, 2022 · Updated 3 years ago
Alternatives and similar repositories for prodigy-pdf-custom-recipe
Users who are interested in prodigy-pdf-custom-recipe are comparing it to the libraries listed below.
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆170 · Nov 7, 2022 · Updated 3 years ago
- A Simple Bulk Labelling Tool ☆599 · Jul 29, 2025 · Updated 6 months ago
- ☆15 · Oct 12, 2020 · Updated 5 years ago
- A system for reading scanned documents and grouping them into high level topics ☆14 · Aug 4, 2020 · Updated 5 years ago
- Domain-specific BERT representation for Named Entity Recognition of lab protocol ☆29 · Dec 25, 2020 · Updated 5 years ago
- Transforms PDF, Documents and Images into Enriched Structured Data ☆6,164 · Dec 3, 2023 · Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Feb 26, 2024 · Updated last year
- https://arxiv.org/pdf/1909.04054 ☆78 · Nov 2, 2022 · Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 · Sep 2, 2024 · Updated last year
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network ☆86 · Jun 13, 2022 · Updated 3 years ago
- French Jurisprudences at your fingertips @ every 72h ☆15 · Nov 18, 2025 · Updated 2 months ago
- NS-CQA: the model of the JWS paper 'Less is More: Data-Efficient Complex Question Answering over Knowledge Bases.' This work has been acc… ☆22 · Jan 6, 2021 · Updated 5 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆402 · Jul 30, 2021 · Updated 4 years ago
- Entity Disambiguation as text extraction (ACL 2022) ☆182 · Apr 17, 2022 · Updated 3 years ago
- Quote extraction for modular journalism (JournalismAI collab 2021) ☆229 · Feb 2, 2022 · Updated 4 years ago
- A parameter-efficient compression model architecture for a variety of NLP tasks at BERT level performance at a fraction of the computatio… ☆10 · Jan 25, 2026 · Updated 3 weeks ago
- D programming language wrapper for InfluxDB ☆11 · Sep 25, 2025 · Updated 4 months ago
- An OpenBB agent slack bot that is ready to answer any financial question ☆12 · Feb 24, 2024 · Updated last year
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal… ☆12 · Sep 16, 2024 · Updated last year
- Cricket analytics for humans 🏏 ☆12 · Sep 4, 2022 · Updated 3 years ago
- ☆10 · Aug 23, 2025 · Updated 5 months ago
- Course repository for the Fall 2021 COMP790 course "Information Theory" at UNC ☆11 · Aug 24, 2021 · Updated 4 years ago
- Generate multiple choice fill-in-the-blank questions from any article. ☆13 · Dec 8, 2022 · Updated 3 years ago
- Tensorflow 2.x implementation of Gradient Origin Networks ☆12 · Jul 13, 2020 · Updated 5 years ago
- D application with imgui running in the browser ☆11 · Dec 10, 2022 · Updated 3 years ago
- Repository for course material for Indian Knowledge System (IKS) ☆13 · Jan 20, 2026 · Updated 3 weeks ago
- ☆12 · Feb 3, 2026 · Updated last week
- Material for the Pearson × O’Reilly Live Training Session "Hands-On Data Visualization with ggplot2: Concepts" ☆11 · Aug 29, 2023 · Updated 2 years ago
- ☆11 · Feb 22, 2018 · Updated 7 years ago
- BookNLP, a natural language processing pipeline for books ☆889 · Jul 31, 2024 · Updated last year
- Active Learning for Text Classification in Python ☆639 · Feb 1, 2026 · Updated 2 weeks ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆120 · Oct 20, 2025 · Updated 3 months ago
- Tidymodels for Nested/Panel Data ☆13 · Sep 30, 2023 · Updated 2 years ago
- Reproducible face stimuli ☆17 · Feb 20, 2025 · Updated 11 months ago
- Some tools for cleaning up messy 'Excel' files to be suitable for R ☆28 · Aug 22, 2022 · Updated 3 years ago
- ☆11 · Aug 9, 2022 · Updated 3 years ago
- ☆12 · Oct 1, 2020 · Updated 5 years ago
- My tidy tuesday kludges ☆12 · Mar 29, 2023 · Updated 2 years ago
- ☆13 · Aug 13, 2020 · Updated 5 years ago