impira / docqueryLinks
An easy way to extract information from documents
☆1,776Updated 2 years ago
Alternatives and similar repositories for docquery
Users that are interested in docquery are comparing it to the libraries listed below
Sorting:
- A Repo For Document AI☆2,951Updated 2 weeks ago
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact…☆1,455Updated 9 months ago
- Classify and extract structured data with LLMs☆426Updated 2 years ago
- Open-source natural language enrichments at your fingertips.☆460Updated 8 months ago
- Transforms PDF, Documents and Images into Enriched Structured Data☆6,009Updated last year
- The simplest way to serve AI/ML models in production☆1,064Updated this week
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,537Updated last year
- Structured and typehinted GPT responses in Python☆743Updated last year
- LLM(😽)☆1,688Updated 7 months ago
- 🦘 Explore multimedia datasets at scale☆1,064Updated 9 months ago
- 🔥 🔥 🔥Open Source & AI driven Data Onboarding Platform:Free flatfile.com alternative☆902Updated 2 years ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,725Updated last year
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,266Updated 5 months ago
- 🦙 Integrating LLMs into structured NLP pipelines☆1,309Updated 8 months ago
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)☆771Updated 2 years ago
- AI code-writing assistant that understands data content☆2,289Updated last year
- 👻 Experimental library for scraping websites using OpenAI's GPT API.☆1,442Updated 2 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,397Updated last week
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,027Updated 6 months ago
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagem…☆2,154Updated 2 months ago
- ☆381Updated last year
- A curated list of resources for Document Understanding (DU) topic☆1,456Updated 2 years ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,814Updated last year
- Explore and understand your training and validation data.☆845Updated 8 months ago
- Software that makes labeling PDFs easy.☆420Updated last year
- Improved file parsing for LLM’s☆3,056Updated 10 months ago
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.☆790Updated last month
- Database system for AI-powered apps☆2,683Updated last year
- The InboxSDK lets you build apps for Gmail.☆798Updated last month
- A Python library to extract tabular data from PDFs☆3,421Updated last week