impira / docquery
An easy way to extract information from documents
☆1,772 Updated 2 years ago
Alternatives and similar repositories for docquery
Users interested in docquery are comparing it to the libraries listed below.
- A Repo For Document AI ☆2,927 Updated this week
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact… ☆1,452 Updated 8 months ago
- LLM(😽) ☆1,686 Updated 6 months ago
- Structured and typehinted GPT responses in Python ☆743 Updated last year
- AI code-writing assistant that understands data content ☆2,282 Updated last year
- Open-source natural language enrichments at your fingertips. ☆458 Updated 7 months ago
- The simplest way to serve AI/ML models in production ☆1,040 Updated this week
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario… ☆1,023 Updated 5 months ago
- 🔥 🔥 🔥 Open Source & AI driven Data Onboarding Platform: Free flatfile.com alternative ☆897 Updated 2 years ago
- Classify and extract structured data with LLMs ☆425 Updated 2 years ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based ☆326 Updated last year
- Transforms PDF, Documents and Images into Enriched Structured Data ☆5,998 Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,797 Updated last year
- 🦙 Integrating LLMs into structured NLP pipelines ☆1,300 Updated 7 months ago
- A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap. ☆1,401 Updated 6 months ago
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API. ☆776 Updated last week
- Improved file parsing for LLM’s ☆3,044 Updated 9 months ago
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!) ☆772 Updated 2 years ago
- Explore large language models in 512MB of RAM ☆1,195 Updated 3 weeks ago
- Seamlessly integrate LLMs as Python functions ☆2,359 Updated 2 months ago
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagem… ☆2,147 Updated last month
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 ☆6,490 Updated last year
- Explore and understand your training and validation data. ☆845 Updated 8 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,261 Updated 4 months ago
- Drive a browser with GPT-3 ☆1,926 Updated last year
- A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search. ☆747 Updated 10 months ago
- Zoomable, animated scatterplots in the browser that scales over a billion points ☆1,129 Updated 2 months ago
- 🦘 Explore multimedia datasets at scale ☆1,065 Updated 8 months ago
- ☆374 Updated last year
- λprompt - A functional programming interface for building AI systems ☆380 Updated last year