impira / docquery
An easy way to extract information from documents
☆1,753Updated 2 years ago
Alternatives and similar repositories for docquery:
Users that are interested in docquery are comparing it to the libraries listed below
- A Repo For Document AI☆2,810Updated 3 weeks ago
- Turn expensive prompts into cheap fine-tuned models☆2,586Updated 11 months ago
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact…☆1,437Updated 4 months ago
- A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.☆1,398Updated 3 months ago
- LLM(😽)☆1,669Updated 3 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,715Updated last year
- Classify and extract structured data with LLMs☆425Updated last year
- Open-source natural language enrichments at your fingertips.☆459Updated 3 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,228Updated last month
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,217Updated 9 months ago
- An LLM-powered advanced RAG pipeline built from scratch☆836Updated last year
- Seamlessly integrate LLMs as Python functions☆2,289Updated last week
- 🦙 Integrating LLMs into structured NLP pipelines☆1,240Updated 4 months ago
- Structured and typehinted GPT responses in Python☆738Updated 9 months ago
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)☆772Updated last year
- The simplest way to serve AI/ML models in production☆984Updated this week
- AI code-writing assistant that understands data content☆2,258Updated last year
- 🔥 🔥 🔥Open Source & AI driven Data Onboarding Platform:Free flatfile.com alternative☆895Updated last year
- ✨ AI agents that spark joy☆5,674Updated this week
- Python package for easily interfacing with chat apps, with robust features and minimal code complexity.☆3,517Updated 10 months ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,595Updated 10 months ago
- A language for constraint-guided and efficient LLM programming.☆3,922Updated 11 months ago
- Data processing with ML, LLM and Vision LLM☆4,516Updated this week
- Improved file parsing for LLM’s☆2,943Updated 5 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆2,843Updated 8 months ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆316Updated last year
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,493Updated last year
- Accurate answers and instant citations for your documents.☆1,650Updated 11 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,014Updated 2 months ago
- Whisper as a Service (GUI and API with queuing for OpenAI Whisper)☆1,904Updated last week