impira / docquery
An easy way to extract information from documents
☆1,729Updated last year
Alternatives and similar repositories for docquery:
Users that are interested in docquery are comparing it to the libraries listed below
- A Repo For Document AI☆2,659Updated this week
- LLM(😽)☆1,643Updated last week
- Improved file parsing for LLM’s☆2,637Updated 2 months ago
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆5,968Updated 6 months ago
- Classify and extract structured data with LLMs☆419Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,527Updated 10 months ago
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!)☆770Updated last year
- A language for constraint-guided and efficient LLM programming.☆3,768Updated 7 months ago
- ✨ Build AI interfaces that spark joy☆5,420Updated this week
- Seamlessly integrate LLMs as Python functions☆2,151Updated this week
- Stealth browsers as a service. Connect your scraper or automation to a fleet of cloud-hosted browsers configured for reliability and stea…☆2,325Updated 2 months ago
- A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.☆1,384Updated 4 months ago
- Efficient few-shot learning with Sentence Transformers☆2,311Updated this week
- 🦙 Integrating LLMs into structured NLP pipelines☆1,168Updated last week
- Structured and typehinted GPT responses in Python☆735Updated 5 months ago
- The simplest way to serve AI/ML models in production☆934Updated this week
- Creative interactive views of any dataset.☆831Updated 3 weeks ago
- Adding guardrails to large language models.☆4,362Updated this week
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact…☆1,405Updated last month
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings☆1,902Updated this week
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆304Updated last year
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,154Updated 3 months ago
- 🦘 Explore multimedia datasets at scale☆1,052Updated last month
- Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engin…☆3,366Updated 10 months ago
- Database system for AI-powered apps☆2,650Updated 8 months ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,423Updated 6 months ago
- fast vector database made in numpy☆750Updated 8 months ago
- 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, …☆2,380Updated last year
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆2,760Updated 5 months ago
- λprompt - A functional programming interface for building AI systems☆376Updated 11 months ago