impira / docquery
An easy way to extract information from documents
☆1,782 Updated 2 years ago
Alternatives and similar repositories for docquery
Users who are interested in docquery are comparing it to the libraries listed below.
- A Repo For Document AI ☆3,068 Updated this week
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact… ☆1,459 Updated 11 months ago
- Classify and extract structured data with LLMs ☆428 Updated 2 years ago
- LLM(😽) ☆1,685 Updated 9 months ago
- Open-source natural language enrichments at your fingertips. ☆461 Updated 10 months ago
- The simplest way to serve AI/ML models in production ☆1,084 Updated this week
- Structured and typehinted GPT responses in Python ☆742 Updated last year
- Transforms PDF, Documents and Images into Enriched Structured Data ☆6,024 Updated last year
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,275 Updated 7 months ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario… ☆1,033 Updated 8 months ago
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API. ☆818 Updated this week
- 🦘 Explore multimedia datasets at scale ☆1,060 Updated 11 months ago
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagem… ☆2,215 Updated last month
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo… ☆1,502 Updated 2 years ago
- AI code-writing assistant that understands data content ☆2,287 Updated last year
- 🔥 🔥 🔥 Open Source & AI driven Data Onboarding Platform: Free flatfile.com alternative ☆901 Updated 2 years ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLM ☆5,041 Updated this week
- Improved file parsing for LLM’s ☆3,135 Updated last year
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o… ☆2,777 Updated last year
- 🦙 Integrating LLMs into structured NLP pipelines ☆1,342 Updated 10 months ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based ☆330 Updated 2 years ago
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!) ☆771 Updated 2 years ago
- Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai ☆4,989 Updated this week
- Blazing fast framework for fine-tuning similarity learning models ☆661 Updated last month
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 ☆6,661 Updated last year
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro… ☆2,958 Updated last year
- Prompt engineering for developers ☆691 Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,849 Updated last year
- 👻 Experimental library for scraping websites using OpenAI's GPT API. ☆1,443 Updated 5 months ago
- Seamlessly integrate LLMs as Python functions ☆2,379 Updated last month