parsee-ai / parsee-pdf-readerLinks
Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-paragraphs. Full support for scans and images.
☆62Updated 2 weeks ago
Alternatives and similar repositories for parsee-pdf-reader
Users that are interested in parsee-pdf-reader are comparing it to the libraries listed below
Sorting:
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆73Updated 2 weeks ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆47Updated last year
- ☆101Updated last year
- Data extraction with LLM on CPU☆112Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆128Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆85Updated last year
- Integrated LLM-based document and data Q&A with knowledge graph visualization☆22Updated last year
- ☆66Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆122Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆45Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex. It demonstrates how to impl. chunking, indexing, and source citation.☆45Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated 10 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆70Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 10 months ago
- ☆122Updated 6 months ago
- ☆45Updated last year
- Lightweight Non-Parametric Embedding Fine-Tuning☆36Updated 11 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆178Updated 11 months ago
- A simple tool that serves as a knowledge graph explorer utilizing the GPT 3.5 turbo model to help users explore information in an organiz…☆59Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
- Repository for deepdoctection tutorial notebooks☆46Updated 2 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Updated last year
- ☆20Updated last year
- AI real estate agent☆35Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆115Updated last month
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆90Updated this week
- Build document-native LLM applications☆54Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 8 months ago
- Using LlamaIndex, Redis, and OpenAI to chat with PDF documents. Supplementary material for blog post on Microsoft Developer Blog☆114Updated last year