parsee-ai / parsee-pdf-reader
Parsee's PDF reader, specialized in extracting tables with numeric values and in accurately extracting and preserving text paragraphs. Full support for scans and images.
☆58 Updated 2 months ago
Alternatives and similar repositories for parsee-pdf-reader:
Users who are interested in parsee-pdf-reader are comparing it to the libraries listed below.
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized in PDFs and HTML files. Extensive support for tabular… ☆67 Updated last week
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from a PDF and QA on… ☆43 Updated last year
- ☆22 Updated last year
- ☆120 Updated last month
- ☆45 Updated 11 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ☆32 Updated last year
- Data extraction with LLM on CPU ☆112 Updated last year
- Agentic RAG with Langchain, Qdrant and CrewAI ☆53 Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 5 months ago
- Experimenting with the text-embeddings-inference server on both CPU and GPU ☆18 Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆39 Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ☆88 Updated this week
- A tutorial on DSPy and whether automated prompt engineering lives up to the hype ☆22 Updated 11 months ago
- Integrated LLM-based document and data Q&A with knowledge graph visualization ☆21 Updated last year
- ☆29 Updated last year
- Repository for deepdoctection tutorial notebooks ☆43 Updated 4 months ago
- A library to extract the main content from HTML. Developed for information on LLM and for feeding data into LangChain and LlamaIndex. ☆37 Updated 10 months ago
- ☆101 Updated 11 months ago
- ☆20 Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆46 Updated 6 months ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented … ☆84 Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆27 Updated last year
- Example LangGraph flow that does "competitor analysis" on the web. ☆28 Updated 10 months ago
- RAG chatbot boilerplate based on React and TypeScript ☆33 Updated 11 months ago
- ☆13 Updated last year
- ☆18 Updated last year
- A RAG that can scale 🧑🏻‍💻 ☆11 Updated 10 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆46 Updated 5 months ago
- Convert Everything to PDF ☆17 Updated last week
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi… ☆34 Updated 2 weeks ago