dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆123 Updated last year
Alternatives and similar repositories for DocLLM:
Users who are interested in DocLLM are comparing it to the libraries listed below.
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆190 Updated 2 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 5 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆46 Updated 6 months ago
- Fine-Tuning LLM and embedding models ☆27 Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆160 Updated 6 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆105 Updated 3 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 Updated 11 months ago
- A Python library to chunk/group your texts based on semantic similarity. ☆95 Updated 8 months ago
- DSPY on action with OpenSource LLMs. ☆69 Updated 11 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker. ☆48 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 8 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆102 Updated 11 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆107 Updated 3 weeks ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆139 Updated 9 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) ☆160 Updated 10 months ago
- Simple package to extract text with coordinates from programmatic PDFs ☆93 Updated 2 weeks ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation ☆71 Updated 2 weeks ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆265 Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆77 Updated 6 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning" ☆203 Updated 3 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆63 Updated 3 months ago
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆79 Updated 2 years ago
- This repository implements the chain of verification paper by Meta AI ☆166 Updated last year
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language. ☆101 Updated 8 months ago
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDF and QA on… ☆43 Updated last year