dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆125Updated last year
Alternatives and similar repositories for DocLLM:
Users that are interested in DocLLM are comparing it to the libraries listed below
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆102Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆164Updated 7 months ago
- Repository for deepdoctection tutorial notebooks☆44Updated 5 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆49Updated 6 months ago
- A Python library to chunk/group your texts based on semantic similarity.☆96Updated 9 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆197Updated this week
- DSPy in action with open-source LLMs.☆70Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas…☆80Updated 2 years ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆64Updated 4 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆105Updated 2 weeks ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆32Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆106Updated 7 months ago
- Generalist and Lightweight Model for Text Classification☆123Updated 2 weeks ago
- Object Detection Model for Scanned Documents☆91Updated last month
- Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.☆188Updated last month
- Simple package to extract text with coordinates from programmatic PDFs☆109Updated 2 weeks ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI 2024)☆160Updated 10 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker☆108Updated 2 weeks ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- A novel multimodal (vision) RAG architecture☆25Updated 6 months ago
- Large language model evaluation framework with Elo leaderboard and A/B testing☆52Updated 6 months ago
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆48Updated 3 months ago