dswang2011 / DocLLM
DocLLM: A layout-aware generative language model for multimodal document understanding
☆113 Updated 10 months ago
Related projects
Alternatives and complementary repositories for DocLLM
- ☆21 Updated 8 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆133 Updated last month
- ☆162 Updated 3 weeks ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆146 Updated 7 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆126 Updated 5 months ago
- A Python library to chunk/group your texts based on semantic similarity. ☆85 Updated 4 months ago
- ☆131 Updated 4 months ago
- Repository for deepdoctection tutorial notebooks ☆39 Updated 4 months ago
- A fast and lightweight pure Python library for splitting text into semantically meaningful chunks. ☆182 Updated 4 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆50 Updated 8 months ago
- ☆105 Updated last month
- Data extraction with Donut ML model ☆56 Updated 3 months ago
- ☆98 Updated 7 months ago
- ☆182 Updated this week
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model" ☆68 Updated last week
- This repository implements the chain of verification paper by Meta AI ☆157 Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆96 Updated 7 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets. ☆47 Updated 2 months ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on… ☆39 Updated 8 months ago
- A Python client for the Unstructured hosted API ☆82 Updated this week
- Data extraction with LLM on CPU ☆110 Updated 10 months ago
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆69 Updated last month
- DSPY on action with OpenSource LLMs. ☆57 Updated 7 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning" ☆197 Updated 6 months ago
- ☆64 Updated last month
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆106 Updated 3 weeks ago
- ☆78 Updated this week
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation ☆94 Updated this week
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy. ☆265 Updated 2 weeks ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆81 Updated this week