illuin-tech / colpaliLinks
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,231Updated this week
Alternatives and similar repositories for colpali
Users that are interested in colpali are comparing it to the libraries listed below
Sorting:
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,543Updated 4 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆826Updated 8 months ago
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆1,425Updated last year
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,335Updated 3 weeks ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,116Updated 3 weeks ago
- High-performance retrieval engine for unstructured data☆1,502Updated 2 months ago
- RAGChecker: A Fine-grained Framework For Diagnosing RAG☆993Updated 9 months ago
- PyMuPDF4LLM☆1,058Updated 2 months ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,359Updated last week
- Fast State-of-the-Art Static Embeddings☆1,846Updated 2 weeks ago
- Developer APIs to Accelerate LLM Projects☆1,724Updated 11 months ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,380Updated 4 months ago
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a…☆2,819Updated 3 weeks ago
- Parsing-free RAG supported by VLMs☆793Updated 7 months ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,252Updated 4 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,690Updated 4 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆453Updated 9 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,899Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,616Updated 2 weeks ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆749Updated 4 months ago
- Synthetic data curation for post-training and structured data extraction☆1,511Updated 2 months ago
- Knowledge Agents and Management in the Cloud☆4,151Updated last week
- HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)☆445Updated 3 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆863Updated 2 weeks ago
- Efficient Retrieval Augmentation and Generation Framework☆1,719Updated 8 months ago
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'☆1,596Updated 8 months ago
- Automated Evaluation of RAG Systems☆658Updated 6 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,526Updated 3 months ago
- Bringing BERT into modernity via both architecture changes and scaling☆1,525Updated 3 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆851Updated last week