AnswerDotAI / byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

☆807

Alternatives and similar repositories for byaldi

Users who are interested in byaldi are comparing it to the libraries listed below.

AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,505 Updated 2 months ago
tonywu71 / colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆318 Updated 2 months ago
illuin-tech / colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,106 Updated last week
PrithivirajDamodaran / FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…
☆842 Updated last month
weaviate / recipes
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!
☆814 Updated this week
brandonstarxel / chunking_evaluation
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…
☆373 Updated 4 months ago
adithya-s-k / VARAG
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
☆477 Updated 2 weeks ago
PragmaticMachineLearning / docai
Structured information extraction from documents
☆317 Updated 10 months ago
jina-ai / late-chunking
Code for explaining and evaluating late chunking (chunked pooling)
☆427 Updated 7 months ago
ganarajpr / awesome-dspy
An Awesome list of curated DSPy resources.
☆390 Updated 5 months ago
xhluca / bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
☆1,267 Updated 2 months ago
KarelDO / xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆434 Updated last year
lightonai / pylate
Late Interaction Models Training & Retrieval
☆521 Updated 2 weeks ago
run-llama / multi-agent-concierge
An example of multi-agent orchestration with llama-index
☆429 Updated 6 months ago
aurelio-labs / semantic-chunkers
☆231 Updated last month
microsoft / sammo
A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)
☆716 Updated last month
huggingface / yourbench
🤗 Benchmark Large Language Models Reliably On Your Data
☆381 Updated this week
D-Star-AI / dsRAG
High-performance retrieval engine for unstructured data
☆1,459 Updated last week
isaacus-dev / semchunk
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
☆348 Updated last month
MinishLab / semhash
Fast Semantic Text Deduplication & Filtering
☆774 Updated 2 months ago
NVIDIA-AI-Blueprints / multimodal-pdf-data-extraction
NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG
☆344 Updated 4 months ago
MinishLab / model2vec
Fast State-of-the-Art Static Embeddings
☆1,782 Updated this week
diicellman / dspy-rag-fastapi
FastAPI wrapper around DSPy
☆258 Updated last year
FutureClubNL / RAGMeUp
Generic rag framework to apply the power of LLMs on any given dataset
☆633 Updated last month
deepset-ai / haystack-cookbook
👩🏻‍🍳 A collection of example notebooks using Haystack
☆490 Updated this week
argilla-io / synthetic-data-generator
Build datasets using natural language
☆507 Updated 2 months ago
merveenoyan / smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
☆1,546 Updated 2 weeks ago
IntelLabs / RAG-FiT
Framework for enhancing LLMs for RAG tasks using fine-tuning.
☆747 Updated 2 months ago
lotus-data / lotus
LOTUS: A semantic query engine for fast and easy LLM-powered data processing
☆1,256 Updated last week
neuml / annotateai
📝 Automatically annotate papers using LLMs
☆332 Updated 3 months ago