bentoml / BentoColPaliLinks
☆23Updated 9 months ago
Alternatives and similar repositories for BentoColPali
Users that are interested in BentoColPali are comparing it to the libraries listed below
Sorting:
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 10 months ago
- ☆12Updated last year
- ☆80Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆117Updated 8 months ago
- Python API for https://vespa.ai, the open big data serving engine☆158Updated this week
- ☆37Updated 8 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Updated last year
- MLFlow Deployment Plugin for Ray Serve☆46Updated 3 years ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- ☆210Updated 7 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- A library for minimizing the effects of confounding covariates☆15Updated 8 months ago
- Interactive Debugging for Retrieval Augmented Generation Pipelines☆32Updated 7 months ago
- Command Line Interface for Hugging Face Inference Endpoints☆65Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆181Updated 8 months ago
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a…☆70Updated last week
- Generalist and Lightweight Model for Text Classification☆169Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆83Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆60Updated 2 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Updated last year
- Iterate fast on your RAG pipelines☆24Updated 7 months ago
- Metaflow flows for analyzing topics and sentiments in Hacker News☆22Updated last year