cnmoro / MiniVectorDB
Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model
☆23 Updated last year
Alternatives and similar repositories for MiniVectorDB
Users who are interested in MiniVectorDB are comparing it to the libraries listed below.
- Pre-train Static Word Embeddings ☆94 Updated 4 months ago
- An OpenAI Completions API compatible server for NLP transformers models ☆66 Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆69 Updated 2 months ago
- ReLM is a Regular Expression engine for Language Models ☆107 Updated 2 years ago
- Sentence Embedding as a Service ☆15 Updated 7 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆169 Updated 2 years ago
- experiments with inference on llama ☆103 Updated last year
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp ☆14 Updated 11 months ago
- Benchmarking suite for popular AI APIs ☆88 Updated last year
- ☆198 Updated last year
- ☆68 Updated last year
- A collection of reproducible inference engine benchmarks ☆38 Updated 9 months ago
- Evaluation of bm42 sparse indexing algorithm ☆72 Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 Updated 4 months ago
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration" ☆56 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated 2 years ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆115 Updated 9 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆32 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆112 Updated last year
- Vector Database with support for late interaction and token level embeddings. ☆54 Updated 7 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆90 Updated 4 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆66 Updated 2 years ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard ☆20 Updated last year
- Python library to use Pleias-RAG models ☆68 Updated 9 months ago
- A framework for evaluating function calls made by LLMs ☆40 Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆90 Updated 3 weeks ago
- ☆23 Updated 2 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆44 Updated last year
- ☆53 Updated 11 months ago