cnmoro / MiniVectorDB
Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model
☆23 · Updated 6 months ago
Alternatives and similar repositories for MiniVectorDB:
Users who are interested in MiniVectorDB compare it to the libraries listed below.
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp ☆14 · Updated last month
- ☆15 · Updated last week
- One Line To Build Zero-Data Classifiers in Minutes ☆36 · Updated 6 months ago
- An OpenAI Completions API compatible server for NLP transformers models ☆65 · Updated last year
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆19 · Updated 3 weeks ago
- Vector Database with support for late interaction and token level embeddings. ☆54 · Updated 6 months ago
- ☆40 · Updated 2 months ago
- Deploy a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models. ☆42 · Updated 8 months ago
- ☆17 · Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆86 · Updated 3 weeks ago
- ☆53 · Updated 10 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 · Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆30 · Updated 6 months ago
- ReLM is a Regular Expression engine for Language Models ☆103 · Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆33 · Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 · Updated 5 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆59 · Updated 7 months ago
- The backend behind the LLM-Perf Leaderboard ☆10 · Updated 11 months ago
- Very minimal (and stateless) agent framework ☆41 · Updated 2 months ago
- Pre-train Static Word Embeddings ☆52 · Updated last month
- Universal text classifier for generative models ☆22 · Updated 8 months ago
- A framework for evaluating function calls made by LLMs ☆37 · Updated 8 months ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard ☆17 · Updated 7 months ago
- ☆24 · Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆59 · Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 · Updated 4 months ago
- Sentence Embedding as a Service ☆15 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 · Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆44 · Updated 11 months ago
- ☆44 · Updated last month