cnmoro / MiniVectorDB
Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model
☆22Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for MiniVectorDB
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆37Updated 7 months ago
- ☆11Updated last month
- LLM reads a paper and produce a working prototype☆36Updated 2 weeks ago
- Sentence Embedding as a Service☆14Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- implementation of https://arxiv.org/pdf/2312.09299☆19Updated 4 months ago
- ☆14Updated 7 months ago
- ☆41Updated 2 weeks ago
- Evaluation of bm42 sparse indexing algorithm☆62Updated 4 months ago
- ☆44Updated 4 months ago
- ☆53Updated 5 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆33Updated last month
- ☆24Updated last year
- ☆41Updated 2 months ago
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆17Updated 3 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated 2 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆41Updated 8 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆51Updated last week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 3 weeks ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆28Updated 2 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 3 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- The backend behind the LLM-Perf Leaderboard☆10Updated 6 months ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated last month
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- OpenAI compatible API for open source LLMs☆15Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Open Implementations of LLM Analyses☆94Updated last month