cnmoro / MiniVectorDB
Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model
☆22 Updated last month
Related projects
Alternatives and complementary repositories for MiniVectorDB
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆36 Updated 7 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 Updated last week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆20 Updated 9 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆36 Updated 9 months ago
- A pipeline that uses API calls to agnostically convert unstructured data into structured training data ☆28 Updated last month
- Pre-training code for CrystalCoder 7B LLM ☆53 Updated 6 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆53 Updated 2 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆54 Updated 2 months ago
- ☆24 Updated last year
- LLMs as Collaboratively Edited Knowledge Bases ☆42 Updated 8 months ago
- ☆36 Updated 2 years ago
- Sentence Embedding as a Service ☆14 Updated last year
- Tools for merging pretrained large language models. ☆19 Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆59 Updated last week
- ☆40 Updated this week
- ☆20 Updated 9 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆42 Updated 7 months ago
- Official repository for RAGVIZ: Diagnose and Visualize Retrieval-Augmented Generation ☆21 Updated this week
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread ☆19 Updated 7 months ago
- Open Implementations of LLM Analyses ☆94 Updated last month
- Data preparation code for CrystalCoder 7B LLM ☆42 Updated 6 months ago
- ☆11 Updated 3 weeks ago
- Efficient few-shot learning with cross-encoders. ☆40 Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- The backend behind the LLM-Perf Leaderboard ☆11 Updated 6 months ago
- Vector Database with support for late interaction and token level embeddings. ☆53 Updated last month
- NLP with Rust for Python 🦀🐍 ☆59 Updated 5 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- Evaluation of the BM42 sparse indexing algorithm ☆60 Updated 4 months ago