KevKibe / memvectordbLinks
β‘οΈLightning fast in-memory VectorDB written in rustπ¦
β23Updated 5 months ago
Alternatives and similar repositories for memvectordb
Users that are interested in memvectordb are comparing it to the libraries listed below
Sorting:
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rustβ80Updated last year
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.β62Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedbackβ101Updated 5 months ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Faceβ37Updated last year
- Light WebUI for lm.rsβ24Updated 10 months ago
- Ask shortgpt for instant and concise answersβ13Updated 2 years ago
- Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly aβ¦β73Updated 2 months ago
- β138Updated last year
- Rust implementation of Suryaβ60Updated 5 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rustβ38Updated 2 years ago
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GRβ¦β39Updated last year
- OpenAI compatible API for serving LLAMA-2 modelβ218Updated last year
- Fast serverless LLM inference, in Rust.β88Updated 5 months ago
- Run AI models anywhere. https://muna.ai/exploreβ63Updated this week
- Unofficial python bindings for the rust llm library. πβ€οΈπ¦β75Updated 2 years ago
- β26Updated 8 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOSβ36Updated last year
- Simple orchestration for EC2 spot containersβ19Updated 11 months ago
- A CLI in Rust to generate synthetic data for MLX friendly trainingβ24Updated last year
- A Fish Speech implementation in Rust, with Candle.rsβ94Updated 2 months ago
- implement llava using candleβ15Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structureβ51Updated 10 months ago
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manualβ22Updated last year
- β10Updated 2 years ago
- Ask questions, get insights from reposβ82Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1β¦β14Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)β63Updated 9 months ago
- A high performance batching router optimises max throughput for text inference workloadβ16Updated last year
- Neural search for web-sites, docs, articles - online!β138Updated 3 weeks ago
- Using modal.com to process FineWeb-edu dataβ20Updated 4 months ago