moritztng / fltrLinks
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
☆386Updated last year
Alternatives and similar repositories for fltr
Users that are interested in fltr are comparing it to the libraries listed below
Sorting:
- Stop messing around with finicky sampling parameters and just use DRµGS!☆360Updated last year
- Mistral7B playing DOOM☆139Updated last year
- From anywhere you can type, query and stream the output of any script (e.g. an LLM)☆503Updated last year
- Web UI for ExLlamaV2☆513Updated last year
- Replace OpenAI with Llama.cpp Automagically.☆328Updated last year
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R…☆1,447Updated this week
- A fast batching API to serve LLM models☆189Updated last year
- Visualize the intermediate output of Mistral 7B☆384Updated last year
- An AI assistant beyond the chat box.☆329Updated last year
- An implementation of bucketMul LLM inference☆224Updated last year
- LLM-powered lossless compression tool☆302Updated last month
- A multimodal, function calling powered LLM webui.☆216Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆242Updated 6 months ago
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- function calling-based LLM agents☆289Updated last year
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆65Updated 2 years ago
- All-in-one desktop app for running LLMs locally.☆463Updated 2 weeks ago
- Minimal LLM inference in Rust☆1,029Updated last year
- ☆166Updated last year
- git-like rag pipeline☆256Updated last month
- Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field.☆435Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization☆450Updated last year
- ☆337Updated 6 months ago
- Run any ML model from any programming language.☆424Updated 2 years ago
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆425Updated 11 months ago
- ☆135Updated last year
- OpenAI compatible API for serving LLAMA-2 model☆218Updated 2 years ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆493Updated 2 years ago
- Live-bending a foundation model’s output at neural network level.☆273Updated 10 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated 7 months ago