moritztng / fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
☆377 · Updated last year
Alternatives and similar repositories for fltr:
Users who are interested in fltr are comparing it to the libraries listed below:
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆737 · Updated last week
- Stop messing around with finicky sampling parameters and just use DRµGS! ☆348 · Updated 10 months ago
- Visualize the intermediate output of Mistral 7B ☆354 · Updated 2 months ago
- From anywhere you can type, query and stream the output of an LLM or any other script ☆493 · Updated last year
- A multimodal, function calling powered LLM webui. ☆214 · Updated 6 months ago
- Web UI for ExLlamaV2 ☆491 · Updated 2 months ago
- Replace OpenAI with Llama.cpp Automagically. ☆313 · Updated 10 months ago
- A fast batching API to serve LLM models ☆182 · Updated 11 months ago
- LLM-powered lossless compression tool ☆279 · Updated 7 months ago
- An implementation of bucketMul LLM inference ☆216 · Updated 9 months ago
- ☆163 · Updated 10 months ago
- GGUF implementation in C as a library and a tools CLI program ☆264 · Updated 3 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆551 · Updated last month
- Minimal LLM inference in Rust ☆982 · Updated 5 months ago
- Mistral7B playing DOOM ☆130 · Updated 9 months ago
- Fully neural approach for text chunking ☆26 · Updated this week
- function calling-based LLM agents ☆286 · Updated 7 months ago
- ai for jq ☆240 · Updated 6 months ago
- An AI assistant beyond the chat box. ☆325 · Updated last year
- Open source alternative to Perplexity AI with ability to run locally ☆199 · Updated 6 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines ☆122 · Updated last week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆761 · Updated 8 months ago
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes ☆194 · Updated last month
- Finetune llama2-70b and codellama on MacBook Air without quantization ☆448 · Updated last year
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors. ☆385 · Updated last month
- OpenAI compatible API for serving LLAMA-2 model ☆218 · Updated last year
- Rust+OpenCL+AVX2 implementation of LLaMA inference code ☆545 · Updated last year
- Run any ML model from any programming language. ☆422 · Updated last year
- Rust framework for LLM orchestration ☆202 · Updated 8 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp ☆216 · Updated 10 months ago