moritztng / fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
☆374 Updated 8 months ago
Related projects
Alternatives and complementary repositories for fltr
- Stateful load balancer custom-tailored for llama.cpp ☆563 Updated this week
- Replace OpenAI with Llama.cpp Automagically. ☆289 Updated 5 months ago
- Web UI for ExLlamaV2 ☆445 Updated last month
- Visualize the intermediate output of Mistral 7B ☆313 Updated 9 months ago
- Stop messing around with finicky sampling parameters and just use DRµGS! ☆318 Updated 5 months ago
- A fast batching API to serve LLM models ☆172 Updated 6 months ago
- LLM-powered lossless compression tool ☆252 Updated 3 months ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens ☆333 Updated 5 months ago
- A multimodal, function calling powered LLM webui. ☆208 Updated last month
- An implementation of bucketMul LLM inference ☆214 Updated 4 months ago
- From anywhere you can type, query and stream the output of an LLM or any other script ☆474 Updated 7 months ago
- Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field. ☆347 Updated 3 weeks ago
- Mistral7B playing DOOM ☆122 Updated 4 months ago
- An AI assistant beyond the chat box. ☆315 Updated 8 months ago
- LLM Frontend in a single html file ☆259 Updated 2 weeks ago
- Rust framework for LLM orchestration ☆198 Updated 3 months ago
- ☆162 Updated 5 months ago
- LLaVA server (llama.cpp). ☆177 Updated last year
- ☆227 Updated last month
- Minimal LLM inference in Rust ☆925 Updated 3 weeks ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆232 Updated 5 months ago
- GGUF implementation in C as a library and a tools CLI program ☆244 Updated 4 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆493 Updated 3 months ago
- AI management tool ☆107 Updated last week
- LLM Analytics ☆615 Updated last month
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- ☆149 Updated 4 months ago
- Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer… ☆416 Updated this week
- Open source alternative to Perplexity AI with ability to run locally ☆150 Updated last month