Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
☆386Mar 13, 2024Updated 2 years ago
Alternatives and similar repositories for fltr
Users that are interested in fltr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆45May 5, 2024Updated 2 years ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- Draft42 - Streamlit chatbot with function calling☆32Jul 6, 2025Updated 10 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆631Mar 9, 2026Updated 2 months ago
- Dockerfile for johnsmith0031/alpaca_lora_4bit☆12Apr 10, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A web-app to explore topics using LLM (less typing and more clicks)☆67Mar 15, 2026Updated 2 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆169Jan 16, 2025Updated last year
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆90Jul 26, 2024Updated last year
- ☆12Dec 19, 2023Updated 2 years ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated 2 years ago
- An AI assistant beyond the chat box.☆330Mar 11, 2024Updated 2 years ago
- A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.☆936Updated this week
- A pipeline parallel training script for LLMs.☆168Apr 30, 2025Updated last year
- A guidance compatibility layer for llama-cpp-python☆37Sep 11, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Minimal LLM inference in Rust☆1,034Oct 24, 2024Updated last year
- Bamboo-7B Large Language Model☆94Mar 28, 2024Updated 2 years ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,521Mar 4, 2026Updated 2 months ago
- spotify/annoy bindings for Rust.☆19May 2, 2023Updated 3 years ago
- Fast, flexible LLM inference☆7,130Apr 15, 2026Updated last month
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆98Apr 2, 2026Updated last month
- Generate Structured JSON with probs from Language Models☆17Mar 23, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Complex RAG backend☆29Mar 28, 2024Updated 2 years ago
- ai-validator is a powerful library that helps to extract and validate structured data from the output text of language models.☆16May 23, 2023Updated 2 years ago
- sdk for openai compatible API☆46Apr 5, 2024Updated 2 years ago
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆66Oct 9, 2023Updated 2 years ago
- From anywhere you can type, query and stream the output of any script (e.g. an LLM)☆504Apr 12, 2024Updated 2 years ago
- The one who calls upon functions - Function-Calling Language Model☆36Oct 2, 2023Updated 2 years ago
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Mar 27, 2024Updated 2 years ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆761Mar 4, 2025Updated last year
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆32Dec 29, 2023Updated 2 years ago
- Large-scale LLM inference engine☆1,727May 8, 2026Updated last week
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 10 months ago
- A multimodal, function calling powered LLM webui.☆213Sep 23, 2024Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆200Mar 18, 2026Updated 2 months ago
- ☆15Jul 1, 2024Updated last year
- Kubernetes Operator for Azure DevOps Agents☆15Updated this week