Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
☆387 · Mar 13, 2024 · Updated 2 years ago
Alternatives and similar repositories for fltr
Users that are interested in fltr are comparing it to the libraries listed below.
- ☆45 · May 5, 2024 · Updated 2 years ago
- an auto-sleeping and -waking framework around llama.cpp ☆13 · Feb 8, 2025 · Updated last year
- Draft42 - Streamlit chatbot with function calling ☆32 · Jul 6, 2025 · Updated 11 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆647 · Mar 9, 2026 · Updated 3 months ago
- A web-app to explore topics using LLM (less typing and more clicks) ☆67 · Mar 15, 2026 · Updated 3 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models" ☆169 · Jan 16, 2025 · Updated last year
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th… ☆91 · Jul 26, 2024 · Updated last year
- ☆12 · Dec 19, 2023 · Updated 2 years ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote … ☆15 · Apr 28, 2024 · Updated 2 years ago
- An AI assistant beyond the chat box. ☆329 · Mar 11, 2024 · Updated 2 years ago
- A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React. ☆941 · May 29, 2026 · Updated last month
- A pipeline parallel training script for LLMs. ☆169 · Apr 30, 2025 · Updated last year
- A guidance compatibility layer for llama-cpp-python ☆37 · Sep 11, 2023 · Updated 2 years ago
- Minimal LLM inference in Rust ☆1,036 · Oct 24, 2024 · Updated last year
- Bamboo-7B Large Language Model ☆95 · Mar 28, 2024 · Updated 2 years ago
- Fast, flexible LLM inference ☆7,362 · Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,567 · Mar 4, 2026 · Updated 3 months ago
- spotify/annoy bindings for Rust. ☆19 · May 2, 2023 · Updated 3 years ago
- Generate Structured JSON with probs from Language Models ☆17 · Mar 23, 2025 · Updated last year
- Attend - to what matters. ☆17 · Feb 22, 2025 · Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text ☆100 · Apr 2, 2026 · Updated 2 months ago
- Complex RAG backend ☆29 · Mar 28, 2024 · Updated 2 years ago
- ai-validator is a powerful library that helps to extract and validate structured data from the output text of language models. ☆16 · May 23, 2023 · Updated 3 years ago
- sdk for openai compatible API ☆46 · Apr 5, 2024 · Updated 2 years ago
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc. ☆67 · Oct 9, 2023 · Updated 2 years ago
- From anywhere you can type, query and stream the output of any script (e.g. an LLM) ☆504 · Apr 12, 2024 · Updated 2 years ago
- The one who calls upon functions - Function-Calling Language Model ☆36 · Oct 2, 2023 · Updated 2 years ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away. ☆756 · Mar 4, 2025 · Updated last year
- Experimental sampler to make LLMs more creative ☆31 · Aug 2, 2023 · Updated 2 years ago
- ☆32 · Dec 29, 2023 · Updated 2 years ago
- Large-scale LLM inference engine ☆1,771 · May 8, 2026 · Updated last month
- AirLLM 70B inference with single 4GB GPU ☆21 · Jun 27, 2025 · Updated last year
- A multimodal, function calling powered LLM webui. ☆213 · Sep 23, 2024 · Updated last year
- ☆15 · Jul 1, 2024 · Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C ☆202 · Mar 18, 2026 · Updated 3 months ago
- Kubernetes Operator for Azure DevOps Agents ☆15 · Updated this week
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 · Jun 16, 2023 · Updated 3 years ago
- AICI: Prompts as (Wasm) Programs ☆2,078 · Jan 22, 2025 · Updated last year
- LLM-powered lossless compression tool ☆314 · Jun 16, 2026 · Updated 2 weeks ago