e-p-armstrong / augmentoolkitLinks
Create Custom LLMs
☆1,760Updated last month
Alternatives and similar repositories for augmentoolkit
Users that are interested in augmentoolkit are comparing it to the libraries listed below
Sorting:
- Optimizing inference proxy for LLMs☆3,060Updated 2 weeks ago
 - Large-scale LLM inference engine☆1,579Updated this week
 - ☆1,111Updated last year
 - WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆785Updated 3 weeks ago
 - An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆620Updated last year
 - Software to implement GoT with a weviate vectorized database☆678Updated 7 months ago
 - The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,071Updated 2 weeks ago
 - The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆602Updated 8 months ago
 - AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆758Updated 7 months ago
 - Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,912Updated this week
 - Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆316Updated last year
 - Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆518Updated last year
 - LLM Frontend in a single html file☆653Updated this week
 - Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆338Updated 8 months ago
 - Enforce the output format (JSON Schema, Regex etc) of a language model☆1,944Updated 2 months ago
 - function calling-based LLM agents☆289Updated last year
 - Chat language model that can use tools and interpret the results☆1,586Updated this week
 - An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆668Updated 9 months ago
 - 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆411Updated 5 months ago
 - Web UI for ExLlamaV2☆511Updated 8 months ago
 - Manifold is a platform for enabling workflow automation using AI assistants.☆464Updated last week
 - Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆667Updated 7 months ago
 - A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.☆853Updated 6 months ago
 - Reliable model swapping for any local OpenAI compatible server - llama.cpp, vllm, etc☆1,764Updated this week
 - Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,527Updated last week
 - A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,353Updated 2 months ago
 - This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,354Updated 3 weeks ago
 - Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,526Updated 5 months ago
 - Efficient visual programming for AI language models☆361Updated 5 months ago
 - A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,562Updated 5 months ago