e-p-armstrong / augmentoolkitLinks
Create Custom LLMs
☆1,774Updated 2 weeks ago
Alternatives and similar repositories for augmentoolkit
Users that are interested in augmentoolkit are comparing it to the libraries listed below
Sorting:
- Large-scale LLM inference engine☆1,596Updated this week
- ☆1,132Updated last year
- Optimizing inference proxy for LLMs☆3,157Updated this week
- Software to implement GoT with a weviate vectorized database☆679Updated 7 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆609Updated 9 months ago
- Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive…☆714Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,932Updated this week
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆788Updated last month
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,549Updated last week
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,958Updated 2 months ago
- Chat language model that can use tools and interpret the results☆1,586Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,533Updated 6 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆621Updated last year
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,090Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆671Updated 8 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,516Updated 6 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,422Updated last year
- Generic rag framework to apply the power of LLMs on any given dataset☆659Updated 2 months ago
- Web UI for ExLlamaV2☆514Updated 9 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,775Updated 6 months ago
- LLM for Long Text Summary (Comprehensive Bulleted Notes)☆601Updated 4 months ago
- Deploy your agentic worfklows to production☆2,061Updated 2 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language☆315Updated last year
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,372Updated last week
- High-performance retrieval engine for unstructured data☆1,531Updated last week
- Harness LLMs with Multi-Agent Programming☆3,772Updated 2 weeks ago
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆670Updated 10 months ago
- LLM Frontend in a single html file☆665Updated this week
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,121Updated last month
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆414Updated last week