e-p-armstrong / augmentoolkit
Create Custom LLMs
☆1,804 · Updated 2 months ago
Alternatives and similar repositories for augmentoolkit
Users that are interested in augmentoolkit are comparing it to the libraries listed below
- Optimizing inference proxy for LLMs ☆3,288 · Updated last month
- Large-scale LLM inference engine ☆1,631 · Updated last week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆612 · Updated 11 months ago
- Software to implement GoT with a Weaviate vectorized database ☆680 · Updated 10 months ago
- ☆1,186 · Updated last month
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,642 · Updated last month
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆628 · Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,068 · Updated last week
- Chat language model that can use tools and interpret the results ☆1,592 · Updated last month
- Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive… ☆730 · Updated last year
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre… ☆799 · Updated 3 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,827 · Updated 8 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,590 · Updated last month
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆681 · Updated 10 months ago
- The official API server for Exllama. OAI compatible, lightweight, and fast. ☆1,115 · Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,684 · Updated 8 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆1,979 · Updated 5 months ago
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro… ☆926 · Updated 3 weeks ago
- function calling-based LLM agents ☆289 · Updated last year
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆4,553 · Updated last year
- Synthetic data curation for post-training and structured data extraction ☆1,609 · Updated this week
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models? ☆1,435 · Updated 2 months ago
- High-performance retrieval engine for unstructured data ☆1,551 · Updated 2 months ago
- Querying local documents, powered by LLM ☆641 · Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆841 · Updated 11 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆314 · Updated last year
- WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus. ☆1,548 · Updated 8 months ago
- AlwaysReddy is an LLM voice assistant that is always just a hotkey away. ☆761 · Updated 10 months ago
- ☆3,069 · Updated 2 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,426 · Updated last month