e-p-armstrong / augmentoolkitLinks
Create Custom LLMs
☆1,794Updated last month
Alternatives and similar repositories for augmentoolkit
Users that are interested in augmentoolkit are comparing it to the libraries listed below
Sorting:
- Large-scale LLM inference engine☆1,610Updated last month
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆610Updated 10 months ago
- Optimizing inference proxy for LLMs☆3,252Updated last week
- ☆1,166Updated 2 weeks ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆621Updated last year
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,408Updated last month
- Software to implement GoT with a weviate vectorized database☆680Updated 9 months ago
- Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive…☆724Updated last year
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆795Updated last month
- LLM for Long Text Summary (Comprehensive Bulleted Notes)☆602Updated 6 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,015Updated 2 weeks ago
- Synthetic data curation for post-training and structured data extraction☆1,595Updated this week
- Generic rag framework to apply the power of LLMs on any given dataset☆659Updated 3 weeks ago
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,103Updated 2 weeks ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,601Updated 3 weeks ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆681Updated 9 months ago
- High-performance retrieval engine for unstructured data☆1,545Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,805Updated 7 months ago
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆675Updated 11 months ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆761Updated 10 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,652Updated 7 months ago
- Web UI for ExLlamaV2☆514Updated 11 months ago
- WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.☆1,537Updated 8 months ago
- LLM Frontend in a single html file☆679Updated last week
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆551Updated last year
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,977Updated 4 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,586Updated 2 weeks ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,522Updated 7 months ago
- Harness LLMs with Multi-Agent Programming☆3,820Updated this week
- Querying local documents, powered by LLM☆638Updated 5 months ago