e-p-armstrong / augmentoolkitLinks
Create Custom LLMs
☆1,700Updated 3 weeks ago
Alternatives and similar repositories for augmentoolkit
Users that are interested in augmentoolkit are comparing it to the libraries listed below
Sorting:
- Large-scale LLM inference engine☆1,502Updated this week
- ☆1,027Updated 10 months ago
- Optimizing inference proxy for LLMs☆2,722Updated 2 weeks ago
- Software to implement GoT with a weviate vectorized database☆673Updated 4 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆583Updated 5 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,842Updated this week
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,020Updated this week
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,867Updated this week
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆604Updated 9 months ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆495Updated last year
- Model swapping for llama.cpp (or any local OpenAPI compatible server)☆1,138Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,347Updated 2 weeks ago
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain …☆741Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,344Updated 2 months ago
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,229Updated 2 weeks ago
- Synthetic data curation for post-training and structured data extraction☆1,474Updated last week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,457Updated 2 months ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆390Updated 3 months ago
- Customizable implementation of the self-instruct paper.☆1,048Updated last year
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆747Updated 5 months ago
- High-performance retrieval engine for unstructured data☆1,468Updated last week
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆466Updated this week
- Manifold is a platform for enabling workflow automation using AI assistants.☆456Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆807Updated 6 months ago
- Web UI for ExLlamaV2☆505Updated 6 months ago
- LLM for Long Text Summary (Comprehensive Bulleted Notes)☆580Updated last month
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,515Updated 2 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,616Updated 2 months ago
- A multi-platform desktop application to evaluate and compare LLM models, written in Rust and React.☆805Updated 3 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆652Updated 4 months ago