e-p-armstrong / augmentoolkit
Create Custom LLMs
☆1,780Updated last month
Alternatives and similar repositories for augmentoolkit
Users who are interested in augmentoolkit are comparing it to the libraries listed below.
- Large-scale LLM inference engine☆1,603Updated 2 weeks ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆620Updated last year
- ☆1,148Updated last year
- Optimizing inference proxy for LLMs☆3,221Updated last week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆610Updated 9 months ago
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,097Updated this week
- WilmerAI is one of the oldest LLM semantic routers. It uses multi-layer prompt routing and complex workflows to allow you to not only cre…☆791Updated 2 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,966Updated this week
- Software to implement GoT with a Weaviate vectorized database☆680Updated 8 months ago
- LLM Frontend in a single html file☆670Updated 3 weeks ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆671Updated 8 months ago
- Generic RAG framework to apply the power of LLMs on any given dataset☆659Updated 3 months ago
- AlwaysReddy is an LLM voice assistant that is always just a hotkey away.☆760Updated 9 months ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆344Updated 9 months ago
- Web UI for ExLlamaV2☆514Updated 10 months ago
- Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive…☆720Updated last year
- An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.☆674Updated 10 months ago
- Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens☆539Updated last year
- Chat language model that can use tools and interpret the results☆1,588Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,566Updated 6 months ago
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,382Updated last month
- Querying local documents, powered by LLM☆635Updated 4 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,575Updated 2 weeks ago
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆423Updated last week
- A toolkit to create an optimal production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,518Updated 6 months ago
- LLM for Long Text Summary (Comprehensive Bulleted Notes)☆602Updated 5 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,965Updated 3 months ago
- Function-calling-based LLM agents☆289Updated last year
- This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)☆323Updated last year
- Customizable implementation of the self-instruct paper.☆1,050Updated last year