e-p-armstrong / augmentoolkit
Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.
☆1,409 Updated 2 months ago
Alternatives and similar repositories for augmentoolkit:
Users who are interested in augmentoolkit are comparing it to the libraries listed below.
- ☆849 Updated 7 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,629 Updated this week
- Large-scale LLM inference engine ☆1,379 Updated last week
- Optimizing inference proxy for LLMs ☆2,150 Updated 2 weeks ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆551 Updated 2 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,375 Updated 2 weeks ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,058 Updated 2 weeks ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆771 Updated 2 months ago
- High-performance retrieval engine for unstructured data ☆1,316 Updated last week
- Synthetic data curation for post-training and structured data extraction ☆1,209 Updated this week
- Efficient visual programming for AI language models ☆356 Updated 7 months ago
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆1,770 Updated last month
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆571 Updated 3 weeks ago
- Build datasets using natural language ☆453 Updated last month
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ☆577 Updated 5 months ago
- Model swapping for llama.cpp (or any local OpenAPI compatible server) ☆527 Updated this week
- Customizable implementation of the self-instruct paper. ☆1,043 Updated last year
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. ☆1,721 Updated this week
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models? ☆1,042 Updated 2 months ago
- This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated … ☆1,144 Updated last month
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT. ☆764 Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ☆1,185 Updated last week
- Generic rag framework to apply the power of LLMs on any given dataset ☆592 Updated 3 weeks ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning. ☆737 Updated last month
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data ☆1,407 Updated 2 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,130 Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,386 Updated 2 months ago
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain … ☆640 Updated last week
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆301 Updated 10 months ago
- An OAI compatible exllamav2 API that's both lightweight and fast ☆907 Updated this week