georgian-io / LLM-Finetuning-ToolkitLinks
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
β837Updated 7 months ago
Alternatives and similar repositories for LLM-Finetuning-Toolkit
Users that are interested in LLM-Finetuning-Toolkit are comparing it to the libraries listed below
Sorting:
- π LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). π Extracts signals from prompts & responses, ensuring saβ¦β913Updated 6 months ago
- Guide for fine-tuning Llama/Mistral/CodeLlama models and moreβ593Updated 3 weeks ago
- An LLM-powered advanced RAG pipeline built from scratchβ840Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 π―β948Updated last month
- Automatically evaluate your LLMs in Google Colabβ629Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diveβ¦β944Updated 7 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)β395Updated last year
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. β π€π€β1,017Updated 3 months ago
- A comprehensive guide to building RAG-based LLM applications for production.β1,796Updated 10 months ago
- Customizable implementation of the self-instruct paper.β1,042Updated last year
- β451Updated last year
- Automated Evaluation of RAG Systemsβ596Updated 2 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAGβ321Updated 6 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.β776Updated 3 months ago
- Fine-Tuning Embedding for RAG with Synthetic Dataβ497Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,724Updated this week
- A tool for evaluating LLMsβ419Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β857Updated last year
- Easily embed, cluster and semantically label text datasetsβ540Updated last year
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Modelsβ524Updated 11 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,390Updated this week
- Extend existing LLMs way beyond the original training length with constant memory usage, without retrainingβ697Updated last year
- Scale LLM Engine public repositoryβ799Updated 2 weeks ago
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuningβ307Updated 7 months ago
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddingsβ1,954Updated 4 months ago
- A tiny library for coding with large language models.β1,232Updated 10 months ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchβ¦β586Updated last year
- Efficient Retrieval Augmentation and Generation Frameworkβ1,558Updated 4 months ago
- Fine-tune mistral-7B on 3090s, a100s, h100sβ711Updated last year
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.β477Updated 9 months ago