InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data
☆50Mar 6, 2026Updated last week
Alternatives and similar repositories for training
Users that are interested in training are comparing it to the libraries listed below
Sorting:
- Python library for Synthetic Data Generation☆52Feb 16, 2026Updated last month
- Python library for Evaluation☆16Feb 16, 2026Updated last month
- Taxonomy tree that will allow you to create models tuned with your data☆292Sep 8, 2025Updated 6 months ago
- Place to hack on UI for InstructLab☆36Feb 11, 2026Updated last month
- InstructLab Core package. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data…☆1,410Feb 16, 2026Updated last month
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated last year
- ☆12Dec 23, 2024Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Sep 18, 2025Updated 6 months ago
- Measuring General Intelligence With Generated Games (Preprint)☆25Jul 30, 2025Updated 7 months ago
- library for measuring communication in distributed-memory parallel applications that use the standard Message-Passing Interface (MPI)☆22Sep 17, 2025Updated 6 months ago
- ☆19Jan 31, 2022Updated 4 years ago
- Estimate resources needed to train LLMs☆14Feb 10, 2026Updated last month
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12May 14, 2024Updated last year
- A Python library for inference-time scaling LLMs☆32Mar 12, 2026Updated last week
- Kontalk Public Network☆15Sep 5, 2018Updated 7 years ago
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated 10 months ago
- A list where most values will be None (or default)☆11Jul 19, 2023Updated 2 years ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆18Jun 5, 2025Updated 9 months ago
- DO480 Repository for Sample Applications☆12Aug 21, 2024Updated last year
- OpenShift Multi-Cluster Management Handbook, published by Packt☆14Dec 4, 2023Updated 2 years ago
- Workflow, visualizations and data services for managing NGO projects and programs☆11Dec 16, 2022Updated 3 years ago
- ☆19Jan 28, 2026Updated last month
- AlmaLinux OS SBOM data management utility.☆16Jan 20, 2026Updated 2 months ago
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- This repo should serve as a central source for users to raise issues/questions/requests for Operate First.☆15Dec 6, 2022Updated 3 years ago
- ☆38Sep 6, 2021Updated 4 years ago
- Simple MapReduce implementation in Python, for text file parallel processing☆20Mar 3, 2012Updated 14 years ago
- ☆136Dec 20, 2025Updated 2 months ago
- ☆20Jan 29, 2025Updated last year
- A set of tools to create synthetically-generated data from documents☆42Aug 15, 2025Updated 7 months ago
- Confidential inference in enclave for OpenAI grant. Uses k3s and Triton☆15Mar 20, 2025Updated 11 months ago
- ☆42Mar 11, 2026Updated last week
- Semantic prefix map registry☆13Feb 20, 2026Updated 3 weeks ago
- ☆15Sep 18, 2025Updated 6 months ago
- create multiple live distro on usb - ruby version☆13Nov 18, 2013Updated 12 years ago
- Yet Another YAML Parser, in pure python.☆18Jun 11, 2016Updated 9 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- DoCO, the Document Components Ontology, is an ontology for describing the component parts of a bibliographic document. It forms part of S…☆16Sep 7, 2019Updated 6 years ago
- A collection of regular expressions to identify references to state laws.☆19Sep 28, 2015Updated 10 years ago