instructlab / sdgLinks
Python library for Synthetic Data Generation
☆51Updated 2 weeks ago
Alternatives and similar repositories for sdg
Users that are interested in sdg are comparing it to the libraries listed below
Sorting:
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆44Updated last week
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆55Updated last month
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Updated 4 months ago
- Python library for Evaluation☆16Updated last week
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆212Updated last week
- Taxonomy tree that will allow you to create models tuned with your data☆289Updated 4 months ago
- ☆269Updated 6 months ago
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- ☆42Updated last year
- codebase release for EMNLP2023 paper publication☆19Updated 4 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- ☆55Updated last year
- ☆50Updated last year
- ☆51Updated 3 months ago
- ☆138Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 4 months ago
- Pre-training code for CrystalCoder 7B LLM☆56Updated last year
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆278Updated last year
- Ongoing research training transformer models at scale☆42Updated this week
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
- Synthetic Data Generation Toolkit for LLMs☆85Updated this week
- ☆31Updated last year
- The Granite Guardian models are designed to detect risks in prompts and responses.☆127Updated 3 months ago
- ☆52Updated last year
- PyTorch implementation for MRL☆20Updated last year