instructlab / sdg
Python library for Synthetic Data Generation
☆51 · Updated this week
Alternatives and similar repositories for sdg
Users who are interested in sdg are comparing it to the libraries listed below.
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data ☆44 · Updated this week
- Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆52 · Updated this week
- Python library for Evaluation ☆16 · Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆62 · Updated 2 months ago
- Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆212 · Updated this week
- Taxonomy tree that will allow you to create models tuned with your data ☆287 · Updated 3 months ago
- ☆43 · Updated last year
- codebase release for EMNLP2023 paper publication ☆19 · Updated 2 months ago
- ☆266 · Updated 5 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆122 · Updated 2 months ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 · Updated 2 years ago
- Train, tune, and infer Bamba model ☆137 · Updated 6 months ago
- LM engine is a library for pretraining/finetuning LLMs ☆77 · Updated this week
- ☆49 · Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" ☆76 · Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆225 · Updated last week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆106 · Updated 2 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆110 · Updated 11 months ago
- ☆55 · Updated last year
- PageRank for LLMs ☆51 · Updated 3 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆151 · Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis ☆143 · Updated 8 months ago
- Your buddy in the (L)LM space. ☆64 · Updated last year
- The Foundation Model Transparency Index ☆82 · Updated last year
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆26 · Updated 3 weeks ago
- Pre-training code for CrystalCoder 7B LLM ☆55 · Updated last year
- ☆52 · Updated last year
- Training setup for Langchain's Open Deep Research ☆72 · Updated 3 months ago
- ☆31 · Updated last year
- Seamless interface for using PyTorch distributed with Jupyter notebooks ☆57 · Updated 2 months ago