instructlab / sdgLinks
Python library for Synthetic Data Generation
β48Updated last week
Alternatives and similar repositories for sdg
Users that are interested in sdg are comparing it to the libraries listed below
Sorting:
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataβ42Updated this week
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β48Updated last week
- Python library for Evaluationβ15Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ61Updated this week
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β208Updated this week
- Taxonomy tree that will allow you to create models tuned with your dataβ281Updated last week
- codebase release for EMNLP2023 paper publicationβ19Updated 4 months ago
- β262Updated 2 months ago
- β43Updated last year
- The Granite Guardian models are designed to detect risks in prompts and responses.β112Updated last week
- Advanced Reasoning Benchmark Dataset for LLMsβ47Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ110Updated 9 months ago
- LM engine is a library for pretraining/finetuning LLMsβ66Updated last week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" π€β75Updated 9 months ago
- β48Updated last year
- Train, tune, and infer Bamba modelβ132Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generaβ¦β95Updated this week
- Small and Efficient Mathematical Reasoning LLMsβ71Updated last year
- Let's build better datasets, together!β263Updated 8 months ago
- Your buddy in the (L)LM space.β64Updated 11 months ago
- Pre-train Static Word Embeddingsβ85Updated last week
- β67Updated last year
- Pre-training code for CrystalCoder 7B LLMβ55Updated last year
- β31Updated 10 months ago
- Accelerating your LLM training to full speed! Made with β€οΈ by ServiceNow Researchβ225Updated this week
- Synthetic Data Generation Toolkit for LLMsβ52Updated this week
- Seemless interface of using PyTOrch distributed with Jupyter notebooksβ49Updated 2 weeks ago
- The Foundation Model Transparency Indexβ82Updated last year
- β135Updated 3 weeks ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ103Updated last year