instructlab / sdgLinks
Python library for Synthetic Data Generation
β42Updated this week
Alternatives and similar repositories for sdg
Users that are interested in sdg are comparing it to the libraries listed below
Sorting:
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataβ42Updated this week
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β44Updated this week
- Synthetic Data Generation Toolkit for LLMsβ26Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated 3 weeks ago
- Python library for Evaluationβ14Updated this week
- Place to hack on UI for InstructLabβ31Updated this week
- GitHub bot to assist with the taxonomy contribution workflowβ16Updated 7 months ago
- Taxonomy tree that will allow you to create models tuned with your dataβ267Updated this week
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β196Updated this week
- β41Updated 2 months ago
- InstructLab Community wide collaboration space including contributing, security, code of conduct, etcβ90Updated last week
- LM engine is a library for pretraining/finetuning LLMsβ55Updated this week
- codebase release for EMNLP2023 paper publicationβ19Updated 3 weeks ago
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β10Updated last month
- Developer documents for the InstructLab organizationβ10Updated this week
- β256Updated 6 months ago
- β66Updated last year
- An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the develoβ¦β28Updated last week
- Core repository for an AI-powered OCP assistant serviceβ51Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ99Updated last year
- Train, tune, and infer Bamba modelβ127Updated last month
- β38Updated last month
- Benchmark structured generation librariesβ27Updated 7 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generaβ¦β60Updated this week
- The repository contains generative AI analytics platform application code.β26Updated 3 weeks ago
- Your buddy in the (L)LM space.β64Updated 8 months ago
- Examples for building and running LLM services and applications locally with Podmanβ158Updated last week
- Estimate resources needed to train LLMsβ13Updated 3 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.β84Updated 2 months ago
- β47Updated last year