instructlab / sdg
Python library for Synthetic Data Generation
☆51 Updated 3 weeks ago
Alternatives and similar repositories for sdg
Users who are interested in sdg compare it to the libraries listed below.
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data ☆44 Updated last week
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆54 Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆62 Updated 3 months ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆212 Updated 2 weeks ago
- Python library for Evaluation ☆16 Updated last week
- Taxonomy tree that will allow you to create models tuned with your data ☆287 Updated 3 months ago
- codebase release for EMNLP2023 paper publication ☆19 Updated 3 months ago
- ☆43 Updated last year
- ☆267 Updated 6 months ago
- Train, tune, and infer Bamba model ☆137 Updated 6 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆123 Updated 2 months ago
- Synthetic Data Generation Toolkit for LLMs ☆80 Updated last week
- Ongoing research training transformer models at scale ☆42 Updated 2 weeks ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated 2 years ago
- ☆50 Updated 2 months ago
- Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna) ☆28 Updated last week
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆66 Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- ☆55 Updated last year
- LM engine is a library for pretraining/finetuning LLMs ☆102 Updated this week
- ☆59 Updated last month
- GitHub bot to assist with the taxonomy contribution workflow ☆17 Updated last year
- ☆52 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆111 Updated last year
- ☆31 Updated last year
- InstructLab Community wide collaboration space including contributing, security, code of conduct, etc ☆92 Updated last month
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆71 Updated last year
- A massively multilingual modern encoder language model ☆117 Updated 2 months ago
- Pre-training code for CrystalCoder 7B LLM ☆55 Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR. ☆111 Updated 8 months ago