Synthetic Data for LLM Fine-Tuning
☆121Dec 5, 2023Updated 2 years ago
Alternatives and similar repositories for pluto
Users who are interested in pluto are comparing it to the libraries listed below
- Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline☆836Feb 16, 2026Updated last week
- 360M model running in the browser on WebGPU☆23Aug 20, 2024Updated last year
- MirrorDataGenerator is a Python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆25Jun 22, 2022Updated 3 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated last year
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- This Terraform module provides infrastructure components for deploying Langfuse v3 self-hosted on Amazon Web Services (AWS).☆35Jul 19, 2025Updated 7 months ago
- ☆32Jan 1, 2024Updated 2 years ago
- ☆10Jul 15, 2024Updated last year
- A flat container abstraction for Rust☆16Nov 24, 2025Updated 3 months ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆29Feb 9, 2026Updated 2 weeks ago
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- Aligntune: A Modular Toolkit for Post Training Alignment of LLMs☆33Updated this week
- 🪢 Terraform module to deploy Langfuse on GCP☆27Jan 23, 2026Updated last month
- Chatbot built with API.AI in Salesforce Lightning☆15May 30, 2017Updated 8 years ago
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆17May 7, 2024Updated last year
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆1,095Feb 2, 2025Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 5 months ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- LMTuner: Make the LLM Better for Everyone☆38Sep 21, 2023Updated 2 years ago
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- Pre-training BART model for the Italian Language☆16Dec 28, 2022Updated 3 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,100Feb 16, 2026Updated last week
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆23Apr 26, 2025Updated 10 months ago
- 🪢 Terraform module to deploy Langfuse on Azure☆31Dec 12, 2025Updated 2 months ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆21Jul 24, 2023Updated 2 years ago
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆137Oct 19, 2023Updated 2 years ago
- data cleaning and curation for unstructured text☆329Aug 6, 2024Updated last year
- ☆84Nov 10, 2025Updated 3 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆53Jun 24, 2024Updated last year
- Data and Code for the paper "FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains"☆24Aug 10, 2024Updated last year
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆942Mar 3, 2024Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Mar 19, 2024Updated last year
- 🦄 Use GPT to generate and label data☆25Apr 30, 2024Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated last year
- ☆97Dec 16, 2024Updated last year
- Neuron Activation☆26Nov 21, 2024Updated last year
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Jul 31, 2024Updated last year