Synthetic Data for LLM Fine-Tuning
☆125Dec 5, 2023Updated 2 years ago
Alternatives and similar repositories for pluto
Users who are interested in pluto are comparing it to the libraries listed below.
- Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline☆873Updated this week
- This Terraform module provides infrastructure components for deploying Langfuse v3 self-hosted on Amazon Web Services (AWS).☆36Jul 19, 2025Updated 10 months ago
- 360M model running in the browser on WebGPU☆23Aug 20, 2024Updated last year
- Fine-tuning and serving LLMs on any cloud☆91Dec 2, 2023Updated 2 years ago
- A single API for product integrations☆607Jan 17, 2024Updated 2 years ago
- LLM fine-tuning and eval☆346Mar 21, 2024Updated 2 years ago
- CLI to the Cedana Service☆58May 5, 2025Updated last year
- Open source AI Agent evaluation framework for web tasks 🐒🍌☆328Jan 1, 2025Updated last year
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆17May 7, 2024Updated 2 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- ☆29Jul 9, 2024Updated last year
- Forecastbench Datasets, updated nightly☆30Updated this week
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆23Oct 1, 2024Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Nov 12, 2024Updated last year
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆1,114Feb 2, 2025Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆138Oct 19, 2023Updated 2 years ago
- GPT API Cost Estimation for Enterprises☆14Oct 24, 2023Updated 2 years ago
- An easy way to deploy the Langfuse observability platform to Azure Container Apps with Entra authentication.☆59Jul 28, 2025Updated 10 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆942Mar 3, 2024Updated 2 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,240Jun 1, 2026Updated last week
- Evaluation repository of wikipedia index with Dria☆10Mar 14, 2024Updated 2 years ago
- Agent Skills for Langfuse, the open source LLM engineering platform for tracing, prompt management, and evaluation☆152Jun 1, 2026Updated last week
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆12Mar 18, 2023Updated 3 years ago
- Python SDK for FirstBatch: Real-time personalization using vectorDBs☆17Nov 26, 2023Updated 2 years ago
- ☆15Jun 12, 2024Updated last year
- This is a simple guide to help you build an Anthropic Claude Sonnet 3.5 chatbot interface with Gradio☆12Jun 23, 2024Updated last year
- awesome synthetic (text) datasets☆332Jan 8, 2026Updated 5 months ago
- ☆32Jan 1, 2024Updated 2 years ago
- RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week☆28Jul 18, 2021Updated 4 years ago
- Compression for Foundation Models☆36Jul 21, 2025Updated 10 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- A minimal Rust TUI framework☆28Apr 28, 2026Updated last month
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆24Jun 22, 2022Updated 3 years ago
- 🪢 Terraform module to deploy Langfuse on Azure☆32Jun 1, 2026Updated last week
- ☆14Feb 7, 2024Updated 2 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆24Apr 26, 2025Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Dec 23, 2023Updated 2 years ago
- Easily create LLM automation/agent workflows☆60Feb 13, 2024Updated 2 years ago