bespokelabsai / curatorLinks
Synthetic data curation for post-training and structured data extraction
☆1,564Updated 4 months ago
Alternatives and similar repositories for curator
Users that are interested in curator are comparing it to the libraries listed below
Sorting:
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,956Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,141Updated last week
- Recipes to scale inference-time compute of open models☆1,118Updated 6 months ago
- Automatic evals for LLMs☆559Updated 5 months ago
- An Open Source Toolkit For LLM Distillation☆785Updated 4 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,242Updated 2 weeks ago
- A reading list on LLM based Synthetic Data Generation 🔥☆1,465Updated 5 months ago
- Fully open data curation for reasoning models☆2,152Updated 3 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆671Updated 8 months ago
- ☆1,035Updated 11 months ago
- Optimizing inference proxy for LLMs☆3,192Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,367Updated 3 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆412Updated this week
- Code and Data for Tau-Bench☆970Updated 3 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,528Updated 6 months ago
- Tool for generating high quality Synthetic datasets☆1,400Updated last month
- Environments for LLM Reinforcement Learning☆3,573Updated this week
- [COLM 2025] LIMO: Less is More for Reasoning☆1,053Updated 4 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,287Updated last week
- AllenAI's post-training codebase☆3,373Updated this week
- Arena-Hard-Auto: An automatic LLM benchmark.☆963Updated 5 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆597Updated last week
- ☆1,351Updated 2 months ago
- ☆1,141Updated last year
- AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.☆1,081Updated 3 weeks ago
- ☆1,348Updated last year
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆756Updated last week
- Build datasets using natural language☆547Updated 2 months ago
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"☆1,406Updated 9 months ago
- Scalable RL solution for advanced reasoning of language models☆1,779Updated 8 months ago