bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,290Updated this week
Alternatives and similar repositories for curator:
Users that are interested in curator are comparing it to the libraries listed below
- ☆1,017Updated 4 months ago
- Recipes to scale inference-time compute of open models☆1,066Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆881Updated last month
- A reading list on LLM based Synthetic Data Generation 🔥☆1,255Updated 2 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,671Updated last week
- Automatic evals for LLMs☆376Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,482Updated this week
- An Open Large Reasoning Model for Real-World Solutions☆1,488Updated 2 months ago
- LIMO: Less is More for Reasoning☆927Updated last month
- ☆1,356Updated 5 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆512Updated last month
- Training Large Language Model to Reason in a Continuous Latent Space☆1,094Updated 3 months ago
- Fully open data curation for reasoning models☆1,742Updated 3 weeks ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆1,698Updated this week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆611Updated last month
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"☆1,132Updated 2 months ago
- procedural reasoning datasets☆573Updated this week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆839Updated last month
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆2,127Updated this week
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,656Updated 4 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,299Updated 3 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆281Updated this week
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆808Updated this week
- Optimizing inference proxy for LLMs☆2,201Updated last week
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆425Updated 2 weeks ago
- Everything about the SmolLM2 and SmolVLM family of models☆2,265Updated last month
- [ICLR 2025] Automated Design of Agentic Systems☆1,278Updated 3 months ago
- AllenAI's post-training codebase☆2,939Updated this week
- ☆924Updated 3 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆692Updated 3 weeks ago