bespokelabsai / curatorLinks
Synthetic data curation for post-training and structured data extraction
β1,352Updated last week
Alternatives and similar repositories for curator
Users that are interested in curator are comparing it to the libraries listed below
Sorting:
- A reading list on LLM based Synthetic Data Generation π₯β1,280Updated 2 weeks ago
- Verifiers for LLM Reinforcement Learningβ1,019Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,712Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ1,563Updated last week
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.β622Updated 2 months ago
- Optimizing inference proxy for LLMsβ2,427Updated this week
- Recipes to scale inference-time compute of open modelsβ1,073Updated last week
- Automatic evals for LLMsβ399Updated this week
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,120Updated 4 months ago
- procedural reasoning datasetsβ603Updated last week
- Fully open data curation for reasoning modelsβ1,793Updated last week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β1,876Updated this week
- Search-o1: Agentic Search-Enhanced Large Reasoning Modelsβ892Updated 2 weeks ago
- Code and Data for Tau-Benchβ528Updated 4 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ2,356Updated last week
- β1,020Updated 5 months ago
- [ICLR 2025] Automated Design of Agentic Systemsβ1,311Updated 4 months ago
- LIMO: Less is More for Reasoningβ944Updated last month
- Evaluate your LLM's response with Prometheus and GPT4 π―β948Updated last month
- Tool for generating high quality Synthetic datasetsβ878Updated last week
- An Open Source Toolkit For LLM Distillationβ612Updated last month
- Agentlessπ±: an agentless approach to automatically solve software development problemsβ1,699Updated 5 months ago
- Agent Reinforcement Trainer for training multi-turn agents using GRPOβ596Updated last week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,385Updated 4 months ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learningβ873Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddingsβ1,688Updated this week
- π€ Benchmark Large Language Models Reliably On Your Dataβ315Updated this week
- Democratizing Reinforcement Learning for LLMsβ3,291Updated 2 weeks ago
- β1,354Updated 6 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data β¦β705Updated 2 months ago