bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆947Updated this week
Alternatives and similar repositories for curator:
Users that are interested in curator are comparing it to the libraries listed below
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi …☆2,546Updated this week
- Recipes to scale inference-time compute of open models☆1,035Updated 2 weeks ago
- Automatic evals for LLMs☆319Updated this week
- ☆1,008Updated 2 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆694Updated last week
- An Open Source Toolkit For LLM Distillation☆531Updated 2 months ago
- Training Large Language Model to Reason in a Continuous Latent Space☆952Updated last month
- A reading list on LLM based Synthetic Data Generation 🔥☆1,190Updated 3 weeks ago
- OLMoE: Open Mixture-of-Experts Language Models☆666Updated 2 months ago
- ☆826Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,279Updated this week
- Build datasets using natural language☆423Updated last week
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,552Updated 2 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆422Updated 5 months ago
- System 2 Reasoning Link Collection☆804Updated last month
- Fast State-of-the-Art Static Embeddings☆1,092Updated last week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆391Updated this week
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆648Updated last month
- An Open Large Reasoning Model for Real-World Solutions☆1,472Updated last week
- Optimizing inference proxy for LLMs☆2,091Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆879Updated 2 months ago
- ☆1,340Updated 3 months ago