Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training
☆58Jul 28, 2025Updated 8 months ago
Alternatives and similar repositories for tbench-agentic-data-pipeline
Users that are interested in tbench-agentic-data-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal☆43Mar 19, 2026Updated last week
- Convert GitHub PRs into Harbor tasks☆50Mar 10, 2026Updated 2 weeks ago
- ☆48Oct 28, 2025Updated 5 months ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆53Feb 23, 2026Updated last month
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆187Feb 17, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆17Apr 11, 2025Updated 11 months ago
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated 2 months ago
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- ☆11Sep 20, 2024Updated last year
- Deploying a custom pytorch model to AWS Sagemaker using terraform and FastAPI☆10Nov 10, 2023Updated 2 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 11 months ago
- A benchmark for LLMs on complicated tasks in the terminal☆1,768Jan 22, 2026Updated 2 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆53Jul 15, 2025Updated 8 months ago
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆29Sep 25, 2025Updated 6 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "☆14Jul 19, 2024Updated last year
- ☆12Nov 9, 2018Updated 7 years ago
- CAN Bus Voltage Dataset for the SIMPLE paper☆11Oct 2, 2019Updated 6 years ago
- ☆18May 20, 2025Updated 10 months ago
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- ☆14Mar 26, 2020Updated 6 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated 11 months ago
- ☆12Mar 3, 2022Updated 4 years ago
- ☆23Dec 25, 2025Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆22Feb 18, 2025Updated last year
- ☆14Dec 1, 2025Updated 3 months ago
- Lightly-reviewed collection of community environments☆219Mar 18, 2026Updated last week
- A lightweight, type-safe workflow engine for TypeScript that helps you create flexible, graph-based execution flows☆26Jun 24, 2025Updated 9 months ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]☆133Mar 21, 2026Updated last week
- CITE: A Corpus of Image-Text Discourse Relations☆13Apr 7, 2019Updated 6 years ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website:☆17Mar 3, 2016Updated 10 years ago
- ☆12Feb 5, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Mar 11, 2025Updated last year
- JIRA Automation Using GPT☆23May 15, 2023Updated 2 years ago
- Exa web search tool for Vercel AI SDK. Add powerful web search tool to your AI applications in just a few lines of code.☆34Updated this week
- Showcase Azure platform’s machine learning capability to recognize document type, extract required fields and push data to downstream app…☆24Apr 27, 2023Updated 2 years ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- Implementation of Materials Discovery with Extreme properties via AI-Driven Combinatorial Chemistry☆10May 8, 2024Updated last year
- bootstrap my zsh shell☆17Updated this week