Your Company Bench: Long-horizon coherence benchmark in simulated time to test AI agent abilities to manage resources and maximize returns as a tech startup founder
☆82Apr 25, 2026Updated this week
Alternatives and similar repositories for yc-bench
Users that are interested in yc-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- MCP Atlas☆71Apr 21, 2026Updated last week
- 🧜♀️ Pi extension that renders Mermaid diagrams as ASCII in the TUI, with width-aware output and safe handling for larger diagrams.☆50Feb 23, 2026Updated 2 months ago
- walterra's collections of helpers for agentic coding☆32Mar 23, 2026Updated last month
- Transcripts of Democratic Debates as R Package☆10Jun 17, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Jan 8, 2025Updated last year
- ☆24Jun 5, 2025Updated 10 months ago
- 🔧🔌 Prototype for programmatically calling and composing MCP tools☆41Feb 23, 2026Updated 2 months ago
- Hugo theme for documenting One-Day-Only projects☆11Jun 20, 2021Updated 4 years ago
- ☆18Nov 25, 2024Updated last year
- ☆24Aug 26, 2025Updated 8 months ago
- Short course using RStudio for biological data analysis☆14Jul 7, 2022Updated 3 years ago
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆374Aug 24, 2025Updated 8 months ago
- ☆28Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Oct 2, 2024Updated last year
- This repository is a research and educational tool intended to archive any and all available evidence of the decline in Russian military …☆25Updated this week
- Is a simple pytest plugin for testing async python code☆15Feb 12, 2026Updated 2 months ago
- Resources to learn data processing with GPT and other language models☆21Dec 10, 2024Updated last year
- LangChain + llamaCPP + babyAGI implementation☆13Apr 12, 2023Updated 3 years ago
- Reusable components for AI coding agents: skills, subagents, MCP servers, and extensions.☆41Updated this week
- Fluid Language Model Benchmarking☆28Sep 16, 2025Updated 7 months ago
- A TypeScript CLI wrapper for the LangGraph SDK providing command-line access to assistants, threads, and runs with comprehensive config…☆33Aug 18, 2025Updated 8 months ago
- IPython magic for simple, organized, compressed and encrypted: storage & transfer of files between notebooks.☆13Apr 13, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Open-LLM-Leaderboard: Open-Style Question Evaluation. Paper at https://arxiv.org/abs/2406.07545☆51Jun 27, 2024Updated last year
- Prompt Contracts☆48Oct 19, 2025Updated 6 months ago
- Utility functions used in marimo (powered by anywidget)☆34Apr 6, 2026Updated 3 weeks ago
- Example of how to use R in Jupyter notebooks and make compatible with Binder☆17Feb 25, 2019Updated 7 years ago
- Mutable dynamic data structures for R☆18Jul 16, 2025Updated 9 months ago
- ☆16Dec 11, 2017Updated 8 years ago
- AI agent rules: markdown files for Claude.md, ChatGPT, Copilot, Cursor, Windsurf, and more.☆23Feb 2, 2026Updated 2 months ago
- A MATLAB package for multi-modal voxel-wise brain image analysis☆16May 16, 2023Updated 2 years ago
- Script that looks at the YAML metadata in a markdown file and runs pandoc for you.☆12Nov 28, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Exercises and resources for the AI Coding Summit Context Engineering remote workshop!☆27Oct 16, 2025Updated 6 months ago
- A production-grade implementation of an Investment Portfolio Management System created for testing LLM translation of real world legacy a…☆21Oct 30, 2024Updated last year
- ☆23Oct 31, 2025Updated 6 months ago
- SvelteKit (svelte v5) + Tauri V2 + FastAPI Template☆22Jul 23, 2025Updated 9 months ago
- An MCP server that provides real-time football data based on the SoccerDataAPI.☆30May 14, 2025Updated 11 months ago
- Implementation of Recursive Language Model paper from scratch☆43Feb 10, 2026Updated 2 months ago
- A Model Context Protocol (MCP) server that lets your AI interact with Yahoo Finance to get comprehensive stock market data, news, financi…☆44Mar 28, 2026Updated last month