Your Company Bench: Long-horizon coherence benchmark in simulated time to test AI agent abilities to manage resources and maximize returns as a tech startup founder
☆112Jun 5, 2026Updated last week
Alternatives and similar repositories for yc-bench
Users that are interested in yc-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- MCP Atlas☆95Jun 5, 2026Updated last week
- 🎨 Single-file distributable React posters — one .tsx file, every format you'll ever need. Works as a CLI and as a library.☆67May 16, 2026Updated 3 weeks ago
- Transcripts of Democratic Debates as R Package☆10Jun 17, 2020Updated 5 years ago
- ☆10Jan 8, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- (🔥ICML2026) Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios☆35Jan 24, 2026Updated 4 months ago
- Hugo theme for documenting One-Day-Only projects☆11Jun 20, 2021Updated 4 years ago
- ☆24Aug 26, 2025Updated 9 months ago
- CLI-first runtime for Codex, Claude Code, and AI agents to operate CAE solvers via plugins: COMSOL, Abaqus, Ansys.☆133Jun 6, 2026Updated last week
- ☆28Jun 5, 2026Updated last week
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆391Aug 24, 2025Updated 9 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆22Oct 2, 2024Updated last year
- This repository is a research and educational tool intended to archive any and all available evidence of the decline in Russian military …☆28Updated this week
- A XAI Framework to provide Contrastive Whole-output Explanation for Image Classification.☆10Jul 28, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reusable components for AI coding agents: skills, subagents, MCP servers, and extensions.☆42Updated this week
- A TypeScript CLI wrapper for the LangGraph SDK providing command-line access to assistants, threads, and runs with comprehensive config…☆32Aug 18, 2025Updated 9 months ago
- IPython magic for simple, organized, compressed and encrypted: storage & transfer of files between notebooks.☆13Apr 13, 2026Updated last month
- Download UKB bulk data☆12Jul 27, 2020Updated 5 years ago
- Fluid Language Model Benchmarking☆30Sep 16, 2025Updated 8 months ago
- Prompt Contracts☆48Oct 19, 2025Updated 7 months ago
- Utility functions used in marimo (powered by anywidget)☆35May 14, 2026Updated 3 weeks ago
- A visual, module-based, gracefully degrading "job expression" generator for OpenFn☆12Oct 5, 2015Updated 10 years ago
- An all in one Launchy plugin.☆16Mar 17, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A demo with examples on how to write automated tests (LLM-based tests) for LLM applications.☆21Dec 1, 2023Updated 2 years ago
- Example of how to use R in Jupyter notebooks and make compatible with Binder☆17Feb 25, 2019Updated 7 years ago
- Mutable dynamic data structures for R☆18Jul 16, 2025Updated 10 months ago
- A literal cookbook. Typeset with Pandoc.☆20Apr 1, 2022Updated 4 years ago
- ☆20Sep 16, 2025Updated 8 months ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- AI agent rules: markdown files for Claude.md, ChatGPT, Copilot, Cursor, Windsurf, and more.☆24Feb 2, 2026Updated 4 months ago
- A MATLAB package for multi-modal voxel-wise brain image analysis☆16May 16, 2023Updated 3 years ago
- Exercises and resources for the AI Coding Summit Context Engineering remote workshop!☆27Oct 16, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A production-grade implementation of an Investment Portfolio Management System created for testing LLM translation of real world legacy a…☆26Oct 30, 2024Updated last year
- Professional desktop app for converting text to audiobooks with local TTS☆33Oct 6, 2025Updated 8 months ago
- ☆23Oct 31, 2025Updated 7 months ago
- SvelteKit (svelte v5) + Tauri V2 + FastAPI Template☆23Jul 23, 2025Updated 10 months ago
- WoW mod for Counter-Strike:Source☆14Aug 28, 2017Updated 8 years ago
- This program uses the Yahoo_finance api for python to get basic stock info for a company the user inputs, and then looks at how the compa…☆12Jul 13, 2015Updated 10 years ago
- minc_keras is a code base that was developped during a hackathon to facillitate the implementation of deep learning models for brain ima…☆11Feb 19, 2019Updated 7 years ago