Your Company Bench: Long-horizon coherence benchmark in simulated time to test AI agent abilities to manage resources and maximize returns as a tech startup founder
☆117 · Jun 17, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for yc-bench
Users that are interested in yc-bench are comparing it to the libraries listed below.
- Official Implementation of our ICML 2025 paper: "D-MoLE: Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction … ☆27 · Jan 11, 2026 · Updated 5 months ago
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph ☆18 · Oct 13, 2024 · Updated last year
- MCP Atlas ☆102 · Jun 17, 2026 · Updated 2 weeks ago
- 🎨 Single-file distributable React posters — one .tsx file, every format you'll ever need. Works as a CLI and as a library. ☆69 · May 16, 2026 · Updated last month
- walterra's collections of helpers for agentic coding ☆34 · Mar 23, 2026 · Updated 3 months ago
- (🔥ICML2026) Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios ☆36 · Jan 24, 2026 · Updated 5 months ago
- ☆25 · Jun 5, 2025 · Updated last year
- Hugo theme for documenting One-Day-Only projects ☆11 · Jun 20, 2021 · Updated 5 years ago
- ☆20 · Nov 25, 2024 · Updated last year
- ☆22 · Apr 24, 2025 · Updated last year
- ☆24 · Aug 26, 2025 · Updated 10 months ago
- Short course using RStudio for biological data analysis ☆14 · Jul 7, 2022 · Updated 3 years ago
- CLI-first runtime for Codex, Claude Code, and AI agents to operate CAE solvers via plugins: COMSOL, Abaqus, Ansys. ☆155 · Jun 6, 2026 · Updated 3 weeks ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆22 · Oct 2, 2024 · Updated last year
- A Soul-grounded Minecraft social simulation runtime where Mineflayer actors pursue LifeGoals through evidence-backed action skills and tr… ☆24 · Jun 18, 2026 · Updated 2 weeks ago
- Is a simple pytest plugin for testing async python code ☆15 · Feb 12, 2026 · Updated 4 months ago
- Resources to learn data processing with GPT and other language models ☆21 · Dec 10, 2024 · Updated last year
- Freesurfer Port to R ☆10 · Jun 8, 2026 · Updated 3 weeks ago
- LangChain + llamaCPP + babyAGI implementation ☆13 · Apr 12, 2023 · Updated 3 years ago
- Visualization of WhatsApp chat history data ☆10 · Jan 31, 2016 · Updated 10 years ago
- ☆19 · Jan 24, 2025 · Updated last year
- A TypeScript CLI wrapper for the LangGraph SDK providing command-line access to assistants, threads, and runs with comprehensive config… ☆33 · Aug 18, 2025 · Updated 10 months ago
- Open-LLM-Leaderboard: Open-Style Question Evaluation. Paper at https://arxiv.org/abs/2406.07545 ☆53 · Jun 27, 2024 · Updated 2 years ago
- Fluid Language Model Benchmarking ☆29 · Sep 16, 2025 · Updated 9 months ago
- Utility functions used in marimo (powered by anywidget) ☆37 · May 14, 2026 · Updated last month
- A visual, module-based, gracefully degrading "job expression" generator for OpenFn ☆12 · Oct 5, 2015 · Updated 10 years ago
- An all in one Launchy plugin. ☆16 · Mar 17, 2021 · Updated 5 years ago
- A demo with examples on how to write automated tests (LLM-based tests) for LLM applications. ☆21 · Dec 1, 2023 · Updated 2 years ago
- a library which can be used to create story driven clustered load-testing packages through a very readable and understandable api. ☆30 · May 20, 2010 · Updated 16 years ago
- Mutable dynamic data structures for R ☆18 · Jul 16, 2025 · Updated 11 months ago
- ☆20 · Sep 16, 2025 · Updated 9 months ago
- Various CTF challenge solutions ☆12 · Apr 20, 2021 · Updated 5 years ago
- AI agent rules: markdown files for Claude.md, ChatGPT, Copilot, Cursor, Windsurf, and more. ☆25 · Jun 17, 2026 · Updated 2 weeks ago
- Develop software using AI Agent teams (CrewAI framework) ☆18 · Jul 3, 2025 · Updated last year
- The code and dataset for Boundary Representation Transformer ☆26 · Dec 8, 2025 · Updated 6 months ago
- My presentation on Cyber Grand Challenge and DEFCON 24 CTF at SHLUG monthly meeting ☆13 · Sep 24, 2016 · Updated 9 years ago
- A production-grade implementation of an Investment Portfolio Management System created for testing LLM translation of real world legacy a… ☆27 · Oct 30, 2024 · Updated last year
- Exercises and resources for the AI Coding Summit Context Engineering remote workshop! ☆27 · Oct 16, 2025 · Updated 8 months ago
- Professional desktop app for converting text to audiobooks with local TTS ☆33 · Oct 6, 2025 · Updated 8 months ago