Framework for creating high fidelity and complex RL environments and evaluation tasks
☆232May 21, 2026Updated this week
Alternatives and similar repositories for benchflow
Users that are interested in benchflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AgentBudget is the ulimit for AI agents. Just like Unix systems have ulimit to prevent a single process from consuming all system resourc…☆104Apr 5, 2026Updated last month
- The SAIL blog☆13May 11, 2026Updated 2 weeks ago
- ☆10Jun 24, 2024Updated last year
- ☆16Jul 17, 2025Updated 10 months ago
- A Python library for LLM-based evaluation using weighted rubrics.☆63Feb 3, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for paper https://arxiv.org/abs/2501.00522☆15Apr 28, 2025Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆19Updated this week
- A curated list of awesome Harbor ecosystem projects☆37Apr 5, 2026Updated last month
- Packer plugin for Lume☆13Apr 19, 2025Updated last year
- ☆16Apr 8, 2026Updated last month
- DIY simulacra—build and run your own simulation. 🤖🌎☆26Jul 21, 2023Updated 2 years ago
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆41Apr 29, 2024Updated 2 years ago
- AI Chatbot Starter Kit: An open-source, extensible framework for rapidly developing custom AI chatbots with integrations for popular data…☆22Aug 7, 2024Updated last year
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 【Star us and watch this project grow! 🌱⭐️】A Spring Boot-based e-commerce microservices template with comprehensive setup guides. Ideal f…☆21Apr 7, 2026Updated last month
- ClawSync, OpenClaw for the cloud. Deploy an open source personal AI agent with chat UI, skills system, MCP support, and multi-model routi…☆70Feb 11, 2026Updated 3 months ago
- ☆47Apr 11, 2024Updated 2 years ago
- Export contacts from the macOS Contacts app in vCard format to Markdown files with structured data.☆17Jul 8, 2024Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆38Apr 1, 2025Updated last year
- 🧠 A sample app to integrate react-native and open ai☆11Jan 1, 2023Updated 3 years ago
- Stampy's copy of Alignment Research Dataset scraper☆24May 12, 2026Updated 2 weeks ago
- ☆13Jun 21, 2023Updated 2 years ago
- Personal project, Generative AI, Streamlit, Python☆53Apr 30, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A powerful integration that combines Browserbase's Stagehand with Mastra for advanced web automation, scraping, and AI-powered web intera…☆40May 11, 2026Updated 2 weeks ago
- ChatGPT-like Application using RAG pattern that allows to ask question to my own documents - I Used Semantic Kernel to integrate a LLM (…☆13Mar 16, 2024Updated 2 years ago
- Update a binary to its latest version by using the original package manager that was used to install it☆22Aug 31, 2025Updated 8 months ago
- Repository for my studies of Causal Inference☆10Dec 1, 2019Updated 6 years ago
- An enhanced TypeScript SDK for OpenAI API with built-in context management, proxy support, streaming, and enhanced error handling. Includ…☆12Oct 11, 2024Updated last year
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆17Mar 26, 2025Updated last year
- Cybersecurity Ontology (CyberOnto) and Situational Awareness (CyberSA) help teamwork in Cyber Incident Responses, Control, Containment, a…☆10Sep 15, 2022Updated 3 years ago
- ☆27Jan 14, 2025Updated last year
- Excel MCP Server - Manipulate Excel files without Microsoft Excel. Model Context Protocol for XLSX, XLSM with Claude AI integration☆28Jun 18, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- MCP server empowering AI assistants with real-world capabilities: Gmail, Calendar, Tasks, Git integration, and note management. Bridges A…☆12Jun 28, 2025Updated 10 months ago
- Transform your CapsLock into an AI key! This AutoHotkey app puts powerful AI capabilities right at your fingertips, supercharging your Wi…☆21Oct 31, 2025Updated 6 months ago
- Small, simple agent task environments for training and evaluation☆19Nov 1, 2024Updated last year
- ☆285May 18, 2026Updated last week
- All-in-one Web Agent framework for post-training. Start building with a few clicks!☆281Jul 7, 2025Updated 10 months ago
- Patient Intake Form Extraction using llm☆16May 29, 2025Updated 11 months ago
- Jupyter kernel integration for Backend.AI☆10Nov 9, 2018Updated 7 years ago