reworkd / bananalyzer
Open source AI Agent evaluation framework for web tasks 🐒🍌
☆268Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for bananalyzer
- E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use.☆182Updated 2 weeks ago
- LLM fine-tuning and eval☆341Updated 8 months ago
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, …☆377Updated this week
- Automatically reformat any JSON into any schema with AI☆301Updated last month
- The open source AI app collection☆169Updated 9 months ago
- Just Another Coding Bot☆130Updated this week
- ☆190Updated 10 months ago
- ⚡️ Perplexity.ai style LLM response streaming☆140Updated 7 months ago
- ⛓️ build cognitive systems, pythonic☆326Updated this week
- Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers☆177Updated last year
- Prompt engineering, automated.☆246Updated 2 weeks ago
- Action library for AI Agent☆191Updated 2 weeks ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆341Updated 6 months ago
- Python SDK for running evaluations on LLM generated responses☆221Updated this week
- ☆114Updated 5 months ago
- Fine-tuning and serving LLMs on any cloud☆87Updated 11 months ago
- Vision utilities for web interaction agents 👀☆1,450Updated this week
- AutoNode: A Neuro-Graphic Self-Learnable Engine for Cognitive GUI Automation☆272Updated 4 months ago
- Agent accuracy measurements for LLMs☆203Updated 5 months ago
- The easiest, and fastest way to run AI-generated Python code safely☆213Updated last week
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆266Updated last week
- Infrastructure for AI code interpreting that's powering E2B.☆220Updated this week
- Lightrail Monorepo☆225Updated last year
- LLM powered retrieval engine designed to process a ton of sources to collect a comprehensive list of entities.☆320Updated 6 months ago
- Fluid Database☆114Updated 2 months ago
- LLM-ready data connectors☆62Updated 5 months ago
- Enforce structured output from LLMs 100% of the time☆241Updated 4 months ago
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.☆248Updated this week
- Plan-Validate-Solve (PVS) Agent for accurate, reliable and reproducable workflow automation☆314Updated last year
- AI-to-AI Testing | Simulation framework for LLM-based applications☆136Updated last year