Opensource benchmark evaluating web operators/agents performance
☆47Apr 11, 2025Updated last year
Alternatives and similar repositories for open-operator-evals
Users that are interested in open-operator-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Feb 23, 2025Updated last year
- Remote Components demo using Next.js App Router apps☆27Dec 9, 2025Updated 4 months ago
- MCP server for searching npm packages☆15Feb 20, 2026Updated last month
- ☆36May 21, 2025Updated 10 months ago
- AI Research Agent is a versatile application that leverages multiple tools to conduct thorough research on any topic.☆12Oct 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Jun 7, 2025Updated 10 months ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆14Mar 1, 2025Updated last year
- Climate Resilience☆15Mar 11, 2025Updated last year
- RWKV-7 mini☆12Mar 29, 2025Updated last year
- ☆15Sep 16, 2025Updated 7 months ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- A web scraper that utilizes OpenAI Functions for easy scraping.☆10May 4, 2024Updated last year
- Source code for the website geminibyexample.com which provides simple Python code examples for the Gemini SDK☆22Apr 8, 2025Updated last year
- ☆21Sep 7, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ApertureDB Python Client☆12Jan 14, 2026Updated 3 months ago
- A2A MCP Server is a lightweight Python bridge that lets Claude Desktop or any MCP client talk to A2A agents. It provides three tools: reg…☆21May 4, 2025Updated 11 months ago
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- Utility functions for python data pipelines with generators.☆22Aug 30, 2024Updated last year
- ☆16Jul 16, 2024Updated last year
- ☆15Mar 6, 2024Updated 2 years ago
- ☆16Nov 18, 2024Updated last year
- ICML2025: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework☆14Jun 24, 2025Updated 9 months ago
- AI Agent Tools library for Graphlit Platform☆20Jan 14, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Create a frontend using bolt.new with CrewAI API☆16Mar 21, 2025Updated last year
- ☆23Jan 5, 2026Updated 3 months ago
- adapt data to and from every format☆28Feb 15, 2026Updated 2 months ago
- ☆48Mar 23, 2026Updated 3 weeks ago
- Qelm - Quantum Enhanced Language Model☆26Updated this week
- The predecessor of CiteLab.☆18Feb 3, 2026Updated 2 months ago
- enchmarking Large Language Models' Resistance to Malicious Code☆15Dec 1, 2024Updated last year
- A framework for building language model applications.☆14Dec 8, 2023Updated 2 years ago
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18Jan 30, 2026Updated 2 months ago
- A language server that handles hover information in package.json files☆20Dec 15, 2025Updated 4 months ago
- A Tavily-based agent generating evaluation datasets for web search RAG systems.☆25Jun 3, 2025Updated 10 months ago
- ☆11May 31, 2019Updated 6 years ago
- Portfolios for Creatives && Open Source Dribbble Alternative☆14Dec 28, 2025Updated 3 months ago
- MCP Server for Coda☆57Mar 31, 2026Updated 2 weeks ago
- Websockify is a WebSocket to TCP proxy/bridge. This allows a browser to connect to any application/server/service. Implementations in Py…☆30Nov 7, 2016Updated 9 years ago