Halluminate / WebBenchLinks
📚 Benchmark your browser agent on ~2.5k READ and ACTION based tasks
☆61Updated last month
Alternatives and similar repositories for WebBench
Users that are interested in WebBench are comparing it to the libraries listed below
Sorting:
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆86Updated 2 weeks ago
- 🤖 Headless IDE for AI agents☆201Updated 5 months ago
- ☆104Updated 3 months ago
- A toolkit for building computer use AI agents☆177Updated 3 months ago
- Routing on Random Forest (RoRF)☆206Updated last year
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆41Updated last month
- Build AI Agents with Your Existing Python Code!☆65Updated 11 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated this week
- ☆47Updated last year
- converts url content into JSON with a simple prefix☆71Updated last year
- ☆89Updated 8 months ago
- proof-of-concept of Cursor's Instant Apply feature☆83Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆55Updated 4 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- MarinaBox is a toolkit for creating and managing secure, isolated environments for AI agents☆136Updated 7 months ago
- Open-Source AI-powered web browser. Browse the web with your own LLM API key. Alternative to Dia / Comet.☆77Updated 2 months ago
- ☆32Updated last month
- Embed anything.☆28Updated last year
- A framework for optimizing DSPy programs with RL☆182Updated this week
- ☆113Updated 2 months ago
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆18Updated last year
- Deprecated Browserbase Python SDK☆11Updated 10 months ago
- Anthropic Computer Use with Modal Sandboxes☆37Updated 11 months ago
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 6 months ago
- Metadspy: The framework for specifying—not programming—language models☆88Updated 3 months ago
- The next evolution of Agents☆47Updated this week
- ☆68Updated 4 months ago
- Build Web Datasets with Ease☆33Updated last year
- ☆143Updated 7 months ago
- ☆73Updated 3 weeks ago