Magnitude achieves SOTA 94% on WebVoyager benchmark
☆37Jul 7, 2025Updated 11 months ago
Alternatives and similar repositories for webvoyager
Users that are interested in webvoyager are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 11 months ago
- Generate Python docstrings automatically with LLM and syntax trees☆20Jun 13, 2025Updated 11 months ago
- A DevTools extension for the Grid Engine Phaser 3 plugin☆14Apr 13, 2022Updated 4 years ago
- Run LLMs on Replicate with vLLM☆26Jul 19, 2025Updated 10 months ago
- ☆11Jun 11, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A curated list of projects and resources using BAML☆17Aug 1, 2025Updated 10 months ago
- NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation☆13May 24, 2025Updated last year
- Run evals using LLM☆27Jan 8, 2026Updated 5 months ago
- ☆14Feb 24, 2023Updated 3 years ago
- A lightweight library for Bayesian analysis of LLM evals (ICML 2025 Spotlight Position Paper)☆25May 28, 2025Updated last year
- Oak's AI Projects including our AI Lesson Planning Assistant (Aila)☆29Updated this week
- Chat with Uniswap v3 using natural language, powered by OpenAI Functions☆12Oct 30, 2023Updated 2 years ago
- A boilerplate project using Ionic2 with an Angular2-Meteor application.☆13Sep 12, 2018Updated 7 years ago
- Enterprise-grade Rust implementation of Anthropic's MCP protocol☆45May 16, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Business Data Benchmark (BDB) is a set of real-world questions to evaluate AI systems connected to business data.☆24Dec 3, 2024Updated last year
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- ☆27May 28, 2025Updated last year
- An agent implemented using BAML and LangGraph to do a deep research on questions and generate cited answers.☆23May 4, 2025Updated last year
- ☆17Jun 7, 2024Updated 2 years ago
- ☆30Jul 1, 2025Updated 11 months ago
- Simple, Non authoritative Benchmarks for embedded databases running in Github Actions☆10Jul 11, 2024Updated last year
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024☆22Feb 15, 2024Updated 2 years ago
- Gherkin DSL for Ginkgo☆11Nov 15, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An implemention of GraphRAG using open source small LLMs☆14Nov 9, 2024Updated last year
- std::time, tokio::time, tokio_util::time Replacement for WASM targets.☆23Sep 1, 2025Updated 9 months ago
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆29Sep 5, 2024Updated last year
- A Drawful 2 clone, but with blackjack and hookers. Also it's free and supports up to 20 players.☆10Nov 7, 2017Updated 8 years ago
- A template for how you can use tldraw in a NextJs application using the app router☆42Updated this week
- ☆28May 26, 2021Updated 5 years ago
- Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics☆34Feb 10, 2025Updated last year
- A torch-based implementation of K-Means and K-Means++☆17Dec 6, 2020Updated 5 years ago
- List of programing languages that compile to Go☆56May 9, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Etherscan explorer plugin using EVM-based networks for the Ape Framework☆31Jun 1, 2026Updated last week
- Official repo of paper LM2☆48Feb 13, 2025Updated last year
- An MCP server for managing `.clinerules` files using shared components and persona templates.☆23Jan 7, 2025Updated last year
- Code for the paper "Trust the PRoC3S: Solving Long-Horizon Robotics Problems with LLMs and Constraint Satisfaction" presented at CoRL 202…☆32Nov 18, 2024Updated last year
- Flutter SDK for Subsocial blockchain.☆11Dec 27, 2021Updated 4 years ago
- Embeddable AI voice assistant button built with LiveKit☆77May 29, 2026Updated last week
- streaming, buffered table encoder for result sets (ie from a database)☆22May 30, 2026Updated last week