Magnitude achieves SOTA 94% on WebVoyager benchmark
☆37Jul 7, 2025Updated 10 months ago
Alternatives and similar repositories for webvoyager
Users that are interested in webvoyager are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Call BAML functions from Elixir☆56Mar 13, 2026Updated 2 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 10 months ago
- Generate Python docstrings automatically with LLM and syntax trees☆20Jun 13, 2025Updated 11 months ago
- Create and manage isolated Git worktrees for AI coding agents.☆30Mar 3, 2026Updated 2 months ago
- ☆11Jun 11, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of projects and resources using BAML☆17Aug 1, 2025Updated 9 months ago
- NeurIPS 2024: SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation☆13May 24, 2025Updated 11 months ago
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2☆12Jan 30, 2020Updated 6 years ago
- Run evals using LLM☆27Jan 8, 2026Updated 4 months ago
- Turn television drama into storyworld knowledge graphs☆29Apr 19, 2025Updated last year
- Tree-based indexes for neural-search☆33Mar 4, 2024Updated 2 years ago
- A very simple cross-service LLM API for Python☆23Nov 30, 2023Updated 2 years ago
- Enterprise-grade Rust implementation of Anthropic's MCP protocol☆44Updated this week
- Business Data Benchmark (BDB) is a set of real-world questions to evaluate AI systems connected to business data.☆24Dec 3, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A collection of network-related python utilities.☆17Sep 8, 2023Updated 2 years ago
- Install XFCE and Chrome Remote desktop on Ubuntu using golang or shellscript☆21Oct 3, 2025Updated 7 months ago
- An agent implemented using BAML and LangGraph to do a deep research on questions and generate cited answers.☆22May 4, 2025Updated last year
- ☆17Jun 7, 2024Updated last year
- ☆30Jul 1, 2025Updated 10 months ago
- Simple, Non authoritative Benchmarks for embedded databases running in Github Actions☆10Jul 11, 2024Updated last year
- VS Code extension for editing Accord Project artifacts☆15Feb 24, 2023Updated 3 years ago
- Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024☆22Feb 15, 2024Updated 2 years ago
- Code for the paper: Prompts have evil twins (EMNLP 2024)☆24Feb 10, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An implemention of GraphRAG using open source small LLMs☆14Nov 9, 2024Updated last year
- Auto-generate Next.js 14 UI from your Prisma Schema in seconds☆22Updated this week
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆29Sep 5, 2024Updated last year
- A multi-hypervisor VM runtime for OCI images, supporting Cloud Hypervisor, Firecracker, QEMU, and Apple Virtualization.framework.☆92May 12, 2026Updated last week
- ☆45Updated this week
- Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics☆32Feb 10, 2025Updated last year
- Baruch MFE program quant lab☆17Feb 19, 2017Updated 9 years ago
- A torch-based implementation of K-Means and K-Means++☆17Dec 6, 2020Updated 5 years ago
- Production-ready go-mail fork. The best way to send emails in Go.☆17Apr 16, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GameStream client for Android☆35Feb 11, 2025Updated last year
- Official repo of paper LM2☆48Feb 13, 2025Updated last year
- An MCP server for managing `.clinerules` files using shared components and persona templates.☆23Jan 7, 2025Updated last year
- Bringing some SQL to Qdrant☆19Jun 17, 2025Updated 11 months ago
- The core of open telemetry instrumentation is the OpenTelemetry API/SDK. The initial aim of this shard is to implement the OpenTelemetry …☆15Jan 7, 2025Updated last year
- Review-Driven Safe AI Coding☆56Apr 15, 2026Updated last month
- Code for the paper "Trust the PRoC3S: Solving Long-Horizon Robotics Problems with LLMs and Constraint Satisfaction" presented at CoRL 202…☆31Nov 18, 2024Updated last year