Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']
☆483Jan 3, 2025Updated last year
Alternatives and similar repositories for AutoScraper
Users that are interested in AutoScraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An LLM-based Web Navigating Agent (KDD'24)☆934Sep 27, 2024Updated last year
- Python scraper based on AI☆23,249Updated this week
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆842Feb 3, 2025Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆53Feb 27, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Large Action Model framework to develop AI Web Agents☆6,318Jan 21, 2025Updated last year
- [ICLR 2025] Automated Design of Agentic Systems☆1,551Jan 28, 2025Updated last year
- Claude API Test Project☆86Apr 26, 2024Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆988Jul 23, 2024Updated last year
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,925Nov 25, 2024Updated last year
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆972Nov 5, 2025Updated 5 months ago
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,064Apr 24, 2025Updated 11 months ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,761Sep 9, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆229Aug 28, 2024Updated last year
- Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"☆35Sep 26, 2023Updated 2 years ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,824Jul 4, 2025Updated 9 months ago
- [COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?☆22Oct 13, 2024Updated last year
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,031Dec 22, 2024Updated last year
- ☆16Apr 30, 2024Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,404Dec 10, 2024Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯☆1,063Apr 25, 2025Updated 11 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆2,733Apr 2, 2026Updated last week
- ☆12Dec 20, 2024Updated last year
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆4,745Nov 18, 2024Updated last year
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,418Nov 26, 2025Updated 4 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆475Mar 19, 2024Updated 2 years ago
- ☆15Oct 4, 2024Updated last year
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆41,858Apr 2, 2026Updated last week
- An autonomous agent that conducts deep research on any data using any LLM providers☆26,202Mar 14, 2026Updated 3 weeks ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆92Jul 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Cascade Speculative Drafting☆33Apr 2, 2024Updated 2 years ago
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆18Oct 17, 2025Updated 5 months ago
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆850Jul 6, 2024Updated last year
- AIOS: AI Agent Operating System☆5,461Jan 22, 2026Updated 2 months ago
- The first "code-first" agent framework for seamlessly planning and executing data analytics tasks.☆6,135Mar 23, 2026Updated 2 weeks ago
- Structured Outputs☆13,631Mar 26, 2026Updated 2 weeks ago
- The Open Source Memory Layer For Autonomous Agents☆2,574Oct 22, 2024Updated last year