Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']
☆483Jan 3, 2025Updated last year
Alternatives and similar repositories for AutoScraper
Users that are interested in AutoScraper are comparing it to the libraries listed below
Sorting:
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆160Feb 11, 2025Updated last year
- An LLM-based Web Navigating Agent (KDD'24)☆929Sep 27, 2024Updated last year
- Python scraper based on AI☆22,845Feb 24, 2026Updated last week
- [ICLR 2025] Automated Design of Agentic Systems☆1,527Jan 28, 2025Updated last year
- Large Action Model framework to develop AI Web Agents☆6,311Jan 21, 2025Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Feb 15, 2024Updated 2 years ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆830Feb 3, 2025Updated last year
- ☆16Apr 30, 2024Updated last year
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆18Oct 17, 2025Updated 4 months ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,923Nov 25, 2024Updated last year
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,755Sep 9, 2024Updated last year
- A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-be…☆3,058Apr 24, 2025Updated 10 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆952Nov 5, 2025Updated 4 months ago
- Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"☆35Sep 26, 2023Updated 2 years ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆989Jul 23, 2024Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Cascade Speculative Drafting☆33Apr 2, 2024Updated last year
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆230Aug 28, 2024Updated last year
- Claude API Test Project☆86Apr 26, 2024Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,405Dec 10, 2024Updated last year
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,372Nov 26, 2025Updated 3 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,787Jul 4, 2025Updated 8 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆2,011Dec 22, 2024Updated last year
- AI Engineer website☆10Jun 22, 2023Updated 2 years ago
- Evaluate your LLM's response with Prometheus and GPT4 💯☆1,050Apr 25, 2025Updated 10 months ago
- A server application designed on top of MCP to interact with Cursor and MySQL.☆28Mar 23, 2025Updated 11 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,602Dec 20, 2025Updated 2 months ago
- AIOS: AI Agent Operating System☆5,287Jan 22, 2026Updated last month
- POC integration Airbyte+Dagster+Langchain☆13Jun 1, 2023Updated 2 years ago
- ☆15Oct 4, 2024Updated last year
- ☆21Jul 21, 2025Updated 7 months ago
- converts url content into JSON with a simple prefix☆73May 8, 2024Updated last year
- Turns an Airtable base into a WebGL knowledge graph leveraging relational columns☆34May 4, 2024Updated last year
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 …☆852Jul 6, 2024Updated last year
- auto fine tune of models with synthetic data☆78Feb 14, 2024Updated 2 years ago
- ☆161Apr 17, 2024Updated last year
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆10,096May 8, 2025Updated 10 months ago