EZ-hwh / AutoScraperLinks
Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']
☆470Updated 5 months ago
Alternatives and similar repositories for AutoScraper
Users that are interested in AutoScraper are comparing it to the libraries listed below
Sorting:
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆754Updated 4 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆345Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆464Updated last year
- Implementation of Google's SELF-DISCOVER☆294Updated 10 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆212Updated 3 weeks ago
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api☆1,134Updated 3 weeks ago
- ☆299Updated last year
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆388Updated last year
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆836Updated 2 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,033Updated 4 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆150Updated 4 months ago
- Code for Quiet-STaR☆734Updated 10 months ago
- A simple Python sandbox for helpful LLM data agents☆267Updated last year
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.☆306Updated 2 months ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆315Updated last year
- Official repo for "Make Your LLM Fully Utilize the Context"☆252Updated last year
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆377Updated last year
- Structured information extraction from documents☆315Updated 8 months ago
- Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"☆827Updated last year
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆490Updated last year
- Task-based Agentic Framework using StrictJSON as the core☆452Updated 2 weeks ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆465Updated last week
- HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)☆421Updated 2 weeks ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆239Updated 4 months ago
- Automated Evaluation of RAG Systems☆613Updated 2 months ago
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆264Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯☆952Updated 2 months ago
- ☆608Updated 5 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆425Updated last year
- AWM: Agent Workflow Memory☆275Updated 4 months ago