EZ-hwh / AutoScraper
Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']
☆463Updated 4 months ago
Alternatives and similar repositories for AutoScraper:
Users that are interested in AutoScraper are comparing it to the libraries listed below
- Structured information extraction from documents☆315Updated 7 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆341Updated 10 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆742Updated 3 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆464Updated last year
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆483Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆263Updated last year
- An Awesome list of curated DSPy resources.☆313Updated 2 months ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆369Updated last year
- ☆868Updated 7 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆381Updated last year
- AWM: Agent Workflow Memory☆269Updated 3 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"☆820Updated last month
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆315Updated last year
- An Intelligence Operating System☆324Updated this week
- Task-based Agentic Framework using StrictJSON as the core☆450Updated 2 weeks ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆146Updated 2 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆230Updated 8 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆425Updated 3 weeks ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆201Updated 2 weeks ago
- A simple Python sandbox for helpful LLM data agents☆257Updated 10 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆279Updated 2 weeks ago
- ☆496Updated 8 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆981Updated 3 months ago
- ☆222Updated 5 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆421Updated last year
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆773Updated 2 months ago
- Implementation of Google's SELF-DISCOVER☆295Updated 9 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,400Updated last month
- This repository implements the chain of verification paper by Meta AI☆168Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated 2 weeks ago