EZ-hwh / AutoScraper
Official implementation of the paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP '24]
☆485 Updated last year
Alternatives and similar repositories for AutoScraper
Users who are interested in AutoScraper are comparing it to the libraries listed below
- Task-based Agentic Framework using StrictJSON as the core ☆460 Updated 2 months ago
- Implementation of Google's SELF-DISCOVER ☆301 Updated last year
- The easiest, and fastest way to run AI-generated Python code safely ☆358 Updated last year
- ☆317 Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult… ☆822 Updated last year
- AgentSearch is a framework for powering search agents and enabling customizable local search. ☆517 Updated last year
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆388 Updated last year
- Structured information extraction from documents ☆318 Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆346 Updated last year
- A simple Python sandbox for helpful LLM data agents ☆305 Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆473 Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav… ☆318 Updated 2 years ago
- Google DeepMind's PromptBreeder for automated prompt engineering implemented in LangChain Expression Language. ☆163 Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon) ☆264 Updated last year
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding ☆418 Updated 2 years ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆446 Updated last year
- HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025) ☆457 Updated 7 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge! ☆467 Updated last year
- An Awesome list of curated DSPy resources. ☆511 Updated last month
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆148 Updated last year
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆158 Updated 11 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆602 Updated 5 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT. ☆819 Updated 6 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆352 Updated 8 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆241 Updated last year
- ☆248 Updated 7 months ago
- ☆415 Updated last year
- Lightweight chat AI platform featuring custom knowledge, open-source LLMs, prompt-engineering, retrieval analysis. Highly customizable. F… ☆218 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆239 Updated 4 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine ☆493 Updated 6 months ago