Use LLMs to robustly extract web data
☆316Apr 8, 2026Updated last month
Alternatives and similar repositories for extractor
Users that are interested in extractor are comparing it to the libraries listed below.
- ☆19Jul 4, 2025Updated 10 months ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 10 months ago
- JavaScript library for "fuzzy" HTML data extraction based on templates☆43Dec 25, 2025Updated 5 months ago
- Generate a map of your codebase to help AI Agents understand your architecture, coding conventions and patterns. Discoverable with Semanti…☆44May 11, 2026Updated 2 weeks ago
- ☆24Jan 22, 2025Updated last year
- Agent Skills Evaluation Framework☆51Apr 21, 2026Updated last month
- A Chrome extension for quick scraping of elements from web pages.☆28Sep 24, 2025Updated 8 months ago
- Handle file uploads to different storage services such as Amazon S3 or Google Cloud. It also supports different types of ORM adapters, l…☆14May 20, 2026Updated last week
- Sample project to demonstrate usage of TS and dependencies☆12May 17, 2022Updated 4 years ago
- A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-…☆41Jul 5, 2025Updated 10 months ago
- Wrapper around ffprobe for getting info about media files.☆16Jan 25, 2023Updated 3 years ago
- Repository for Scarf's documentation website☆10May 18, 2026Updated last week
- A general AI Agent. Inspired by Manus☆56Mar 23, 2025Updated last year
- GoodLinks Exporter☆10Jun 20, 2024Updated last year
- ☆10Jan 23, 2025Updated last year
- Your Interface to Intelligence☆49Apr 23, 2026Updated last month
- Modern TypeScript automation tool for scheduling and executing prompts for AI agents with intelligent usage limit detection. Currently su…☆22Aug 17, 2025Updated 9 months ago
- A Python API to access the MoneyWiz SQLite database.☆13May 12, 2026Updated 2 weeks ago
- "fast" sqlite to parquet and csv converter☆31Nov 5, 2025Updated 6 months ago
- One-way sync files to WebDav (Rust)☆10Nov 24, 2024Updated last year
- Store Terraform state for your GitHub Actions as an encrypted artifact or repository file.☆10Jul 12, 2024Updated last year
- DispatchMail is an open source locally run (though currently using OpenAI for queries) AI-powered email assistant that helps you manage y…☆91Sep 26, 2025Updated 8 months ago
- MCP server for orchestrating Claude Code or Codex sessions via iTerm2 or tmux☆43Apr 25, 2026Updated last month
- Perplexity-style AI answer engine for AI PCs with CPU, GPU, and NPU support☆50Mar 1, 2026Updated 2 months ago
- The open-source software factory — multi-agent fleet management for coding agents☆57Updated this week
- ☆26Apr 21, 2026Updated last month
- Provides a frame iterator for videos by using ffmpeg. Decodes images using the image crate.☆12Mar 31, 2021Updated 5 years ago
- SaaS in a box: A lean boilerplate for rapidly launching your product. Built with next.js+prisma+supabase+stripe+next-auth+shadcn+tailwi…☆28Dec 24, 2024Updated last year
- Offline MapLibre + PMTiles + Valhalla + geocoder in Docker Compose (Monaco demo)☆25Feb 19, 2026Updated 3 months ago
- Multi-model transactional embedded database☆67Dec 10, 2024Updated last year
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …☆10Apr 22, 2026Updated last month
- Use Claude Code with OpenRouter or any other OpenAI-compatible endpoint☆23Jul 23, 2025Updated 10 months ago
- A nice keyboard-oriented homepage, designed by committee^Wspec.☆13Jun 25, 2025Updated 11 months ago
- Historical Language Model for London - A specialized LLM trained on 1500-1850 historical English text☆30Nov 1, 2025Updated 6 months ago
- Astrix Security MCP Secret Wrapper☆49May 8, 2026Updated 3 weeks ago
- The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning☆27Jul 27, 2025Updated 10 months ago
- A Docker Stack deployment for the monitoring suite for Docker Swarm includes (Grafana, Prometheus, cAdvisor, Node exporter and Blackbox p…☆13May 8, 2025Updated last year
- The RAG pipeline for optimizing dynamic data editing.☆21Oct 30, 2025Updated 6 months ago
- Self-hosted audiobook library to podcast RSS☆23Aug 5, 2023Updated 2 years ago