A curated list of promising Web Data Extractors resources
☆30 · Dec 24, 2019 · Updated 6 years ago
Alternatives and similar repositories for awesome-web-data-extractor
Users that are interested in awesome-web-data-extractor are comparing it to the libraries listed below.
- ☆11 · Feb 26, 2024 · Updated 2 years ago
- A simple shell executor script that kills processes that run for too long ☆23 · Aug 7, 2012 · Updated 13 years ago
- OpenAI ChatGPT Laravel package. Laragpt is the perfect package for developers wanting to access the powerful Artificial Intelligence capa… ☆17 · Jan 15, 2024 · Updated 2 years ago
- European Parliament website Python scraper ☆12 · Oct 19, 2016 · Updated 9 years ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Feb 6, 2023 · Updated 3 years ago
- Template matching ocr using scanlines and templates. Accuracy more than 80%. Need to improve accuracy and small character recognition mor… ☆18 · Oct 6, 2017 · Updated 8 years ago
- experimental wildcard subdomain filtering prototype ☆14 · Aug 5, 2023 · Updated 2 years ago
- Brevo Webhook Manager CLI Tool for Laravel ☆20 · Apr 21, 2026 · Updated last month
- A collection of awesome web crawler, spider in different languages ☆12 · Oct 6, 2017 · Updated 8 years ago
- CVE-2021-40438 exploit PoC with Docker setup. ☆12 · Oct 24, 2021 · Updated 4 years ago
- List of .tr domains, contains 200k live domains. ☆16 · Mar 2, 2023 · Updated 3 years ago
- Awesome list of the software tools related to opendata: data catalogs, ingestion tools, data prep tools and so on ☆36 · Oct 28, 2025 · Updated 7 months ago
- ☆12 · Updated this week
- A Python helper library to convert between ISO 639 two- and three-letter codes. ☆11 · Nov 13, 2024 · Updated last year
- Collections of Tools, Bookmarks, and other guides created to aid in OSINT collection ☆18 · Aug 18, 2021 · Updated 4 years ago
- Manage your Synology Download Station from your terminal ☆10 · Jan 7, 2023 · Updated 3 years ago
- A Node.js REPL with built-in GPT3 completion ☆13 · Feb 18, 2023 · Updated 3 years ago
- ☆12 · Jan 12, 2016 · Updated 10 years ago
- HTTP testing platform for security researchers ☆33 · May 16, 2026 · Updated last week
- Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc. ☆37 · May 8, 2026 · Updated 3 weeks ago
- Tinyproxy eXit gateway to clearweb / yggdrasil, i2p, tor, and also bypass internet blocking in Russia via TOR. ☆26 · Jul 20, 2021 · Updated 4 years ago
- ☆15 · May 13, 2026 · Updated 2 weeks ago
- Unofficial php interface to the Baserow.io API ☆15 · Apr 27, 2023 · Updated 3 years ago
- Datasette plugin for inserting and updating data ☆20 · Mar 29, 2024 · Updated 2 years ago
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading" ☆11 · Oct 25, 2023 · Updated 2 years ago
- Word-level sequence to sequence RNN for translation ☆10 · Apr 7, 2017 · Updated 9 years ago
- ☆16 · Apr 26, 2024 · Updated 2 years ago
- A framework for understanding the capabilities of automated detection methods at identifying classes of application security vulnerabilit… ☆33 · Apr 27, 2026 · Updated last month
- ☆11 · Jul 18, 2022 · Updated 3 years ago
- Internal routines for the gonum project [DEPRECATED] ☆21 · Nov 24, 2018 · Updated 7 years ago
- Render a map for any query with a geometry column ☆29 · Aug 10, 2024 · Updated last year
- Command Line Interface for running 🤗 Transformers Image Classification locally ☆19 · May 14, 2026 · Updated 2 weeks ago
- Make a searchable pdf via Google Cloud Vision OCR ☆14 · Jan 17, 2020 · Updated 6 years ago
- Template repository and README for submissions to Bellingcat's Global Hackathon ☆16 · Oct 7, 2022 · Updated 3 years ago
- Parallel Tar ☆15 · Oct 31, 2019 · Updated 6 years ago
- A framework for creating digital exhibits by loading collection metadata directly from a CSV (such as a published Google Sheet!). See the… ☆14 · Feb 20, 2026 · Updated 3 months ago
- Robotic Arm learns to approach objects using Deep Reinforcement Learning ☆12 · Jun 21, 2023 · Updated 2 years ago
- xhprof composer ☆12 · Apr 6, 2022 · Updated 4 years ago
- Umbrella repository that describes the collections contained in any given release of ELTeC ☆13 · Jan 26, 2022 · Updated 4 years ago