Module that extracts structured information from a rendered HTML site and outputs it as JSON.
☆70 · Jun 8, 2021 · Updated 4 years ago
Alternatives and similar repositories for struktur
Users that are interested in struktur are comparing it to the libraries listed below.
- Distributed crawling infrastructure running on top of serverless computation, cloud storage (such as S3) and sophisticated queues. ☆439 · Dec 30, 2022 · Updated 3 years ago
- ☆116 · Mar 16, 2024 · Updated 2 years ago
- Web Page Inspection Tool UI. Article Summary, Sentiment Analysis, Keyword Extraction, Named Entity Recognition & Spell Check ☆24 · Sep 29, 2025 · Updated 6 months ago
- 🧱 A uniform template to use as a foundation for Puppeteer bot construction. ☆69 · May 6, 2021 · Updated 4 years ago
- ☆12 · May 7, 2023 · Updated 2 years ago
- A Node.js implementation of NUID ☆14 · Mar 25, 2026 · Updated 3 weeks ago
- Fast extraction of all external links from wikipedia ☆13 · Sep 22, 2018 · Updated 7 years ago
- Statistical WHOIS parser ☆10 · Apr 17, 2017 · Updated 9 years ago
- The chrome browser controlled via puppeteer does not support switching proxies without restarting the browser. In this tutorial I show ho… ☆12 · Dec 20, 2020 · Updated 5 years ago
- Scraping workshop ☆16 · Nov 21, 2016 · Updated 9 years ago
- Colorize all the photos in a directory ☆15 · May 26, 2021 · Updated 4 years ago
- Run Chrome from the Terminal ☆18 · Aug 9, 2024 · Updated last year
- 📡 expose browser devtools port publicly with TLS and authentication. ☆18 · Sep 10, 2024 · Updated last year
- Lightweight JavaScript library to interact with Chromium-based browsers via the Chrome DevTools Protocol ☆27 · May 12, 2024 · Updated last year
- Install and configure selinux and its required libraries on your system. ☆11 · Mar 18, 2026 · Updated last month
- Simple tool to test service worker fingerprint leaks in headless browser with puppeteer-extra-plugin-stealth ☆19 · Dec 19, 2025 · Updated 4 months ago
- A modernised version of the SIPml5 WebRTC library ☆11 · Nov 20, 2024 · Updated last year
- A single tab web browser built with puppeteer. Also, no client-side JS. Viewport is streamed with MJPEG. For realz. ☆64 · Jul 23, 2023 · Updated 2 years ago
- NodeJS library without any external dependencies to check if free HTTP/SOCKS4/SOCKS5 proxies are working/up ☆27 · Apr 10, 2022 · Updated 4 years ago
- Google Chrome release and version info as JSON (self updating) ☆59 · Updated this week
- ☆30 · Jun 10, 2024 · Updated last year
- Small google play api wrapper in go. ☆14 · Nov 25, 2022 · Updated 3 years ago
- How to detect puppeteer with 100% accuracy ☆109 · May 30, 2021 · Updated 4 years ago
- In-Memory Key-Value Database with Persistent File Storage ☆16 · Sep 24, 2022 · Updated 3 years ago
- 🛡🎭 A conceptual patch which modifies some vanilla puppeteer files to decrease detection rates. ☆56 · Mar 6, 2021 · Updated 5 years ago
- Undetected version of the main playwright implementation (NodeJS) ☆26 · Jan 1, 2024 · Updated 2 years ago
- Node.js implementation of a proxy server (think Squid) with support for SSL, HTTP/HTTPS, SOCKS5, authentication, and upstream proxy chain… ☆989 · Feb 17, 2026 · Updated 2 months ago
- ☆17 · Sep 27, 2022 · Updated 3 years ago
- Auto generate cool code based clothing [WIP] ☆11 · Sep 30, 2019 · Updated 6 years ago
- 😈📚 A curated library of research papers and presentations for counter-detection and web privacy enthusiasts. ☆737 · Feb 19, 2024 · Updated 2 years ago
- Article content extraction database ☆40 · Mar 1, 2023 · Updated 3 years ago
- Automatically extracts structured information from webpages ☆111 · Jun 23, 2022 · Updated 3 years ago
- Matrix bot to act as a middleman ☆27 · Jan 28, 2025 · Updated last year
- Simple boilerplate with fastify, knex and graphql ☆12 · Jan 11, 2023 · Updated 3 years ago
- CSV grooming, the JS way ☆21 · Jul 8, 2019 · Updated 6 years ago
- Subscriber Only is a Jekyll plugin that, with a Ruby gem and a couple lines of YAML, enables subscription management, payment processing … ☆13 · Aug 11, 2023 · Updated 2 years ago
- A web-based editor for making mini svg-like graphics for your js13k entry ☆13 · Sep 2, 2018 · Updated 7 years ago
- Passive TCP/IP Fingerprinting Tool. Run this on your server and find out what Operating Systems your clients are *really* using. ☆412 · Mar 7, 2026 · Updated last month
- [PHP] Lightweight proxy with full support for sessions, cookies, POST/FORM submissions, and URL rewriting. The proxy offers two methods o… ☆20 · Aug 26, 2024 · Updated last year