cantino / selectorgadget
Go go CSS / DOM inspection.
☆1,019Updated last year
Related projects
Alternatives and complementary repositories for selectorgadget
- artoo.js - the client-side scraping companion.☆1,102Updated 3 years ago
- Automatically extract body content (and other cool stuff) from an html document☆2,150Updated last year
- Puppeteer (Headless Chrome Node API)-based rendering solution.☆527Updated 2 years ago
- Web data extraction tool implemented as a Chrome extension☆1,312Updated 6 years ago
- benchmark for various JavaScript libraries for generating CSS selectors☆179Updated last year
- JavaScript object that creates a unique CSS selector for a given element.☆537Updated last week
- Get efficient & robust CSS selectors for HTML elements☆255Updated last year
- A pure-python HTML screen-scraping library☆1,863Updated 2 years ago
- HTTP API for Scrapy spiders☆833Updated 4 months ago
- Is headless Chrome currently detectable? Let's pit the detections and detection evasions against each other.☆647Updated 3 years ago
- Extensionizr! Create a chrome extension in 15 seconds!☆1,810Updated 2 years ago
- Distributed crawler powered by Headless Chrome☆5,525Updated last year
- An exercise in unsupervised machine learning: extract an article's text from HTML documents.☆434Updated 8 months ago
- Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)☆501Updated 4 years ago
- Puppeteer (Chrome headless Node API) based web page renderer☆314Updated last month
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs☆1,526Updated 7 years ago
- Distills the DOM☆650Updated 2 years ago
- PhearJS - render dynamic JavaScript webpages to JSON with PhantomJS☆327Updated 7 years ago
- A Chrome extension for writing custom web scraping programs and web automation programs. Just demonstrate how to collect the first row o…☆245Updated 6 months ago
- Distributed crawling infrastructure running on top of serverless computation, cloud storage (such as S3) and sophisticated queues.☆415Updated last year
- Scrapoxy is a super proxy aggregator, allowing you to manage all proxies in one place 🎯, rather than spreading it across multiple scrape…☆2,036Updated 2 weeks ago
- Scrape/crawl articles from any site automatically. Make any web page readable, whether Chinese or English.☆343Updated 6 years ago
- Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.☆101Updated 6 years ago
- Extract embedded metadata from HTML markup☆849Updated this week
- A test suite of common scraper detection techniques. See how detectable your scraper stack is.☆135Updated 2 years ago
- Work in progress transmit from Google Code☆1,109Updated 6 years ago
- A scalable frontier for web crawlers☆1,299Updated last year
- Module that extracts structured information from a rendered html site and outputs JSON. HTML to JSON.☆69Updated 3 years ago
- Reworked https://www.readability.com/ parsing library (https://mercury.postlight.com/ is now the living alternative)☆205Updated 6 months ago
- Lightweight, scriptable browser as a service with an HTTP API☆4,097Updated 3 months ago