apify / actor-page-analyzer
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
☆150 Updated 2 years ago
Alternatives and similar repositories for actor-page-analyzer
Users who are interested in actor-page-analyzer are comparing it to the libraries listed below.
- OpenFaaS template for headless Chrome and Puppeteer ☆91 Updated last year
- File-system-based database (in the git repo), with a server attached with users and access control for serving this data. See an example … ☆63 Updated 2 years ago
- Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages. ☆103 Updated 6 years ago
- An OPML file with 22 of the top 25 US newspapers' RSS feeds ☆55 Updated 6 years ago
- Tool for real-time scraping of news articles. ☆39 Updated 5 years ago
- Twitter AI Platform ☆93 Updated 7 years ago
- midas is a framework that enables you to enrich your CSV, JSON or Excel dataset with any web API you can think of. ☆53 Updated 6 years ago
- An interactive demo walk-through we built to give visitors a feel for what the Trevor.io platform does ☆253 Updated 5 years ago
- Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity ☆352 Updated 3 months ago
- Track clicks and other client-side events on web pages ☆225 Updated 7 years ago
- keywords-extract - Command-line tool to extract keywords from any web page. ☆63 Updated 6 years ago
- Dashboard is software for creating web apps and SaaS (support @ freenode #userdashboard) ☆282 Updated 4 years ago
- Rewriting web proxy and archival tool. At this point, it just tries to download all the things. ☆202 Updated last week
- Type with your voice on Mac/Windows/Linux using electronjs and Google Chrome ☆41 Updated 4 years ago
- unformatted text > parse/clean it > get relevant info ☆52 Updated 6 years ago
- 📮 Dialogflow + Sendgrid = AI Mailbox ☆35 Updated 4 years ago
- Scrapy rotation proxy package with advanced functions ☆95 Updated 2 years ago
- get more done with fewer keystrokes ☆57 Updated 2 years ago
- A simple script that will download the favorites for the provided Hacker News user ID ☆69 Updated 2 years ago
- Cloud crawler functions for scrapeulous ☆45 Updated 4 years ago
- Chrome Extension to accompany the memex. ☆44 Updated 4 years ago
- Sheetsu Web Client ☆177 Updated 7 years ago
- Backup of seventag when it was still open source ☆42 Updated 6 years ago
- Simple JSON based geolocation API, powered by Google App Engine. ☆106 Updated 12 years ago
- Get text alerts when products become available on Amazon. ☆169 Updated 2 years ago
- Sup.js saves URL parameters and inputs them into any form submitted during a visitor's session on your site. ☆86 Updated 2 years ago
- A Tail Story ☆15 Updated 2 years ago
- Set up the CTRL text-generating model on Google Compute Engine with just a few console commands. ☆150 Updated 5 years ago
- Export your Hacker News saved links to JSON or CSV from the Chrome console. ☆51 Updated 8 years ago
- SRT Subtitles Merger ☆95 Updated 9 months ago