apify / actor-page-analyzer
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
☆150Updated 2 years ago
Alternatives and similar repositories for actor-page-analyzer:
Users that are interested in actor-page-analyzer are comparing it to the libraries listed below
- Dashboard is software for creating web apps and SaaS (support @ freenode #userdashboard)☆282Updated 4 years ago
- File-system-based database (in the git repo), with a server attached with users and access control for serving this data. See an example …☆63Updated 2 years ago
- OpenFaaS template for headless Chrome and Puppeteer☆91Updated last year
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- An OPML file with 22 of the top 25 US newspapers RSS feeds☆55Updated 6 years ago
- Rewriting web proxy and archival tool. At this point, it just tries to download all the things.☆202Updated last week
- Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.☆101Updated 6 years ago
- Scrapy rotation proxy package with advanced functions☆95Updated 2 years ago
- An interactive demo walk-through we built to give visitors a feel for what the Trevor.io platform does☆253Updated 5 years ago
- Query CSVs using SQL☆167Updated 5 years ago
- midas is a framework that enables you to enrich your CSV, JSON or Excel dataset with any web API you can think of.☆53Updated 6 years ago
- Notetaking Electron app that can answer your questions and makes summaries for you☆90Updated 2 years ago
- keywords-extract - Command line tool extract keywords from any web page.☆63Updated 6 years ago
- Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity☆351Updated 2 months ago
- Test your HN title against a neural network☆184Updated 4 years ago
- Demo of how to use self-host analytics.js☆26Updated last year
- Twitter AI Platform☆93Updated 7 years ago
- Automatically extracts structured information from webpages☆108Updated 2 years ago
- Sketch business models in your browser☆248Updated 2 months ago
- House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.☆120Updated last year
- JavaScript Library for Google Sheets/Microsoft Excel Online through sheet2api. https://sheet2api.com/☆92Updated 2 years ago
- unformatted text > parse/clean it > get relevant info☆52Updated 6 years ago
- Sheetsu Web Client☆177Updated 7 years ago
- Remote client for distributed automated HTTP(s) content fetching.☆77Updated last week
- A hacky node.js ad-hoc throw-away address mail forwarder.☆38Updated 6 years ago
- Offline bookmarking DB that syncs with Kozmos servers☆39Updated 5 years ago
- A web-based speech-to-code editor for humans.☆134Updated 2 years ago
- 📮 Dialogflow + Sendgrid = AI Mailbox☆35Updated 4 years ago
- Generate charts easily through a simple REST-like API☆35Updated 2 years ago
- A command-line tool to crawl websites using puppeteer.☆104Updated 2 years ago