extractus / article-extractor
To extract main article from given URL with Node.js
β1,649Updated last week
Alternatives and similar repositories for article-extractor:
Users that are interested in article-extractor are comparing it to the libraries listed below
- Automatically extract body content (and other cool stuff) from an html documentβ2,154Updated last year
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.β2,402Updated 2 weeks ago
- π Turn any web page into a clean viewβ2,505Updated 3 years ago
- The headless Chrome/Chromium driver on top of Puppeteer.β1,673Updated this week
- Generate EPUB books from HTML with simple API in Node.js.β441Updated last year
- Advanced html to text converterβ1,637Updated last year
- Browser Extension Template with ESbuild builds, support for React, Preact, Typescript, Tailwind, Manifest V3/V2 support and multi browserβ¦β684Updated 9 months ago
- A RSS, Atom and JSON Feed generator for Node.js, making content syndication simple and intuitive! πβ1,217Updated 8 months ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.β343Updated 6 years ago
- API and CLI tool to fetch and query Chome DevTools heap snapshots.β1,353Updated last year
- Metadata scraper with support for oEmbed, Twitter Cards and Open Graph Protocol for Node.jsβ484Updated 10 months ago
- A lightweight RSS parser, for Node and the browserβ1,414Updated 3 months ago
- HTTP client made for scraping based on got.β611Updated last week
- JS port and JS/WASM bindings for openai/tiktokenβ795Updated last week
- A little router.β1,868Updated 3 months ago
- JavaScript Library to extract domains, subdomains and public suffixes from complex URIs.β546Updated this week
- A copy of the original Arc90 repo with links to many of the current ports.β224Updated 7 months ago
- Bree is a Node.js and JavaScript job task scheduler with worker threads, cron, Date, and human syntax. Built for @ladjs, @forwardemail, @β¦β3,112Updated 6 months ago
- 𧩠The cross-browser extension framework.β4,069Updated last month
- Work in progress transmit from Google Codeβ1,114Updated 7 years ago
- RSS-proxy allows you to do create an RSS or ATOM feed of almost any website, just by analyzing just the static HTML structure.β1,801Updated last month
- CSS Selector Generator πΊβ1,394Updated 2 months ago
- libvips for the browser and Node.js, compiled to WebAssembly with Emscripten.β718Updated this week
- π±οΈ Generate human-like mouse movements with puppeteer or on any 2D planeβ1,183Updated last week
- A triple-linked lists based DOM implementation.β1,769Updated last week
- Language detection for Javascript (Node). Based on the CLD2 (Compact Language Detector) library from Google.β321Updated 5 months ago
- π₯πWeb Extension starter to build "Write Once Run on Any Browser" extensionβ2,065Updated last year
- Keyboard shortcuts interface for your website. Working with static HTML, Vanilla JS, Vue, React, Svelte.β1,661Updated 7 months ago
- β‘ The fastest directory crawler & globbing library for NodeJS. Crawls 1m files in < 1sβ1,547Updated last week
- Static low-bandwidth search at scaleβ3,890Updated last week