ahkimkoo / node-article-extractorLinks
Automatically extract body content (and other cool stuff) from an html document. based on https://github.com/ageitgey/node-unfluff, but support Chinese.
☆17Updated 4 years ago
Alternatives and similar repositories for node-article-extractor
Users that are interested in node-article-extractor are comparing it to the libraries listed below
Sorting:
- Automatically extract body content (and other cool stuff) from an html document☆2,161Updated 2 years ago
- Use puppeteer to test and control your electron application.☆356Updated 2 years ago
- Generate EPUB books from HTML with simple API in Node.js.☆454Updated 2 years ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,688Updated 3 years ago
- Get all urls in a string☆371Updated 2 years ago
- Node.js global keyboard and mouse listener.☆1,268Updated 9 months ago
- plugin to extract keywords and key-phrases☆337Updated last year
- NPM package for creating a keyword array from a string and excluding stop words.☆200Updated last year
- natural language processor powered by plugins part of the @unifiedjs collective☆2,420Updated 10 months ago
- Part-of-speech utilities for node.js based on the WordNet database.☆476Updated 2 years ago
- Download website to local directory (including all css, images, js, etc.)☆1,657Updated last week
- Read data from a Word document using node.js☆148Updated last year
- Advanced html to text converter☆1,679Updated 2 years ago
- A persistent, network resilient, full text search library for the browser and Node.js☆1,420Updated 8 months ago
- Node module that summarizes text using a naive summarization algorithm☆770Updated last year
- Read and modify exif in client-side or server-side JavaScript.☆609Updated 4 years ago
- use axios through tor network☆29Updated 2 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆346Updated 7 years ago
- A lightweight RSS parser, for Node and the browser☆1,489Updated 3 months ago
- Easy website screenshots in Node.js☆2,119Updated 6 years ago
- Node.js program that takes screenshots at smooth intervals of web pages with JavaScript animations☆240Updated last year
- Better file system API for Node.js☆780Updated last year
- Robust RSS, Atom, and RDF feed parsing in Node.js☆1,979Updated 2 years ago
- The JavaScript Database, for Node.js, nw.js, electron and the browser☆416Updated 5 months ago
- A tiny, full-featured, flexible client / server library for the Twitter API☆788Updated 2 years ago
- Puppeteer(Chrome headless node API) based web page renderer☆329Updated last week
- Get active window title in Node.js.☆160Updated 6 years ago
- Simple node.js utility to create video slideshows from images with optional audio and visual effects using ffmpeg☆898Updated 3 weeks ago
- RSS feed generator for Node.☆1,040Updated 3 weeks ago
- JavaScript HTML to JSON Parser☆942Updated 8 months ago