indix / web-auto-extractor
Automatically extracts structured information from webpages
☆107 · Updated 2 years ago
Alternatives and similar repositories for web-auto-extractor:
Users who are interested in web-auto-extractor are comparing it to the libraries listed below.
- A `htmlparser2` handler for parsing rich metadata from HTML. Includes HTML metadata, JSON-LD, RDFa, microdata, OEmbed, Twitter cards and … ☆52 · Updated last year
- Scrape & parse a webpage to return a JSON with found microdata (schema.org) ☆43 · Updated 7 years ago
- A suite of modules for text analysis, including simple analysis, nGrams, and TFIDF analysis ☆49 · Updated 3 years ago
- Freeform Street Address Parser ☆95 · Updated last year
- NodeJS bindings to libpostal for fast international address parsing/normalization ☆226 · Updated 3 weeks ago
- Friendly web crawler for x-ray ☆44 · Updated 2 years ago
- Multilingual tokenizer that automatically tags each token with its type ☆61 · Updated last year
- Language agnostic named entity recognizer ☆39 · Updated last year
- Parser for robots.txt for node.js ☆67 · Updated 3 years ago
- sandcrawler.js - the server-side scraping companion. ☆107 · Updated 9 years ago
- Deprecated plugin to detect sentiment: use `words/polarity` ☆97 · Updated 3 months ago
- Cheerio based microdata parser ☆57 · Updated 3 years ago
- Vanilla JavaScript implementation of the Weighted PageRank Algorithm ☆34 · Updated 5 years ago
- A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm. ☆102 · Updated last year
- Higher level client for Elasticsearch written in Node.js oriented on facets and simplicity ☆20 · Updated last week
- NodeJS Named Entity Recognition, using Stanford NER (easy install) ☆40 · Updated 7 years ago
- Helps to extract shortest optimal css-selector and multi-selector. ☆26 · Updated 7 years ago
- English NLP for Node.js and the browser. ☆89 · Updated last year
- Node wrapper around FastText Library ☆57 · Updated last year
- Article content extraction database ☆40 · Updated last year
- Node library to extract keywords from text ☆58 · Updated 9 years ago
- A JS Library that compares two DOM Nodes and outputs what changed between the two. ☆154 · Updated 8 years ago
- text mining utilities for Node.js ☆141 · Updated 2 years ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript ☆53 · Updated 7 years ago
- ☆21 · Updated 7 years ago
- Nodejs text summarization ☆55 · Updated 11 years ago
- Lets you manage multiple channels of keywords on the same twitter stream ☆70 · Updated 6 years ago
- MetaData html scraper and parser for Node.js (supports Promises and callback style) ☆169 · Updated last week
- A nodejs Scraping Utility for lazy people. MIT Licensed ☆44 · Updated 2 years ago
- A node.js module to help identify browser sessions ☆59 · Updated 2 weeks ago