OpenScraping / openscraping-lib-nodejs
Turn unstructured HTML pages into structured data. The OpenScraping library can extract information from HTML pages using a JSON config file with xPath rules. It can scrape even multi-level complex objects such as tables and forum posts. This is the Node.js version.
☆12Updated 6 years ago
Alternatives and similar repositories for openscraping-lib-nodejs:
Users that are interested in openscraping-lib-nodejs are comparing it to the libraries listed below
- a tale of two applications. one built as a monolith. the same built with microservices☆15Updated 9 years ago
- Pipe community into your browser☆21Updated 7 years ago
- Multilingual DBpedia Spotlight for NodeJS☆13Updated 7 years ago
- Scrapes a remote page and creates a summary with statistics☆38Updated 10 years ago
- Parse e-mail address fields with node.js☆41Updated 8 years ago
- Hosted viewer for documentation.js JSON output.☆34Updated 7 years ago
- convert xml to json on the command line. not streaming, pure javascript☆37Updated 8 years ago
- Finds the "top" domain for a given URL.☆14Updated last year
- Extract meta-data from a html string. It extracts the body, title, meta-tags and first headlines to a object to push them to a search ind…☆13Updated 8 years ago
- Webrtc P2P through regular NodeJS stream☆14Updated 7 years ago
- JSON schema of prisma.yml files☆11Updated 4 years ago
- PWABuilder Core Library☆14Updated 3 years ago
- Files in Markdown.☆17Updated 5 months ago
- Server endpoint for communicating with stanford-ner server☆25Updated 7 years ago
- Natural language time parser for moment.js strings☆13Updated 8 years ago
- Live query. Mirror part of a DB on the client.☆12Updated 8 years ago
- The chainy core + autoloader plugin☆79Updated 8 years ago
- LevelGraph plugin for storing N3/Turtle/RDF data☆36Updated 5 years ago
- A simple DOM wrapper for libxmljs☆13Updated 2 years ago
- Extract rich metadata from URLs☆44Updated 2 years ago
- A streaming, backend agnostic SQL ORM heavily inspired by levelup☆67Updated 9 years ago
- LiveReload 2.x-3.x web socket / http server☆22Updated 8 years ago
- generate rules from lists of words☆16Updated 3 years ago
- Plugin to manage email notifications on records modification in a collection.☆12Updated last week
- Offline-first, PouchDB-powered, Markdown word processor app.☆10Updated 5 years ago
- Python client library for Apple iCloud☆29Updated 13 years ago
- minimal npm installer for phantomjs and slimerjs with zero npm dependencies☆10Updated 9 years ago
- node.js interface to the ConceptNet semantic network API [DEPRECATED; ConceptNet API has changed]☆30Updated 7 years ago
- Loop with setInterval until condition is true.☆12Updated last year
- DEPRECATED - Convert binary data to and from various string/integer representations☆11Updated 7 years ago