kaleguy / scraper-apiLinks
An HTML to JSON API webscraper for ResearchGate, adaptable for other sites
☆19Updated 6 years ago
Alternatives and similar repositories for scraper-api
Users that are interested in scraper-api are comparing it to the libraries listed below
Sorting:
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Get user ids from social network handlers☆12Updated 8 years ago
- modification of bibliotools 2.2 from Sébastian Grauwin☆11Updated 6 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 4 years ago
- Scrapes posts and comments from public Facebook pages.☆108Updated 6 years ago
- Scraper built with Scrapy.☆17Updated 9 months ago
- ProxyCrawl Node library for scraping and crawling☆23Updated last year
- A series of analytics for creating networks from geo-temporal track data based on time/space co-occurrence. Includes UI for visualizatio…☆14Updated 6 years ago
- SmallK: very fast data clustering tools☆14Updated 6 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 7 years ago
- This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 2 years ago
- Actor Network Text Analyser☆56Updated 10 years ago
- ☆25Updated 9 years ago
- Node.js application to extract the knowledge represented in Google infoboxes (aka Google Knowlege Graph Panel)☆26Updated 8 years ago
- Journal scraper definitions for the ContentMine framework☆66Updated 6 years ago
- Alignment, a collaborative, system aided, user driven ontology/vocabulary matching and validation platform.☆12Updated 3 years ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Exploits Wikipedia's daily view counts to find out what topics are current trends☆17Updated 12 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆57Updated 10 months ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Open Access PDF harvester☆40Updated last year
- Data notification service: subscribe to keywords and get notified whenever an open data sources mentions that keyword.☆24Updated 11 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆24Updated 8 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 7 years ago
- Temporal Anomaly Detector (TAD)☆15Updated 7 years ago
- Scrapes sites. Gets news. Eventually events.☆87Updated 9 years ago