kaleguy / scraper-apiLinks
An HTML to JSON API webscraper for ResearchGate, adaptable for other sites
☆20Updated 6 years ago
Alternatives and similar repositories for scraper-api
Users that are interested in scraper-api are comparing it to the libraries listed below
Sorting:
- Virtual patent marking crawler at iproduct.epfl.ch☆15Updated 8 years ago
 - Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆58Updated last year
 - This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆47Updated 3 years ago
 - The first P2N from scratch version☆24Updated 8 years ago
 - Graph of related videos from YouTube☆111Updated 2 years ago
 - Journal scraper definitions for the ContentMine framework☆67Updated 7 years ago
 - Scrapes posts and comments from public Facebook pages.☆109Updated 6 years ago
 - Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 7 years ago
 - [archived]☆18Updated 4 years ago
 - OSCON Twitter Graph. Loads tweets from Twitter API mentioning OSCON or Neo4j into a Neo4j Graph Database for analysis.☆44Updated 3 years ago
 - Viewers for statistics and dashboarding of Domain Search Engine data☆124Updated 9 years ago
 - Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 4 years ago
 - Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
 - General Architecture for Text Engineering☆49Updated 9 years ago
 - A project that keeps history of trending topics on Twitter.☆36Updated 8 years ago
 - Convert text from PDF to XML.☆45Updated 7 years ago
 - A simple package allowing to use WebGraph data in Python (via the Jython interpreter).☆20Updated 5 years ago
 - A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆192Updated 4 years ago
 - Statistical text analysis and semantic networks with Python☆13Updated 7 years ago
 - Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆99Updated 3 years ago
 - A series of analytics for creating networks from geo-temporal track data based on time/space co-occurrence. Includes UI for visualizatio…☆14Updated 7 years ago
 - Common Crawl Index Server☆70Updated 8 months ago
 - Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 6 years ago
 - Tweets Sentiment Analyzer☆52Updated 13 years ago
 - GHRecommender - personalized recommendations for GitHub projects based on information about repositories starred by the user☆27Updated 2 years ago
 - Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆87Updated 5 years ago
 - Demo of the Newspaper article extraction library.☆29Updated 10 years ago
 - Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆48Updated 3 years ago
 - A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆14Updated 8 months ago
 - Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆79Updated 2 years ago