kaleguy / scraper-api
An HTML-to-JSON API web scraper for ResearchGate, adaptable to other sites
☆19 Updated 5 years ago
Related projects
Alternatives and complementary repositories for scraper-api
- A distributed system for mining common crawl using SQS, AWS-EC2 and S3 ☆14 Updated 10 years ago
- [archived] ☆18 Updated 3 years ago
- A project that keeps history of trending topics on Twitter. ☆34 Updated 7 years ago
- Tools to work with patent files released by Google. ☆19 Updated 11 years ago
- Examples of bad data, especially from government. ☆22 Updated 3 months ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆14 Updated 7 years ago
- Journal scraper definitions for the ContentMine framework ☆66 Updated 6 years ago
- Processing OpenCitations Data ☆17 Updated 7 years ago
- Google Refine extension for adding columns (extending data) from DBpedia ☆39 Updated 11 years ago
- An open source search engine written in C/C++ for Linux on Intel/AMD. From gigablast dot com. See the README.md file below for instructio… ☆23 Updated 6 years ago
- Open Access PDF harvester ☆35 Updated 6 months ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆56 Updated 9 months ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations ☆40 Updated 6 months ago
- GHRecommender - personalized recommendations for GitHub projects based on information about repositories starred by the user ☆26 Updated last year
- A series of analytics for creating networks from geo-temporal track data based on time/space co-occurrence. Includes UI for visualizatio… ☆14 Updated 6 years ago
- A simple web crawler for stackshare.io using Scrapy. ☆9 Updated 5 years ago
- A tool for retrieving articles from PubMed. Can work as a standalone application or in conjunction with the author disambiguation applica… ☆11 Updated last year
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery ☆53 Updated 4 months ago
- Classifier for predicting user interests based on Twitter profile and using Python library scikit-learn. ☆31 Updated 11 years ago
- Automated NLP sentiment predictions - batteries included, or use your own data ☆18 Updated 6 years ago
- Twitter user classification tutorial at PyCon France 2016 ☆21 Updated last year
- Scrapes posts and comments from public Facebook pages. ☆107 Updated 5 years ago
- Self-Service Semantic Suite (S4) ☆17 Updated 8 years ago
- Python module for bibliographic network analysis. ☆84 Updated 4 years ago
- Browser version of Hyphe (WIP) ☆29 Updated last month
- Simple program that summarizes text. ☆10 Updated 14 years ago
- Simple duckduckgo results scraping ☆67 Updated 7 years ago
- Demo of the Newspaper article extraction library. ☆29 Updated 10 years ago