kaleguy / scraper-api
An HTML to JSON API webscraper for ResearchGate, adaptable for other sites
☆19Updated 5 years ago
Related projects: ⓘ
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date☆41Updated 3 years ago
- modification of bibliotools 2.2 from Sébastian Grauwin☆12Updated 5 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated 7 months ago
- An api to parse a CV, in particular the elements of its publication list☆35Updated 6 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆45Updated 2 years ago
- Journal scraper definitions for the ContentMine framework☆66Updated 6 years ago
- Get user ids from social network handlers☆12Updated 7 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a…☆95Updated last year
- A repo that contains outgoing links from DBpedia☆50Updated 4 years ago
- ☆11Updated 3 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 6 years ago
- Matches a category of Google's Taxonomy to product that is described in any kind of text data☆57Updated 6 years ago
- A system to generate SPARQL queries from natural language queries.☆30Updated 6 months ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆46Updated 2 years ago
- Detect twitter bots in your newsfeed.☆26Updated 4 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 7 years ago
- Download DIG to run on your laptop or server.☆101Updated 5 years ago
- Graph NLU is a natural language understanding tool that leverages the power of graph databases☆85Updated 6 years ago
- ☆18Updated this week
- Scrapes sites. Gets news. Eventually events.☆80Updated 8 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 3 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity.☆13Updated 5 years ago
- Processing OpenCitations Data☆17Updated 7 years ago
- The DBpedia DataID vocabulary is a metadata system for detailed descriptions of datasets and their physical instances, as well as their r…☆35Updated last year
- Common Crawl Index Server☆65Updated 8 months ago
- A network graph exploration tool☆63Updated last year
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆29Updated 11 months ago
- Python module for bibliographic network analysis.☆81Updated 3 years ago
- A python client for connecting to all the services provided by https://dandelion.eu☆36Updated last year