A scraping command line tool for the modern web
☆260Sep 23, 2016Updated 9 years ago
Alternatives and similar repositories for quickscrape
Users that are interested in quickscrape are comparing it to the libraries listed below
Sorting:
- Headless scraperJSON scraping for Node.js☆27Sep 14, 2016Updated 9 years ago
- Journal scraper definitions for the ContentMine framework☆67Jul 12, 2018Updated 7 years ago
- Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML☆37Jan 22, 2024Updated 2 years ago
- Get metadata, fulltexts or fulltext URLs of papers matching a search query☆202Jul 15, 2020Updated 5 years ago
- ☆13Jul 2, 2017Updated 8 years ago
- Facilitating the global conversation on academic literature☆268May 21, 2017Updated 8 years ago
- CLI for creating databases for Data Quality Dashboards.☆19Oct 26, 2019Updated 6 years ago
- List of ressources to learn about open data, initially for a class at SciencesPo Paris☆18Sep 13, 2016Updated 9 years ago
- Set up MIT's CLIFF geolocation service with Vagrant☆17May 5, 2015Updated 10 years ago
- Simulate print CSS media using JavaScript☆13Nov 11, 2017Updated 8 years ago
- various tools to download, convert and process the full text of scientific articles☆10Apr 2, 2024Updated last year
- Creation of scientific articles in various output formats (DOCX, ODT, PDF, LATEX, HTML, EPUB) with markdown/ pandoc☆22May 9, 2017Updated 8 years ago
- The One True Open Access Button - cross-compatible extension for research papers and data.☆49Oct 8, 2024Updated last year
- Send a build status to Bitbucket using the Bitbucket Cloud Build Status Notifier (BCBSN).☆10Jan 3, 2023Updated 3 years ago
- A small repo of notes and scripts for collecting data on U.S. deadly force police incidents☆10Aug 9, 2015Updated 10 years ago
- Archive and make discoverable data and links with schema.org metadata.☆38Nov 4, 2014Updated 11 years ago
- Expand tags by rendering local or remote RDF resources, recursively.☆10Dec 8, 2022Updated 3 years ago
- Scraper built with Scrapy.☆18Aug 14, 2024Updated last year
- Control the DOM from Python using Websockets☆12Mar 5, 2018Updated 7 years ago
- Don't let *them* read your mail. Encrypt it now.☆18Jun 13, 2018Updated 7 years ago
- Automatically add links to your WordPress content.☆25Aug 29, 2017Updated 8 years ago
- An implementation of Nextflow.io with Language Workbench Technology. The project helps create computational pipelines that run with the N…☆22Aug 9, 2016Updated 9 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Dec 12, 2016Updated 9 years ago
- 'Git for Tabular Data'☆46Jun 23, 2016Updated 9 years ago
- Manage and load dataprotocols.org Data Packages☆27Sep 17, 2015Updated 10 years ago
- Right to Education Index website☆16Dec 15, 2022Updated 3 years ago
- ☆11Sep 24, 2015Updated 10 years ago
- Algorithms for Finnish open goverment data☆13Updated this week
- MOVED to https://gitlab.com/crossref/rest_api☆17Apr 25, 2022Updated 3 years ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- ECTOR is a learning chatterbot. pyECTOR is its python version.☆13Jun 24, 2018Updated 7 years ago
- Working towards a US State Open Data Census☆11Apr 28, 2015Updated 10 years ago
- JavaScript library for getting geojson from the Wikipedia API☆22Sep 25, 2015Updated 10 years ago
- Auxiliary infrastructure for the Open Science Prize☆10Mar 14, 2017Updated 8 years ago
- API and interface for CSV normalization and linking☆14May 15, 2018Updated 7 years ago
- ☆11Sep 29, 2015Updated 10 years ago
- Infrastructure code to support DNA pipeline☆38May 5, 2015Updated 10 years ago
- Blog crawler for the blogforever project.☆23Jan 31, 2014Updated 12 years ago
- R package for processing and analysing light logger and optical radiation dosimeter data☆18Jan 20, 2026Updated last month