ContentMine / quickscrape
A scraping command line tool for the modern web
☆259Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for quickscrape
- Journal scraper definitions for the ContentMine framework☆66Updated 6 years ago
- Get metadata, fulltexts or fulltext URLs of papers matching a search query☆197Updated 4 years ago
- View, visualize, clean and process data in the browser.☆148Updated 6 years ago
- BibServer is open-source software what makes it easy to publish, manage and find bibliographies. BibServer is RESTful and web-friendly.☆126Updated 5 years ago
- Facilitating the global conversation on academic literature☆263Updated 7 years ago
- A full-stack publishing solution involving different technologies to power digital archives☆155Updated 4 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆87Updated last year
- Data Pipes for CSV☆117Updated last year
- A queue-controlled browser automation tool for improving web crawl quality☆60Updated 4 years ago
- Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")☆105Updated last year
- A framework for creating web-based knowledge maps☆197Updated this week
- An online annotation platform for teaching and learning in the humanities.☆106Updated 3 weeks ago
- Data Store for Annotation Studio☆46Updated last year
- Enhanced Social Tagging for Academic Communities☆94Updated last month
- Computer assisted video/audio transcription☆97Updated 4 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS☆86Updated 7 years ago
- Visualise Wikipedia page edits using History Flow☆48Updated 7 years ago
- Convert an XML input to a JSON output, using xml-mapping☆161Updated 8 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆123Updated 3 years ago
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur…☆149Updated 9 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆169Updated 4 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js☆55Updated 4 years ago
- Take the hassle out of web scraping☆461Updated last year
- LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance …☆82Updated 6 years ago
- Scholar Ninja - Chrome extension. A distributed open search engine for scholarly content, based on a WebRTC DHT network☆114Updated 8 years ago
- Run Overview on your own system☆123Updated 3 years ago
- A novel way of viewing eLife articles.☆375Updated 2 years ago
- Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML☆36Updated 10 months ago
- A tool for editing CSV & JSON files from your computer & from GitHub.☆48Updated 7 years ago
- Social Feed Manager user interface application.☆153Updated 5 months ago