ContentMine / quickscrapeLinks
A scraping command line tool for the modern web
☆260Updated 8 years ago
Alternatives and similar repositories for quickscrape
Users that are interested in quickscrape are comparing it to the libraries listed below
Sorting:
- Facilitating the global conversation on academic literature☆266Updated 8 years ago
- View, visualize, clean and process data in the browser.☆147Updated 7 years ago
- Get metadata, fulltexts or fulltext URLs of papers matching a search query☆201Updated 5 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year
- Superfeedr powered pipes!☆131Updated 9 years ago
- Social Feed Manager user interface application.☆156Updated last year
- A novel way of viewing eLife articles.☆378Updated 3 years ago
- track changes to the news, where news is anything with an RSS feed☆179Updated 5 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- Fluxtream Web Application and Core Modules☆152Updated 2 years ago
- Creates github index for similar repositories discovery☆193Updated 9 years ago
- A full-stack publishing solution involving different technologies to power digital archives☆158Updated 5 years ago
- ☆36Updated last year
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur…☆149Updated 10 years ago
- An online annotation platform for teaching and learning in the humanities.☆108Updated last week
- BibServer is open-source software what makes it easy to publish, manage and find bibliographies. BibServer is RESTful and web-friendly.☆126Updated 6 years ago
- Publishing Framework for Large-Scale Data-Rich Interactive Web Pages☆178Updated 3 years ago
- Lens - open science content creation and display☆124Updated 8 years ago
- Websites crawler with built-in exploration and control web interface☆357Updated last month
- Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")☆108Updated 5 months ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated this week
- A Lua custom writer for Pandoc generating JATS XML☆76Updated 7 years ago
- Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML☆37Updated last year
- a NodeJS library for monitoring changes on Wikipedia sites☆70Updated 3 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆171Updated 5 years ago
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched☆261Updated 9 years ago
- Enhanced Social Tagging for Academic Communities☆97Updated 9 months ago
- Run Overview on your own system☆125Updated 4 years ago
- Transform any dataset into an HTTP API with The DataTank☆82Updated 5 years ago
- Convert an XML input to a JSON output, using xml-mapping☆162Updated 8 years ago