ContentMine / quickscrape
A scraping command line tool for the modern web
☆260 Updated 8 years ago
Alternatives and similar repositories for quickscrape:
Users interested in quickscrape often compare it to the repositories listed below.
- Journal scraper definitions for the ContentMine framework ☆66 Updated 6 years ago
- Get metadata, fulltexts or fulltext URLs of papers matching a search query ☆197 Updated 4 years ago
- Python scripts for interacting with the hypothes.is API ☆48 Updated 7 years ago
- An online annotation platform for teaching and learning in the humanities. ☆107 Updated last week
- Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML ☆37 Updated last year
- Highlight and select phrases in HTML pages. ☆24 Updated 5 years ago
- View, visualize, clean and process data in the browser. ☆148 Updated 6 years ago
- Create a git repository from the revision history of a document in Google Drive. ☆134 Updated 7 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head ☆171 Updated 4 years ago
- A simple OpenRefine reconciliation service that runs on top of a CSV file ☆120 Updated 9 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with Twitter. ☆86 Updated last year
- Track changes to the news, where news is anything with an RSS feed ☆178 Updated 4 years ago
- Websites crawler with built-in exploration and control web interface ☆340 Updated 3 weeks ago
- Social Feed Manager user interface application. ☆155 Updated 7 months ago
- Palladio Application ☆40 Updated 3 years ago
- Publishing Framework for Large-Scale Data-Rich Interactive Web Pages ☆176 Updated 3 years ago
- Explore networks and publish narratives. ☆52 Updated 4 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS ☆86 Updated 7 years ago
- Proof of concept to dynamically generate RESTful APIs from static CSVs ☆334 Updated 7 years ago
- Viewers for statistics and dashboarding of Domain Search Engine data ☆122 Updated 9 years ago
- One trick pony NLP library for extracting keywords from HTML documents ☆18 Updated 9 years ago
- Command-line interface for After the Deadline language checker ☆105 Updated 5 years ago
- A place to collect and share knowledge about liberating data from PDFs ☆54 Updated 3 years ago
- A WordPress plugin for aggregating data via the hypothes.is API. ☆26 Updated 6 years ago
- Browser version of Hyphe (WIP) ☆30 Updated 4 months ago
- A deployable web platform for collaborative conversation, ideation & sense-making. Use it for free at ☆129 Updated 6 years ago
- A tool that enables you to access, query, and publish web APIs without programming. ☆21 Updated last year
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur… ☆149 Updated 9 years ago
- Lightweight note management system ☆20 Updated 2 years ago
- Open source large document set visualization platform ☆268 Updated 2 years ago