ContentMine / quickscrape
A scraping command line tool for the modern web
☆260 · Updated 7 years ago
Related projects:
- Journal scraper definitions for the ContentMine framework ☆66 · Updated 6 years ago
- Get metadata, fulltexts or fulltext URLs of papers matching a search query ☆197 · Updated 4 years ago
- A full-stack publishing solution involving different technologies to power digital archives ☆154 · Updated 4 years ago
- The scraperJSON standard for defining web scrapers as JSON objects ☆33 · Updated 10 years ago
- Headless scraperJSON scraping for Node.js ☆27 · Updated 8 years ago
- Enhanced Social Tagging for Academic Communities ☆93 · Updated last year
- Facilitating the global conversation on academic literature ☆263 · Updated 7 years ago
- View, visualize, clean and process data in the browser. ☆148 · Updated 6 years ago
- Fluxtream Web Application and Core Modules ☆151 · Updated last year
- A deployable web platform for collaborative conversation, ideation & sense-making. Use it for free at ☆125 · Updated 5 years ago
- Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner") ☆105 · Updated last year
- A novel way of viewing eLife articles. ☆376 · Updated 2 years ago
- LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance … ☆82 · Updated 6 years ago
- Simple text proofreader based on 'write-good' (hemingway-app-like suggestions) and 'nodehun' (spelling). ☆333 · Updated 6 years ago
- An online annotation platform for teaching and learning in the humanities. ☆105 · Updated 5 months ago
- Scraper for downloading the entire ebooks repository of project Gutenberg ☆130 · Updated 2 weeks ago
- A platform for collaborative social media verification ☆55 · Updated 7 years ago
- Websites crawler with built-in exploration and control web interface ☆328 · Updated 2 weeks ago
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched ☆260 · Updated 8 years ago
- Python scripts for interacting with the hypothes.is API ☆49 · Updated 7 years ago
- Extract case law citations with Node ☆55 · Updated 10 years ago
- BibServer is open-source software that makes it easy to publish, manage and find bibliographies. BibServer is RESTful and web-friendly. ☆126 · Updated 5 years ago
- Open source large document set visualization platform ☆269 · Updated last year
- Data Pipes for CSV ☆117 · Updated last year
- Run Overview on your own system ☆124 · Updated 3 years ago
- Client for the Crossref API ☆32 · Updated 3 years ago
- High-level build project for all LAPDF-Text submodules ☆103 · Updated 9 years ago
- Superfeedr powered pipes! ☆131 · Updated 9 years ago
- An application that brings humanities research methods to data visualization. ☆170 · Updated 3 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS ☆86 · Updated 7 years ago