hboisgibault / unicontent
Python module to extract structured metadata from URL, ISBN or DOI
☆12 Updated 6 years ago
Related projects
Alternatives and complementary repositories for unicontent
- A financial disclosure data extraction tool. ☆13 Updated last year
- A browser extension providing Open Access bibliographical services ☆14 Updated last year
- Functions for analysing public patenting data. ☆15 Updated 6 years ago
- Getting, analysing and displaying lists of papers ☆13 Updated last month
- A tool to extract canonical references from text. ☆20 Updated 3 years ago
- Wrapper for the Crossref events API ☆17 Updated last year
- This repository contains simple code in Python to help historians prepare data for quantitative analysis & visualization. Visit the follo… ☆27 Updated 9 months ago
- An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark. ☆9 Updated last year
- Get the scholarly citation for any research product: software, preprint, paper, or dataset ☆70 Updated last year
- A classifier that distinguishes political from non-political news articles. ☆29 Updated last year
- Presentations on Quantified Self and Self-Tracking with Python ☆29 Updated last year
- Open Access PDF harvester ☆35 Updated 6 months ago
- A helper library full of URL-related heuristics. ☆64 Updated last month
- Command-line and Python API to download PDFs directly from Sci-Hub ☆12 Updated 9 months ago
- Ask questions about government data. ☆37 Updated 5 years ago
- Service for creating Twitter datasets for research and archiving. ☆26 Updated last year
- A scraper focused on organizational GitHub accounts and their members. ☆40 Updated 2 years ago
- how hard is it to get a list of all local news sites in the United States (LOL) ☆8 Updated 4 years ago
- Find RSS, Atom, XML, and RDF feeds on webpages ☆30 Updated last month
- Examples for getting started using https://case.law ☆64 Updated 2 years ago
- Scrape various open data directories to create an index of what's available out there ☆31 Updated this week
- Scrapers for US municipal governments. ☆10 Updated last year
- A list of over 5000 US news domains and their social media accounts ☆41 Updated last year
- A scraping Master-slave system based on Google App Engine ☆11 Updated 4 years ago
- Sidewall is a Python library for interacting with the Dimensions search API. ☆17 Updated 2 months ago
- Datasette plugin for authenticating access using API tokens ☆12 Updated 2 months ago
- Named-Entity Recognition extension for OpenRefine ☆24 Updated last year
- A Twitter data collection and appraisal application. ☆50 Updated last year
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆13 Updated 3 years ago
- CLI implementation of httpreserve that can test links and retrieve internet archive replacements ☆10 Updated last week