palewire / storysniffer
Inspect a URL and estimate if it contains a news story
☆39Updated 3 weeks ago
Related projects: ⓘ
- Machine assisted dossiers☆19Updated 6 years ago
- Add website scraping abilities to Datasette☆59Updated last year
- A maximum-strength name parser for record linkage.☆29Updated last month
- Provide partial dates and retain the date precision through processing☆13Updated last year
- How can we improve name matching in screening tools?☆11Updated 5 months ago
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 4 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Extract networks of entities from journalistic reporting☆46Updated last year
- Archive of political ad data from the Federal Communications Commission☆20Updated 6 years ago
- Tools for tracking stories on news homepages☆48Updated 4 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 8 years ago
- Python parser for the Archie Markup Language (ArchieML)☆11Updated 2 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆15Updated last week
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- python utilities for Open Civic Data☆34Updated 4 months ago
- Measure is scripts and conventions to build KPI dashboards for projects.☆17Updated 4 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆22Updated 7 months ago
- Data and experiments with world population densities for comparison to addresses☆13Updated 7 years ago
- Add editing UI and other power-user features to Datasette.☆12Updated last year
- A Python client for parsing SCOTUS cases from the granted/noted and orders dockets. https://pypi.python.org/pypi/nyt-docket☆15Updated 6 years ago
- An ArchieML parser for Python☆10Updated 8 years ago
- Write Datasette canned queries as plain SQL files☆13Updated 2 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- ☆12Updated last year
- Free, open source data science metrics for MailChimp email lists, delivered via an email report☆21Updated last year
- Rig for deploying DocumentCloud viewers to S3.☆13Updated 2 years ago
- Front-end for the MediaCloud database☆16Updated 6 years ago
- Demonstration project for building out a data news rig.☆10Updated 2 years ago
- A tool for telling stories with maps.☆24Updated 2 months ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Updated 6 years ago