palewire / storysniffer
Inspect a URL and estimate if it contains a news story
☆39Updated 5 months ago
Alternatives and similar repositories for storysniffer:
Users that are interested in storysniffer are comparing it to the libraries listed below
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- A maximum-strength name parser for record linkage.☆37Updated this week
- Machine assisted dossiers☆19Updated 7 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated last month
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆16Updated last month
- Add website scraping abilities to Datasette☆62Updated 2 years ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything.☆18Updated 2 years ago
- Add editing UI and other power-user features to Datasette.☆12Updated 2 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Archive of political ad data from the Federal Communications Commission☆20Updated 7 years ago
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- How can we improve name matching in screening tools?☆12Updated 3 months ago
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- Measure is scripts and conventions to build KPI dashboards for projects.☆17Updated 4 years ago
- Just charts. Really.☆22Updated last year
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Linked SDMX☆17Updated 10 years ago
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 9 months ago
- Datasette plugin for executing SQL queries from templates☆10Updated 3 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆24Updated 4 years ago
- An ArchieML parser for Python☆11Updated 9 years ago
- A tool for telling stories with maps.☆27Updated 7 months ago
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state.☆24Updated last year
- Rig for deploying DocumentCloud viewers to S3.☆13Updated 3 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated 2 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Updated 7 years ago
- A browser user interface for manual labeling of record pairs.☆47Updated last year