A library to extract a publication date from a web page, along with a measure of the accuracy.
☆41Aug 13, 2019Updated 6 years ago
Alternatives and similar repositories for date_guesser
Users that are interested in date_guesser are comparing it to the libraries listed below
Sorting:
- Find rss, atom, xml, and rdf feeds on webpages☆31Nov 6, 2025Updated 3 months ago
- The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)☆65Dec 14, 2023Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Mar 1, 2023Updated 3 years ago
- Presentation for the NYU Data Lab December 2015☆14Dec 2, 2015Updated 10 years ago
- 🛠 Useful R functions for various things☆18Jul 4, 2019Updated 6 years ago
- R package for turning Ethnic NewsWatch search results into tidyverse-ready dataframes☆11Dec 7, 2021Updated 4 years ago
- Material for CULS frontend course☆18Dec 10, 2019Updated 6 years ago
- Revealing the Omitted - An Exploration of Media Bias in the news coverage of Obamacare. Employs Selenium and BeautifulSoup to scrape over…☆17Feb 9, 2019Updated 7 years ago
- API Wrapper for the mediacloud.org API☆16Aug 20, 2019Updated 6 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆119Aug 10, 2023Updated 2 years ago
- Paper and code for Morey and Lakens (in prep.)☆24Aug 3, 2017Updated 8 years ago
- Corpus of Attribution-Annotated news articles covering the campaigns during the year leading up to the 2016 US Presidential election.☆20Jun 19, 2018Updated 7 years ago
- Inspect a URL and estimate if it contains a news story☆39Feb 11, 2026Updated 3 weeks ago
- Social Feed Manager user interface application.☆157Jun 25, 2024Updated last year
- Read Text Data☆26Oct 25, 2019Updated 6 years ago
- extract difference between two html pages☆32Feb 10, 2026Updated 3 weeks ago
- Synthetic Text Dataset Generation for LLM projects☆56Feb 27, 2026Updated last week
- A classifier that distinguishes political from non-political news articles.☆31Jul 30, 2023Updated 2 years ago
- Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online me…☆284Nov 20, 2023Updated 2 years ago
- Minimal working example for a binder with both R and Python Jupyter and RMarkdown notebooks☆31Mar 26, 2019Updated 6 years ago
- ☆12Apr 23, 2018Updated 7 years ago
- Another next-generation event coding platform.☆77Mar 18, 2019Updated 6 years ago
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆75Aug 14, 2024Updated last year
- Print multiple stm model dashboards to a pdf file for inspection☆41Dec 31, 2019Updated 6 years ago
- A collection of datasets from Skolverket☆11Sep 1, 2020Updated 5 years ago
- Causality in Knowledge Graphs☆11Oct 12, 2022Updated 3 years ago
- ☆10Apr 6, 2023Updated 2 years ago
- Simple and easy-to-use scraper and crawler in Go.☆12May 4, 2020Updated 5 years ago
- Minimalist library for LLM usage☆13Sep 7, 2025Updated 5 months ago
- Ultimate Website Sitemap Parser☆243Jan 25, 2026Updated last month
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆156Jul 18, 2025Updated 7 months ago
- Data pipelines for AI applications☆12Feb 2, 2026Updated last month
- ⚡️Github action for dokku☆14Jan 30, 2020Updated 6 years ago
- automate mailchimp reports using google apps script and google spreadsheets☆10Mar 9, 2014Updated 11 years ago
- ☆11Mar 15, 2017Updated 8 years ago
- A simple maintenance tracking tool for your vehicles.☆12Nov 1, 2025Updated 4 months ago
- Python script to assemble individual Tweets from a public Twitter stream (either Gnip activity-streams format or original Twitter API for…☆12Aug 30, 2016Updated 9 years ago
- Data for SDI detection (SUPP.AI)☆10Sep 13, 2021Updated 4 years ago
- Collaborative Synchronized Corpus Annotation Tool☆11Dec 31, 2018Updated 7 years ago