A library to extract a publication date from a web page, along with a measure of the accuracy.
☆41Aug 13, 2019Updated 6 years ago
Alternatives and similar repositories for date_guesser
Users that are interested in date_guesser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Find rss, atom, xml, and rdf feeds on webpages☆31Nov 6, 2025Updated 5 months ago
- The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)☆65Dec 14, 2023Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Mar 1, 2023Updated 3 years ago
- Presentation for the NYU Data Lab December 2015☆14Dec 2, 2015Updated 10 years ago
- ☆10Nov 2, 2016Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Flask code to deploy an API that pulls structured data from online news articles☆231Dec 8, 2022Updated 3 years ago
- extract difference between two html pages☆33Updated this week
- R package for turning Ethnic NewsWatch search results into tidyverse-ready dataframes☆11Dec 7, 2021Updated 4 years ago
- Classification of incivility in Reddit posts☆18Nov 19, 2020Updated 5 years ago
- ☆11May 31, 2019Updated 6 years ago
- Read Text Data☆26Oct 25, 2019Updated 6 years ago
- Revealing the Omitted - An Exploration of Media Bias in the news coverage of Obamacare. Employs Selenium and BeautifulSoup to scrape over…☆17Feb 9, 2019Updated 7 years ago
- Solution for the 2nd place in Telegram Data Clustering Contest (https://contest.com/docs/data_clustering2).☆12Nov 19, 2020Updated 5 years ago
- Ultimate Website Sitemap Parser☆247Jan 25, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Minimal working example for a binder with both R and Python Jupyter and RMarkdown notebooks☆31Mar 26, 2019Updated 7 years ago
- Create nice looking output for CFA and SEM analyses using lavaan and semPlot packages☆22Mar 22, 2024Updated 2 years ago
- Corpus of Attribution-Annotated news articles covering the campaigns during the year leading up to the 2016 US Presidential election.☆20Jun 19, 2018Updated 7 years ago
- Social Feed Manager user interface application.☆157Jun 25, 2024Updated last year
- ☆13Apr 11, 2023Updated 3 years ago
- Half-baked idea: Conceptual building blocks for data analysis.☆11May 7, 2015Updated 10 years ago
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- Inspect a URL and estimate if it contains a news story☆39Feb 11, 2026Updated 2 months ago
- Shows how to encrypt data held in public space☆11Aug 11, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pretty-paste source code on Slack☆14Jun 16, 2020Updated 5 years ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆156Jul 18, 2025Updated 8 months ago
- Another next-generation event coding platform.☆77Mar 18, 2019Updated 7 years ago
- This is a python script that work in conjunction with a workflow (you need Workflow for ios) to enable open scripts .py (Only .py for now…☆12Jan 25, 2015Updated 11 years ago
- C++ library to parse WARC files☆11Jan 27, 2019Updated 7 years ago
- Chrome extension to extract schema from the airtable.com/api page☆11Nov 29, 2018Updated 7 years ago
- standalone autoreload script☆19Sep 28, 2015Updated 10 years ago
- Openstack logs - export errors and other usefully modes☆14Feb 27, 2026Updated last month
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆83Feb 27, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- python 3.7 asyncio tutorial.☆14Aug 24, 2019Updated 6 years ago
- ProxyCrawl Python library for scraping and crawling☆58Jul 4, 2023Updated 2 years ago
- A collection of datasets from Skolverket☆11Sep 1, 2020Updated 5 years ago
- Minimal web-based client for NewsBlur.☆19Dec 7, 2014Updated 11 years ago
- Run your Selenium BDD (Behaviour Driven Development) test cases in Docker. Python, Selenium, Behave, Chrome, Docker. Page object mode (PO…☆12Nov 5, 2024Updated last year
- Price Spider is a Python tool to get price & promotion from JD, Tmall, Amazon, BeiBei☆10Jun 14, 2019Updated 6 years ago
- Python tool to monitor RSS feeds and download the linked content.☆15Sep 4, 2017Updated 8 years ago