chrisstiles / PublishDateBotLinks
A reddit bot that finds original publish dates on linked articles.
☆10Updated 10 months ago
Alternatives and similar repositories for PublishDateBot
Users that are interested in PublishDateBot are comparing it to the libraries listed below
Sorting:
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Python Pushshift.io API Wrapper (for comment/submission search)☆361Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Grabbing all news.☆62Updated 5 years ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆184Updated 11 months ago
- Download subreddit comments☆96Updated 3 years ago
- Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more☆221Updated last year
- A simple Python wrapper for the archive.is capturing service☆204Updated 8 months ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Python 3 wrapper for the Letterboxd API v0☆60Updated 10 months ago
- Parse government documents into well formed JSON☆73Updated 2 months ago
- A browser extension to share data about your social feed with researchers and journalists to increase transparency.☆86Updated 2 years ago
- Estimating the age of web resources☆96Updated 4 months ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆184Updated last week
- Python Pushshift.io API Wrapper (for comment/submission search)☆14Updated 4 years ago
- 🤖 Making a Reddit Bot using Python, Heroku and Heroku Postgres.☆120Updated 2 years ago
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia☆42Updated last month
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.☆118Updated last year
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Updated 5 years ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆221Updated 2 years ago
- Unreliable News Index (for Columbia Journalism Review)☆56Updated 3 years ago
- A collection of tools for archiving and analysing the internet.☆78Updated 3 years ago
- A Flask webapp & Python scripts for predicting reddit users' political leaning, using their comment history.☆64Updated 2 years ago
- Home of the RECAP Chrome, Safari, and Firefox Extensions☆67Updated 3 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆141Updated 2 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆14Updated 7 months ago
- ☆73Updated last week
- track changes to the news, where news is anything with an RSS feed☆179Updated 5 years ago
- ☆81Updated 6 years ago