chrisstiles / PublishDateBotLinks
A reddit bot that finds original publish dates on linked articles.
☆10Updated last year
Alternatives and similar repositories for PublishDateBot
Users that are interested in PublishDateBot are comparing it to the libraries listed below
Sorting:
- Python Pushshift.io API Wrapper (for comment/submission search)☆363Updated 2 years ago
- A browser extension to share data about your social feed with researchers and journalists to increase transparency.☆86Updated 2 years ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆188Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆142Updated 2 months ago
- Parse government documents into well formed JSON☆75Updated this week
- A simple Python wrapper for the archive.is capturing service☆209Updated 10 months ago
- A Python library that provides an api to search and get links from Books,Magazines,Comics,... from Library Genesis.☆123Updated 3 years ago
- ☆79Updated 7 years ago
- A Flask webapp & Python scripts for predicting reddit users' political leaning, using their comment history.☆63Updated 2 years ago
- 🤖 Making a Reddit Bot using Python, Heroku and Heroku Postgres.☆121Updated 2 years ago
- Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more☆224Updated 2 years ago
- Python 3 wrapper for the Letterboxd API v0☆60Updated last year
- A helper library full of URL-related heuristics.☆73Updated 3 months ago
- Miscellaneous scripts to gather and process data of wikis.☆20Updated 2 years ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆220Updated 2 years ago
- Estimating the age of web resources☆97Updated 7 months ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆195Updated last week
- Reddit takeout: export your account data as JSON: comments, submissions, upvotes etc. 🦖☆180Updated 5 months ago
- Utilize your personal data like Google!☆161Updated 2 years ago
- An on-going dataset consisting of hashtags, n-gram counts and other misc NLP things for covid-19 analysis, stemming from over 100 000 000…☆58Updated 3 years ago
- Convert Wikipedia database dumps into plaintext files☆326Updated 4 years ago
- Grabbing all news.☆61Updated 6 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆18Updated 2 years ago
- A set of utilities for processing MediaWiki XML dump data.☆61Updated 10 months ago
- Scraping assistant tool. Editing and maintaining CSS/XPath selectors across webpages.☆124Updated 7 years ago
- Google News RSS as OPML☆25Updated 7 years ago
- Examples for getting started using https://case.law☆70Updated 3 years ago