chrisstiles / PublishDateBot
A Reddit bot that finds original publish dates on linked articles.
☆10Updated last year
Alternatives and similar repositories for PublishDateBot
Users who are interested in PublishDateBot are comparing it to the libraries listed below.
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Estimating the age of web resources☆97Updated 8 months ago
- Parse government documents into well formed JSON☆75Updated 3 weeks ago
- A helper library full of URL-related heuristics.☆73Updated this week
- Grabbing all news.☆61Updated 6 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆198Updated 2 weeks ago
- Cleans Reddit Text Data☆84Updated 5 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆14Updated 11 months ago
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia☆43Updated last month
- A simple Python wrapper for the archive.is capturing service☆215Updated last year
- Convert Wikipedia database dumps into plaintext files☆327Updated 4 years ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆189Updated 3 weeks ago
- track changes to the news, where news is anything with an RSS feed☆182Updated 5 years ago
- ☆76Updated this week
- Python wrapper library for the Datamuse API☆83Updated 2 years ago
- Extract text from HTML☆134Updated 3 weeks ago
- Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more☆224Updated 2 years ago
- A browser extension to share data about your social feed with researchers and journalists to increase transparency.☆86Updated 2 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Updated 5 years ago
- A web tool to convert Wiki tables to CSV 📈☆195Updated last year
- Contains scripts and data to render map of reddit☆126Updated 9 months ago
- Python Pushshift.io API Wrapper (for comment/submission search)☆363Updated 2 years ago
- Miscellaneous scripts to gather and process data of wikis.☆20Updated 2 years ago
- Home of the RECAP Chrome, Safari, and Firefox Extensions☆72Updated this week
- Scraper for downloading the entire ebooks repository of project Gutenberg☆155Updated last week
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆220Updated 2 years ago
- Creates github index for similar repositories discovery☆191Updated 9 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆18Updated 2 years ago
- ☆44Updated 4 years ago