edgi-govdata-archiving / wayback
A Python API to the Internet Archive Wayback Machine
☆69Updated 6 months ago
Alternatives and similar repositories for wayback:
Users that are interested in wayback are comparing it to the libraries listed below
- Alternative robots parser module for Python☆17Updated 2 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆122Updated last month
- A helper library full of URL-related heuristics.☆64Updated 4 months ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆174Updated 4 months ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆126Updated 10 months ago
- Some tools to help analyze the twitter archive☆62Updated 6 months ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated last year
- A list of over 5000 US news domains and their social media accounts☆43Updated 2 years ago
- ☆60Updated last month
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆23Updated 4 years ago
- A Python implementation of Lunr.js 🌖☆195Updated last month
- 🔍 PyPI package information at a glance for Python dependencies – a VS Code extension☆34Updated last week
- Accurately find/replace/remove emojis in text strings☆160Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- A GitHub Action to run a pytest command when new code is pushed into your repo☆57Updated 3 months ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.☆19Updated 2 weeks ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- Parse government documents into well formed JSON☆67Updated last week
- Common interface for data container classes☆66Updated last week
- Write Datasette canned queries as plain SQL files☆13Updated 2 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆14Updated last year
- Fast syllable estimation library based on pattern matching.☆37Updated last month
- Simple tool to pull posts and users from Gab☆16Updated this week
- A maximum-strength name parser for record linkage.☆36Updated last week
- A modern Python library for writing maintainable web scrapers.☆245Updated 7 months ago
- Loadable spellfix1 extension for sqlite as python package☆26Updated 10 months ago
- A Python library for defining rule-based overrides on messy data☆13Updated 3 months ago
- Cookiecutter template for creating renamed PyPI packages☆57Updated last year
- Metadata extraction at a distance☆24Updated 2 weeks ago
- Easy rate-limiting for python requests☆90Updated last month