JustAnotherArchivist / little-thingsLinks
The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.arpa.li instead
☆24Updated 5 years ago
Alternatives and similar repositories for little-things
Users that are interested in little-things are comparing it to the libraries listed below
Sorting:
- data with similar subreddits graph☆48Updated 2 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆55Updated 3 months ago
- A collection of tools for archiving and analysing the internet.☆78Updated 3 years ago
- A list of things related to software, literature, and other content for 🕣 Memento☆102Updated last year
- H2O is a web app for creating and reading open educational resources, primarily in the legal field☆43Updated last month
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆188Updated last year
- Comparing warc files☆17Updated 6 years ago
- 📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity☆98Updated 7 years ago
- A Memento Aggregator CLI and Server in Go☆71Updated 9 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆52Updated 2 weeks ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆19Updated 5 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆14Updated 9 months ago
- Home of the RECAP Chrome, Safari, and Firefox Extensions☆68Updated this week
- An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.☆10Updated last year
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- A financial disclosure data extraction tool.☆18Updated 2 years ago
- Extract list of results from search engines pages as CSV with a bookmarklet directly within the browser☆27Updated 8 months ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆59Updated last year
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated last year
- An automated, programming-free web scraper for interactive sites☆111Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- Public API client for GETTR, a "non-bias [sic] social network," designed for data archival and analysis.☆95Updated this week
- SenateTrades: what stocks are your senators buying?☆35Updated 3 years ago
- Converts WARC files to static HTML☆49Updated 2 months ago
- Import data from Google Takeout to search and analyze☆17Updated 2 years ago
- Awesome links related to RSS, ATOM, and Syndication formats.☆61Updated last year
- The sequel to Big Cases Bot☆27Updated last month
- View browser history as a graph (Chrome extension)☆45Updated last year
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 8 years ago
- A Chrome extension that creates a personalized map of the web based on the user's browsing history.☆25Updated 12 years ago