JustAnotherArchivist / little-thingsLinks
The little things give you away... A collection of various small helper stuff β Mirror repo only, no longer kept in sync, refer to gitea.arpa.li instead
β24Updated 5 years ago
Alternatives and similar repositories for little-things
Users that are interested in little-things are comparing it to the libraries listed below
Sorting:
- A list of things related to software, literature, and other content for π£ Mementoβ105Updated last week
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user acβ¦β58Updated 5 months ago
- A collection of tools for archiving and analysing the internet.β78Updated 3 years ago
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ189Updated 3 weeks ago
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ98Updated 7 years ago
- Comparing warc filesβ17Updated 6 years ago
- data with similar subreddits graphβ48Updated 2 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β55Updated this week
- H2O is a web app for creating and reading open educational resources, primarily in the legal fieldβ43Updated 2 weeks ago
- A Chrome extension that creates a personalized map of the web based on the user's browsing history.β25Updated 12 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.β14Updated 11 months ago
- Converts WARC files to static HTMLβ51Updated 4 months ago
- Some tools to help analyze the twitter archiveβ64Updated 8 months ago
- A helper library full of URL-related heuristics.β73Updated this week
- Extract list of results from search engines pages as CSV with a bookmarklet directly within the browserβ29Updated this week
- Presentations on Quantified Self and Self-Tracking with Pythonβ33Updated 3 years ago
- Grabbing all news.β61Updated 6 years ago
- Matrix-based News Aggregation to Explore Media Biasβ20Updated 7 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each pageβ¦β42Updated last year
- Import data from Google Takeout to search and analyzeβ17Updated 3 years ago
- CommonCrawl keyword scanner. Time for month of CC data on EC2 c5.18xlarge instance for hundreds of keywords takes about 3 hours. LLM (BERβ¦β17Updated 2 years ago
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSSβ33Updated last month
- A curated list of awesome twitter toolsβ228Updated 2 years ago
- A Memento Aggregator CLI and Server in Goβ76Updated 11 months ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in tβ¦β132Updated 2 months ago
- Mastodon bot that posts videos showcasing how random locations in the world have changed since 1984.β40Updated 11 months ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.β12Updated last year
- A dockerized, queued high fidelity web archiver based on Squidwarcβ61Updated last year
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Updated 7 months ago
- API client for Aleph, supports bulk entity and document upload.β29Updated last year