JustAnotherArchivist / little-things
The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.arpa.li instead
☆24 · Updated 4 years ago
Alternatives and similar repositories for little-things
Users who are interested in little-things are comparing it to the repositories listed below
- Bot for operating snscrape in #archivebot on efnet ☆10 · Updated 5 years ago
- Decentralized web archiving ☆20 · Updated 6 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac… ☆54 · Updated this week
- CommonCrawl keyword scanner. Time for month of CC data on EC2 c5.18xlarge instance for hundreds of keywords takes about 3 hours. LLM (BER… ☆15 · Updated 2 years ago
- A Google Trends Analytics Package ☆13 · Updated last year
- Comparing warc files ☆17 · Updated 6 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆56 · Updated 10 months ago
- Presentations on Quantified Self and Self-Tracking with Python ☆30 · Updated 2 years ago
- Scripts for Internet Archive ☆13 · Updated 3 months ago
- Trough: Big data, small databases. ☆42 · Updated 11 months ago
- A dockerized, queued high fidelity web archiver based on Squidwarc ☆60 · Updated 11 months ago
- Find rss, atom, xml, and rdf feeds on webpages ☆30 · Updated 8 months ago
- 📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity ☆95 · Updated 6 years ago
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS ☆32 · Updated last month
- Proxies third-party PDF files and HTML pages with the Hypothesis client embedded, so you can annotate them ☆23 · Updated last week
- Track changes to GraphQL APIs by git scraping their schemas ☆28 · Updated 2 months ago
- Awesome list dedicated to digital and data preservation tools, sources, services and so on. ☆25 · Updated 2 years ago
- A framework for quick web archiving; canonical repository: https://gitea.arpa.li/JustAnotherArchivist/qwarc ☆28 · Updated 4 years ago
- Paste in some broken unicode text and FTFY will tell you how to fix it! ☆67 · Updated 2 years ago
- H2O is a web app for creating and reading open educational resources, primarily in the legal field ☆38 · Updated last month
- YouTube & TikTok analysis + youchoose recommendation customizer. Backend, extensions, and tooling ☆53 · Updated last year
- Scripts for Wikidata ☆20 · Updated 3 months ago
- Extending conceptual thinking with semantic embeddings. ☆36 · Updated 3 years ago
- Webrecorder Automated In-Page Behavior Framework ☆13 · Updated 4 years ago
- A Tumblr-scraping text post bot ☆14 · Updated 7 years ago
- SenateTrades: what stocks are your senators buying? ☆31 · Updated 2 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 · Updated 7 years ago
- A simple bot framework for commenting in subreddits. ☆12 · Updated 7 years ago
- A machine readable JSON QAnon dataset, archiving all QAnon drops for research only ☆25 · Updated last month
- A Memento Aggregator CLI and Server in Go ☆65 · Updated 3 months ago