ArchiveTeam / github-grab
Archiving GitHub
☆9Updated 3 months ago
Alternatives and similar repositories for github-grab:
Users that are interested in github-grab are comparing it to the libraries listed below
- Archiving URLs (outlinks) from a variety of sources.☆20Updated 3 weeks ago
- ☆8Updated 4 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆25Updated 7 months ago
- SSH to POST. For making weird, SSH-based pastebins.☆16Updated 3 years ago
- Why is SponsorBlock down?!☆13Updated 8 months ago
- A GitHub action to toot from a repository☆22Updated last week
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆14Updated 3 years ago
- Paradux: recover from maximum data disaster☆20Updated 2 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated 5 months ago
- In memoriam of Webby (1992-2022). May you rest in peace. Life is sometime strange, you have to keep fighting!☆7Updated 3 years ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆17Updated 5 months ago
- Fit a video into a 10mb file (Discord nitro pls?)☆17Updated last week
- Mastodon bot that boosts trending posts from other instances into your timeline☆21Updated 5 months ago
- Detect and invoke build systems☆20Updated this week
- 🎺🐤👱♂️ Automatically updated dump of Truth Social's source code (reskinned Mastodon)☆14Updated 5 months ago
- A backward-compatible subset of the PNG file format, for uncompressed bitmaps☆24Updated 5 months ago
- Archiving public telegram messages.☆12Updated 2 months ago
- Nondestructive warc-in-tar to warc conversion☆26Updated 11 years ago
- Perform garbage collection on all git repos in a given directory☆25Updated 2 years ago
- Overview of telecommunication standards and technologies for internet access☆14Updated 9 months ago
- Automatically updated dump of Truth Social's source code (reskinned Mastodon)☆32Updated 10 months ago
- peckish (case-sensitive) is a CLI tool/Rust library for (re)packaging Linux software artifacts.☆46Updated 4 months ago
- URLTeam's second generation of URL shortener archiving tools☆75Updated last month
- A script that generates URL variations to test URL parsers with☆25Updated 2 years ago
- Archiving all metadata from YouTube (everything except videos themselves due to size)☆27Updated 2 months ago
- Scripts for Internet Archive☆12Updated this week
- ☆18Updated 5 years ago
- Downloads and imports Wikipedia page histories to a git repository☆34Updated 2 months ago
- Cloudflare Worker cron to sync Discord bot guild count to discord.bots.gg API☆10Updated 11 months ago