Read and write WARC files in Go
★49 · Mar 16, 2026 · Updated this week
Alternatives and similar repositories for gowarc
Users that are interested in gowarc are comparing it to the libraries listed below.
- Summarize and ask questions about items in the Internet Archive · ★18 · Apr 1, 2023 · Updated 2 years ago
- State-of-the-art web crawler · ★394 · Updated this week
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en… · ★19 · Jul 11, 2025 · Updated 8 months ago
- Read and write WARC files in Go · ★50 · Apr 9, 2018 · Updated 7 years ago
- A fast URL parser for Go · ★40 · Mar 4, 2023 · Updated 3 years ago
- Yet another frontend for Uglysearch with neobrutalism design · ★20 · Feb 25, 2026 · Updated 3 weeks ago
- Command line tool for digging into WARC files · ★51 · Feb 27, 2026 · Updated 3 weeks ago
- Span formats. · ★16 · Updated this week
- CDXJ Indexing of WARC/ARCs · ★33 · Dec 10, 2024 · Updated last year
- A dockerized, queued high fidelity web archiver based on Squidwarc · ★62 · Jul 9, 2024 · Updated last year
- A ServiceWorker for client-side reconstruction of composite mementos · ★16 · Mar 6, 2025 · Updated last year
- Web archiving using Google Chrome · ★46 · Dec 30, 2019 · Updated 6 years ago
- ArchiveWeb.page Express! · ★14 · Nov 1, 2024 · Updated last year
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives · ★16 · Jun 10, 2021 · Updated 4 years ago
- JavaScript Aware Web Archive Crawler (JAWA) (OSDI'22) · ★13 · Dec 21, 2022 · Updated 3 years ago
- Centralised repository for WARC usage specifications. · ★125 · Oct 12, 2025 · Updated 5 months ago
- Docker for ScanTailor and ScanTailor Advanced · ★14 · Mar 17, 2024 · Updated 2 years ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format · ★56 · Oct 21, 2018 · Updated 7 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler. · ★56 · Feb 10, 2026 · Updated last month
- Process lines in parallel. · ★21 · Jan 23, 2025 · Updated last year
- Examples of writing policies in rego · ★13 · Oct 1, 2020 · Updated 5 years ago
- Add your configs for tmux · ★18 · Apr 3, 2022 · Updated 3 years ago
- DuckDB Engine as Google Sheets Library · ★20 · Dec 14, 2024 · Updated last year
- OAI-PMH harvester in shell. · ★17 · Dec 23, 2025 · Updated 3 months ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki · ★28 · Jul 31, 2024 · Updated last year
- A Rust library for reading and writing WARC files · ★59 · Nov 27, 2024 · Updated last year
- ★17 · Mar 31, 2025 · Updated 11 months ago
- Go types of schema.org ontology · ★11 · Oct 13, 2024 · Updated last year
- Comparing WARC files · ★17 · Feb 21, 2019 · Updated 7 years ago
- Golang WARC (Web ARChive) Library · ★30 · Aug 6, 2019 · Updated 6 years ago
- Simple cross-compiling with Cosmopolitan libc · ★24 · Dec 27, 2022 · Updated 3 years ago
- Wombat.js client-side rewriting library · ★119 · Feb 3, 2026 · Updated last month
- Public domain MAKE tool for DOS 16-bit (real mode) and 8086/88 CPU. Designed for Small C by J. Hendrix or any other languages. · ★32 · Mar 25, 2025 · Updated 11 months ago
- A CLI tool that generates IIIF Presentation 2.1 Manifests from METS/MODS · ★24 · Apr 17, 2025 · Updated 11 months ago
- A command line utility for listing and searching snapshots in web archives · ★17 · Dec 21, 2023 · Updated 2 years ago
- ★16 · Dec 13, 2014 · Updated 11 years ago
- Changes tab switching behavior in Atom · ★11 · Apr 27, 2016 · Updated 9 years ago
- Sort-friendly URI Reordering Transform (SURT) Python module · ★45 · Sep 11, 2025 · Updated 6 months ago
- Fork of PaperCraft for smartphones · ★12 · Jun 13, 2018 · Updated 7 years ago