A Rust library for reading and writing WARC files
☆59Nov 27, 2024Updated last year
Alternatives and similar repositories for warc
Users that are interested in warc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C++ library to parse WARC files☆11Jan 27, 2019Updated 7 years ago
- CDXJ Indexing of WARC/ARCs☆34May 11, 2026Updated 3 weeks ago
- Texting Robots: A Rust native `robots.txt` parser with thorough unit testing☆28Feb 14, 2024Updated 2 years ago
- Parse WARC (Web Archive Files) as a node.js stream☆23Oct 20, 2014Updated 11 years ago
- ☆14Mar 20, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Centralised repository for WARC usage specifications.☆128Apr 4, 2026Updated 2 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆98Apr 22, 2025Updated last year
- 🗄️ A simple CLI for converting WARC to Parquet.☆116Feb 12, 2025Updated last year
- Scoop by Rusty Foster and the CMF running Kuro5hin and other websites☆12Apr 14, 2017Updated 9 years ago
- Converts HTTrack crawls to WARC files☆34Aug 6, 2024Updated last year
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 6 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆20Jul 11, 2025Updated 10 months ago
- Wombat.js client-side rewriting library☆119Apr 29, 2026Updated last month
- A dockerized, queued high fidelity web archiver based on Squidwarc☆62Jul 9, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Web archive index server based on RocksDB☆43Updated this week
- A tool for detecting viruses and NSFW material in WARC files☆18Updated this week
- Tests for the Nintendo 3DS's ARM9 security processor☆24Dec 25, 2019Updated 6 years ago
- Einstein summation for Rust☆40Apr 8, 2021Updated 5 years ago
- Read and write WARC files in Go☆49Updated this week
- Command-line tool and Rust library for handling Web ARChive (WARC) files☆31Jun 2, 2025Updated last year
- A repository to organize materials from the AI4LAM Teach and Learning Working Group☆14May 5, 2023Updated 3 years ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆132Nov 21, 2025Updated 6 months ago
- A typed Rust library for easily interacting with and consuming the Bluesky Jetstream service.☆51Apr 10, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Apr 29, 2026Updated last month
- A prototype server to swarm multiple DATs for Webrecorder☆14Apr 27, 2019Updated 7 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆48Mar 19, 2018Updated 8 years ago
- A framework for creating digital exhibits by loading collection metadata directly from a CSV (such as a published Google Sheet!). See the…☆14Feb 20, 2026Updated 3 months ago
- Revealing the Omitted - An Exploration of Media Bias in the news coverage of Obamacare. Employs Selenium and BeautifulSoup to scrape over…☆17Feb 9, 2019Updated 7 years ago
- Pure Elixir disk backed key-value store.☆29Jan 28, 2026Updated 4 months ago
- A crate built on top of `axum-sessions`, implementing the CSRF Synchronizer Token Pattern☆15Updated this week
- A repository containing all of my custom keyboards for iOS☆14Jan 2, 2021Updated 5 years ago
- Webrecorders DevTools Protocol Automation Library☆18Oct 18, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Internet Archive's Sparkling Data Processing Library☆16May 4, 2026Updated last month
- File Manager built with egui in Rust☆20Jun 2, 2026Updated last week
- Warcbase is an open-source platform for managing analyzing web archives☆162Dec 8, 2017Updated 8 years ago
- game in rust-lang☆12Feb 19, 2016Updated 10 years ago
- Tool and library for handling Web ARChive (WARC) files.☆165Oct 11, 2024Updated last year
- Chrome Debugging Protocol interface for python asyncio☆14Oct 31, 2020Updated 5 years ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narhwal does not use NLP it avoids complexity…☆12Oct 16, 2018Updated 7 years ago