JavaScript Aware Web Archive Crawler (JAWA) (OSDI'22)
☆13Dec 21, 2022Updated 3 years ago
Alternatives and similar repositories for Jawa
Users that are interested in Jawa are comparing it to the libraries listed below
Sorting:
- This repository contains content related to 2D and 3D lane detection, as well as video lane detection. There are not only papers here, bu…☆13Sep 1, 2024Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.☆42Nov 24, 2025Updated 3 months ago
- A fast URL parser for Go☆40Mar 4, 2023Updated 3 years ago
- Some ideas on making Bags into Git repositories☆16Dec 23, 2014Updated 11 years ago
- Peer-to-peer NATS message routing and S3 object sync solution☆18Feb 5, 2026Updated last month
- ModemManager Test Framework☆11May 23, 2018Updated 7 years ago
- A distributed graph database system (GDBMS)☆11Feb 20, 2023Updated 3 years ago
- Toolkitty is a coordination app for collectives, organisers and venues. You can organise events, share resources and spaces in a collabor…☆55Aug 11, 2025Updated 6 months ago
- Read and write WARC files in Go☆49Updated this week
- Tools to analyze web archives☆20Jul 12, 2016Updated 9 years ago
- irMagician command line utility☆10Dec 18, 2014Updated 11 years ago
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Aug 10, 2018Updated 7 years ago
- linear algebra package. like gonum/mat, but small. lets say gonum-lite☆12Jul 8, 2023Updated 2 years ago
- This is a small demo of how to transform a simple single-server RocksDB service written in Rust into a distributed version using OmniPaxo…☆16Feb 5, 2025Updated last year
- A IIIF static tile and manifest generator built using Python to generate IIIF tiled images and manifests. This application was put toget…☆10Mar 2, 2026Updated last week
- Tools for batch QC analysis of digitized audio collections☆11Nov 7, 2025Updated 4 months ago
- dictd server bindings in go☆10Oct 1, 2016Updated 9 years ago
- ☆11Jan 10, 2025Updated last year
- Python script to archive Tweets☆12Oct 2, 2012Updated 13 years ago
- Extract texts + their page numbers from PDF☆12Nov 25, 2024Updated last year
- Python library for generating EnergyPlus inputs☆11Feb 23, 2026Updated 2 weeks ago
- ☆10Aug 9, 2023Updated 2 years ago
- RDF file extension for DuckDB. Reads well, writes in progress☆14Updated this week
- Scripts for data mining Twitter☆11Apr 3, 2016Updated 9 years ago
- Google Sheets to SQLite CLI tool.☆13Aug 15, 2023Updated 2 years ago
- Collaborative bibliography on 'experimental writing' and 'financial crisis' from the Mute archive - http://metamute.org/archive☆10Dec 11, 2022Updated 3 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- Docker for ScanTailor and ScanTailor Advanced☆14Mar 17, 2024Updated last year
- python text steganography library☆11Nov 29, 2022Updated 3 years ago
- Recreation of Kubernetes the Hard Way in containers instead of GCP.☆10Jun 29, 2020Updated 5 years ago
- Multipage OSD support for the V2 goggles with O3☆20Apr 12, 2025Updated 10 months ago
- Lua binding for the lol-HTML rewriter/parser☆19Nov 14, 2020Updated 5 years ago
- scanning script for the noisebridge book scanner☆14May 12, 2017Updated 8 years ago
- Simple utilities to analyze and manipulate MARC files☆12Updated this week
- xml_to_json(xml, indent) function☆13Dec 13, 2021Updated 4 years ago
- A BPMN engine. A WASM variant of lib-bpmn-engine. Playground and Showcase interactive BPMN modelling and execution.☆15Aug 30, 2023Updated 2 years ago
- Generate a js API for analytics based on a json file containing the business actions to track☆11Jan 22, 2019Updated 7 years ago
- ☆16Feb 23, 2026Updated 2 weeks ago
- A CLI tool to easily ssh login to EC2 instances selected by peco.☆11Jul 25, 2019Updated 6 years ago