internetarchive / cdx-summary
Summarize web archive capture index (CDX) files.
☆65Updated 2 years ago
Alternatives and similar repositories for cdx-summary:
Users that are interested in cdx-summary are comparing it to the libraries listed below
- Author Tools☆38Updated 3 weeks ago
- Legal tool (licenses, public domain dedication, etc.) data for Creative Commons☆33Updated last week
- Web-based whois gateway written in Python for lighttpd☆24Updated last year
- Downloadable MARC records from the U.S. Government Publishing Office.☆58Updated 6 months ago
- IETF Mail List Archives☆43Updated 2 months ago
- A Memento TimeGate☆41Updated 4 years ago
- Reference implementation of proposed Global Privacy Control standard written in Node/Express with sample code and instructions.☆66Updated 9 months ago
- 🎛 Configuration files used by DuckDuckGo's apps and extensions to control which privacy protections are enabled.☆138Updated this week
- Web program to see the number of links to a page in any Wikimedia project.☆83Updated 6 months ago
- WHATWG Standard repository templates and infrastructure☆38Updated 3 months ago
- Mirror from https://gerrit.wikimedia.org/g/analytics/wikistats2☆96Updated 2 weeks ago
- Convert text-format RFCs and Internet-Drafts to html☆34Updated 5 months ago
- A repo to manage and publish W3C Community and Business Groups final reports☆26Updated 2 months ago
- CC Legal Database: curated repository of Case Law and Scholarship data from around the world in a Django based website.☆50Updated 2 months ago
- The link service is used to create links to content and metadata on govinfo☆93Updated 11 months ago
- Generate RFCs and IETF drafts from document source in XML according to the IETF xml2rfc v2 and v3 vocabularies☆78Updated 3 weeks ago
- Open Collective's REST API legacy, v1, and v2!☆51Updated this week
- Convert mapping from Google Spreadsheet CSV export to regexp YAML file☆24Updated 4 years ago
- Hotfixes for Refined GitHub☆32Updated 2 weeks ago
- Legal tool (licenses, public domain dedication, etc.) management application for Creative Commons☆111Updated 2 weeks ago
- A collection of user scripts and Tool Labs tools intended for users of Wikimedia Foundation wikis.☆47Updated last month
- Workflow to manage volunteer translations of W3C documents☆37Updated last week
- Repository for the maintenance of the schema.org accessibility property values for discoverability.☆20Updated last week
- Checker used at W3C to validate the compliance of Technical Reports with publication rules☆87Updated this week
- Forward to Libraries service (selected code and data)☆20Updated last week
- Wikipedia 1.0 engine & selection tools☆31Updated this week
- Web archive index server based on RocksDB☆34Updated 4 months ago
- Framework of tools and libraries for building and running bots on Wikipedia☆20Updated 3 months ago
- W3C-customized version of the feedvalidator (forked from https://github.com/rubys/feedvalidator/)☆90Updated last week
- ☆45Updated last year