arquivo / pwa-technologies
Arquivo.pt's main goal is to preserve and provide access to web content that is no longer available online. While developing the PWA IR (information retrieval) system, we faced limitations in search speed, quality of results, scalability, and usability. To cope with this, we modified the archive-access project (http://archive-access.sourc…
☆45 Updated 3 months ago
Alternatives and similar repositories for pwa-technologies
Users that are interested in pwa-technologies are comparing it to the libraries listed below
- A tool for collecting archival slivers of the web and web archives ☆13 Updated 3 months ago
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆15 Updated 3 years ago
- Static Site Generator for Viewing Web Archives (in WACZ format) ☆26 Updated last year
- Command line tool for digging into WARC files ☆40 Updated this week
- A command line utility for listing and searching snapshots in web archives ☆16 Updated last year
- CDXJ Indexing of WARC/ARCs ☆25 Updated 5 months ago
- A social media open post web archiving tool ☆25 Updated last week
- A PDF classifier ensemble with REST API service ☆23 Updated 4 years ago
- Trough: Big data, small databases. ☆41 Updated 9 months ago
- Converts WARC files to static HTML ☆44 Updated 10 months ago
- Support for writing WARC files with Scrapy ☆21 Updated 5 years ago
- Comparing warc files ☆17 Updated 6 years ago
- Scripts for Wikidata ☆20 Updated last month
- Specification for authentication and creating signed WACZ Files ☆10 Updated 3 years ago
- ☆10 Updated 3 years ago
- A Memento TimeGate ☆43 Updated 5 years ago
- Webrecorder Automated In-Page Behavior Framework ☆13 Updated 4 years ago
- A Rails engine supporting the discovery of web archives. ☆50 Updated last year
- A Github Action for turning Markdown into ReSpec HTML ☆14 Updated 11 months ago
- Create and edit WARC and WACZ files ☆10 Updated 5 months ago
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs. ☆54 Updated last year
- Generate Wikidata property statistics dashboards, to be used by Wikiprojects. ☆10 Updated last week
- Scraper for German democracy documents ☆37 Updated last year
- Nondestructive warc-in-tar to warc conversion ☆26 Updated 12 years ago
- Tools for helping you work with web platform archive downloads. ☆17 Updated 5 years ago
- ☆11 Updated last month
- React components to render differences between captures at the Wayback Machine ☆33 Updated 2 weeks ago
- Searchable Linkable Open Public Indexed (SLOPI) Communication ☆19 Updated 2 years ago
- Digital Preservation of HTTP in documentary heritage. ☆22 Updated last year
- Scripts for Internet Archive ☆13 Updated last month