arquivo / pwa-technologiesLinks
Arquivo.pt main goal is the preservation and access of web contents that are no longer available online. During the developing of the PWA IR (information retrieval) system we faced limitations in searching speed, quality of results, scalability and usability. To cope with this, we modified the archive-access project (http://archive-access.sourc…
☆48Updated last month
Alternatives and similar repositories for pwa-technologies
Users that are interested in pwa-technologies are comparing it to the libraries listed below
Sorting:
- Converts WARC files to static HTML☆48Updated last year
- Command line tool for digging into WARC files☆45Updated last week
- Centralised repository for WARC usage specifications.☆117Updated 9 months ago
- CDXJ Indexing of WARC/ARCs☆28Updated 8 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆48Updated last week
- Webrecorder Automated In-Page Behavior Framework☆13Updated 4 years ago
- A social media open post web archiving tool☆27Updated 2 months ago
- wabac.js - Web Archive Browsing Augmentation Client☆113Updated 3 weeks ago
- A tool for collection archival slivers of the web and web archives☆14Updated 6 months ago
- A Rails engine supporting the discovery of web archives.☆50Updated 2 years ago
- A Memento Aggregator CLI and Server in Go☆68Updated 6 months ago
- Web archive index server based on RocksDB☆35Updated last month
- Scraper for German democracy documents☆37Updated last year
- Comparing warc files☆17Updated 6 years ago
- Specifications developed and maintained by the Webrecorder community.☆136Updated 7 months ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- ☆52Updated last year
- export data from twitter archive and visualize it☆25Updated 2 years ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆69Updated 5 months ago
- The repo for the PetScan tool☆55Updated last month
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆128Updated last month
- React components to render differences between captures at the Wayback Machine☆35Updated 4 months ago
- A Github Action for turning Markdown into ReSpec HTML☆14Updated last year
- A list of things related to software, literature, and other content for 🕣 Memento☆99Updated last year
- Synchronize your Mastodon bookmarks to bookmarking services.☆13Updated this week
- search interface for scholarly works☆86Updated last year
- Perpetual Access To The Scholarly Record☆120Updated last year
- Create and edit WARC and WACZ files☆14Updated 8 months ago
- Tools for helping you work with web platform archive downloads.☆18Updated 5 years ago
- Website of the Fediverse Discovery Providers project☆18Updated 2 months ago