internetarchive / ZenoLinks
State-of-the-art web crawler π±
β256Updated this week
Alternatives and similar repositories for Zeno
Users that are interested in Zeno are comparing it to the libraries listed below
Sorting:
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β298Updated this week
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.β256Updated 5 months ago
- wabac.js - Web Archive Browsing Augmentation Clientβ109Updated last week
- A tool for detecting viruses and NSFW material in WARC filesβ15Updated 10 months ago
- ArxivTok π: Browse ArXiv papers with a TikTok-style vertical swipe interface.β91Updated 5 months ago
- Scrape details about Code Interpreter to track any changesβ67Updated 3 months ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β34Updated 2 months ago
- A font for writing tiny storiesβ308Updated last year
- Python 3 tools for downloading and preserving wikisβ117Updated 2 weeks ago
- β95Updated 6 months ago
- Command line tool for digging into WARC filesβ43Updated 2 weeks ago
- LLM benchmark: Generate an SVG of a pelican riding a bicycleβ119Updated 7 months ago
- Command-line tool and Rust library for handling Web ARChive (WARC) filesβ20Updated last month
- model.yaml is an open standard for defining crossplatform, composable AI modelsβ37Updated 2 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β43Updated last week
- Securely run AI-generated code in stateful sandboxes that run forever.β205Updated 2 months ago
- Centralised repository for WARC usage specifications.β115Updated 7 months ago
- Generate files that are almost JPEGs with random data. Possibly useful in feeding aggressive web crawlers.β16Updated last week
- tinyAgent uniquely treats functions as first-class citizens, easily transforming them into powerful AI tools. Inspired by human organizatβ¦β71Updated last week
- Code for tldw.tubeβ391Updated 2 months ago
- Wombat.js client-side rewriting libraryβ100Updated last month
- Python SDK for Browserbaseβ58Updated this week
- Twitter (sometimes known as X) can look prettier with this simple AddOn!β117Updated 2 months ago
- A Memento Aggregator CLI and Server in Goβ65Updated 4 months ago
- VERT's solution to crappy video conversion services.β153Updated 2 months ago
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β54Updated last year
- β324Updated 4 months ago
- Blueprint to Build Your Own Timeline Algorithmβ60Updated last month
- Readable YouTube Transcripts using Gemini 1.5 Flash 8Bβ60Updated last month
- Detect whether or not an audio file was generated by NotebookLMβ138Updated 7 months ago