pjamar / htmls-to-datasette
Tool to index and serve HTML files. Powered by Datasette.
☆95 · Updated 2 years ago
Alternatives and similar repositories for htmls-to-datasette:
Users who are interested in htmls-to-datasette are comparing it to the libraries listed below.
- Python script to extract news from RSS feeds and save it as json. ☆18 · Updated 2 years ago
- ☆38 · Updated last year
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing… ☆64 · Updated this week
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac… ☆50 · Updated this week
- A dockerized, queued high fidelity web archiver based on Squidwarc ☆57 · Updated 7 months ago
- Stupidly simple DIY web archiving tool ☆33 · Updated 4 months ago
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web. ☆147 · Updated last week
- 🦛 Scrapes websites and generates RSS feeds ☆53 · Updated this week
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch… ☆18 · Updated last year
- JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page… ☆39 · Updated 4 months ago
- Export your GitHub activity: events, repositories, stars, etc. ☆50 · Updated last year
- A lightweight feed reader that runs in your browser, with no backend ☆49 · Updated 4 months ago
- Give me a website, I'll make you an epub. ☆22 · Updated 2 years ago
- Back up and parse your browser history databases (Chrome, Firefox, Safari, and other Chrome/Firefox derivatives) ☆125 · Updated 2 months ago
- 🔖 Run linkding on fly.io. Back up the bookmark DB to cloud storage with litestream. ☆72 · Updated 7 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler. ☆38 · Updated last month
- SingleFile docker implementation providing access via CLI and WEB service ☆43 · Updated 7 months ago
- A modified version of searx (the privacy-respecting metasearch engine) to only search an allowlist of sites, to build functionality simil… ☆19 · Updated 3 years ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A… ☆16 · Updated 4 months ago
- Convert an online sitemap to Atom, RSS and JSON feeds ☆61 · Updated last year
- Strip non-presentational content out of HTML pages ☆45 · Updated 2 years ago
- A barebones web-based imitation of nvALT, written in Svelte and backed by RxDB ☆16 · Updated 2 years ago
- Create a SQLite database containing data from your Pocket account ☆102 · Updated last year
- A Mac application that uses the SingleFile repository. ☆57 · Updated 3 years ago
- A self-hosted bookmark database with full-text page content search ☆87 · Updated last year
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS ☆30 · Updated 5 months ago
- A map of Half Moon Bay ☆15 · Updated last year
- Self-hostable link database ☆76 · Updated this week
- Export your personal Spotify data: playlists, saved tracks/albums/shows, etc. as JSON ☆37 · Updated last year
- 📑 A way to automatically sync data with your Kindle, such as RSS feeds, manga, and much more. ☆39 · Updated last year