simonw / scrape-hacker-news-by-domain
Scrape HN to track links from specific domains
☆55Updated this week
Alternatives and similar repositories for scrape-hacker-news-by-domain:
Users that are interested in scrape-hacker-news-by-domain are comparing it to the libraries listed below
- CLI for running files through AWS Textract☆54Updated 11 months ago
- Command-line tool for fetching JSON from paginated APIs☆66Updated last year
- Datasette plugin for streaming SQLite database backups to S3, using Litestream!☆15Updated 3 weeks ago
- Scripts and ideas to manage tons and tons of images and movies☆16Updated last week
- Add website scraping abilities to Datasette☆62Updated 2 years ago
- Create a SQLite database containing your data from Google Calendar☆56Updated 2 years ago
- Using Datasette and CLIP embeddings to find similar faucets.☆22Updated last year
- GitHub template repository for creating new Python Click CLI tools, using the simonw/click-app cookiecutter template☆29Updated 10 months ago
- Create a SQLite database containing data from your Pocket account☆105Updated last year
- Wikidata's QRank as a SQLite DB.☆28Updated last year
- Datasette plugin that shows a map for any data with latitude/longitude columns☆94Updated 7 months ago
- A Twitter, Mastodon, and BlueSky bot that shares new interactive, graphic, and data vis stories from newsrooms around the world☆54Updated this week
- Tools for running OCR against files stored in S3☆119Updated 2 years ago
- Track changes to GraphQL APIs by git scraping their schemas☆28Updated this week
- "llm python" is a command to run a Python interpreter in the LLM virtual environment☆31Updated last year
- Datasette pre-configured with useful plugins. Experimental alpha.☆28Updated 9 months ago
- Datasette plugin for publishing data using Vercel☆44Updated 2 years ago
- A tool for creating credentials for accessing S3 buckets☆206Updated 3 months ago
- Secure, locally-run Retrieval-Augmented Generation system for document-based question-answering, utilizing Llama 3, Mistral, and Gemini m…☆22Updated 5 months ago
- Datasette plugin for rendering HTML based on JSON values☆26Updated 2 years ago
- A collection of prompts for use with the LLM CLI tool☆15Updated last year
- Scrape various open data directories to create an index of what's available out there☆36Updated last month
- Quality News - Towards a fairer ranking formula for Hacker News☆81Updated last week
- Python functions for flattening a JSON object to a single dictionary of pairs, and unflattening that dictionary back to a JSON object☆52Updated 6 months ago
- Git scrapers for scraping the fediverse☆15Updated this week
- CLI tool for running text through OpenAI Text to speech☆164Updated last year
- CLI tool for loading markdown files into a SQLite database☆83Updated 2 years ago
- Datasette plugin for rendering Markdown☆29Updated last year
- Create embeddings for LLM using the Nomic API☆22Updated 4 months ago