rgriffogoes / scraper-notebook
Jupyter Docker stack image with pre-installed scraper tools and libraries
☆28Updated 3 years ago
Alternatives and similar repositories for scraper-notebook
Users that are interested in scraper-notebook are comparing it to the libraries listed below
- A Python library for accessing the ClickUp API☆75Updated last year
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆39Updated 6 years ago
- Containerized workflow automation tool☆21Updated last week
- Marp Editor for @standardnotes. Create presentations with Marp and Marpit Markdown | https://marpeditor.com☆35Updated 4 years ago
- The GitBook documentation site for OpenAlex☆23Updated last week
- LLM plugin for embeddings using sentence-transformers☆72Updated 7 months ago
- Web page archive tool☆27Updated 2 months ago
- Simple bash script to shorten URLs with YOURLS☆13Updated 3 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆52Updated this week
- Parse a Markdown article, download its images, and replace image URLs with local paths☆125Updated last month
- Automatically sync Omnivore pages to Raindrop.io☆22Updated last year
- Combines Obsidian, mkdocs, gitea, and droneCI to create a compelling wiki solution. Blog post -☆39Updated 4 years ago
- Unofficial Otter.ai Python API☆76Updated last week
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆70Updated last year
- ☆22Updated 2 years ago
- How to guides on web-crawling or scraping☆24Updated 7 months ago
- Hook toolkit for Paperless-ngx with a REST API client written in Go☆13Updated 3 weeks ago
- A News Article Collection Library☆22Updated 2 years ago
- Command line tool for converting CSV files into Markdown tables.☆136Updated last week
- A collection of PDF command line tools and wrappers for Linux☆112Updated 2 years ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆150Updated 3 weeks ago
- Tailscale in Docker without elevated privileges☆57Updated last year
- Generate a list of your GitHub stars by topic - automatically!☆101Updated 2 years ago
- This is a collection of good Docker containers I've collected over time; if you have Portainer installed, I have an app template that …☆13Updated 2 years ago
- Scrape HN to track links from specific domains☆68Updated last week
- Terraform project that deploys VSCode Server on Oracle Cloud Infrastructure (free tier) and protects access with Cloudflare Zero Trust…☆27Updated 2 weeks ago
- Reads HTML files, converting tables into CSV files☆31Updated 5 years ago
- Scrape various open data directories to create an index of what's available out there☆37Updated 9 months ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆17Updated 2 years ago
- SingleFile docker implementation providing access via CLI and WEB service☆51Updated last year