rgriffogoes / scraper-notebookLinks
Jupyter Docker stack image with pre-installer scraper tools and libraries
☆27Updated 3 years ago
Alternatives and similar repositories for scraper-notebook
Users that are interested in scraper-notebook are comparing it to the libraries listed below
Sorting:
- a python library for accessing the ClickUp api☆75Updated last year
- ☆21Updated 2 years ago
- Simple wrapper script for Joplin CLI, for those that want to speedily make a note from terminal☆23Updated 7 years ago
- Lightweight, non-root, container image for the yt-dlp command line, including FFmpeg.☆25Updated 2 years ago
- Hook toolkit for Paperless-ngx with a REST API client in written Go☆13Updated last month
- Generate a list of your GitHub stars by topic - automatically!☆83Updated 2 years ago
- ScrapingAnt API client for Python.☆43Updated last year
- Scrape various open data directories to create an index of what's available out there☆37Updated 7 months ago
- A GitHub action for turning scanned PDF's into searchable documents☆15Updated 6 months ago
- SingleFile docker implementation providing access via CLI and WEB service☆49Updated last year
- Yet another tool to search through your (exported) ChatGPT conversations☆12Updated 11 months ago
- Mixpost Installation with Docker Containers☆12Updated 2 years ago
- Web page archive tool☆27Updated last week
- Tailscale in Docker without elevated privileges☆57Updated last year
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆39Updated 6 years ago
- Apify actor to scrape Youtube search results. You can set the maximum videos to scrape per page as well as the date from which to start s…☆26Updated 3 years ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆17Updated 2 years ago
- ☆26Updated 4 years ago
- Scrape HN to track links from specific domains☆63Updated this week
- Build n8n nodes from OpenAPI specs and YAML files☆74Updated 3 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆50Updated last week
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)☆110Updated 2 weeks ago
- Unofficial Otter.ai Python API☆74Updated last year
- Projects, resources, and tutorials that take code-server to the next level☆183Updated last year
- Automatically sync Omnivore pages to Raindrop.io☆22Updated 10 months ago
- Tool to index and serve HTML files. Powered by Datasette.☆107Updated 3 years ago
- 💡✏️️ ⬇️️ JSON to Markdown converter - Generate Markdown from format independent JSON☆74Updated 6 years ago
- Ethical, legal, and effortless extraction of Reddit data in your database☆78Updated last month
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆14Updated last year
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆66Updated last year