rgriffogoes / scraper-notebook
Jupyter Docker stack image with pre-installed scraper tools and libraries
☆29Updated 3 years ago
Alternatives and similar repositories for scraper-notebook
Users that are interested in scraper-notebook are comparing it to the libraries listed below
- A Python library for accessing the ClickUp API☆75Updated last year
- ☆23Updated 3 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆55Updated 2 months ago
- Containerized workflow automation tool☆22Updated last month
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆70Updated last year
- A collection of PDF command line tools and wrappers for Linux☆116Updated 2 years ago
- Combines Obsidian, mkdocs, gitea, and droneCI to create a compelling wiki solution. Blog post -☆39Updated 4 years ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆328Updated 2 years ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆39Updated 2 years ago
- Generate a list of your GitHub stars by topic - automatically!☆102Updated 3 years ago
- SingleFile docker implementation providing access via CLI and WEB service☆53Updated last year
- Unofficial Otter.ai Python API☆82Updated 2 months ago
- ScrapingAnt API client for Python.☆43Updated last year
- Simple wrapper script for Joplin CLI, for those that want to speedily make a note from terminal☆23Updated 7 years ago
- Apify actor to scrape YouTube search results. You can set the maximum videos to scrape per page as well as the date from which to start s…☆27Updated 3 years ago
- Datasette plugin to create interactive dashboards☆171Updated this week
- Lightweight, non-root, container image for the yt-dlp command line, including FFmpeg.☆25Updated 2 years ago
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆41Updated 7 years ago
- Terraform project that deploys VSCode Server on Oracle Cloud Infrastructure (free tier) and protects access with Cloudflare Zero Trust…☆28Updated last week
- Projects, resources, and tutorials that take code-server to the next level☆191Updated last year
- A containerized desktop environment geared toward machine learning development.☆81Updated last year
- Hook toolkit for Paperless-ngx with a REST API client written in Go☆13Updated 3 weeks ago
- Tool to index and serve HTML files. Powered by Datasette.☆111Updated 3 years ago
- Podcast feed generator for existing tagged M4A or MP3 files☆15Updated 5 years ago
- Scrape various open data directories to create an index of what's available out there☆37Updated 11 months ago
- Read files (pdf/png/jpg) with OCR and rename using AI.☆24Updated 2 years ago
- A customizable lightweight SQL query tool that works on tabular data, including Beancount.☆45Updated 5 months ago
- Datasette plugin for uploading CSV files and converting them to database tables☆27Updated 2 months ago
- Autonomous newsletter builder tool for Listmonk and Ghost Blog CMS. This GoLang App compiles a newsletter from an RSS feed and posts it t…☆26Updated 9 months ago
- Server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆14Updated last year