rgriffogoes / scraper-notebookLinks
Jupyter Docker stack image with pre-installer scraper tools and libraries
☆28Updated 3 years ago
Alternatives and similar repositories for scraper-notebook
Users that are interested in scraper-notebook are comparing it to the libraries listed below
Sorting:
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆53Updated 3 weeks ago
- Unofficial Otter.ai Python API☆79Updated last month
- A financial disclosure data extraction tool.☆18Updated 2 years ago
- Simple wrapper script for Joplin CLI, for those that want to speedily make a note from terminal☆23Updated 7 years ago
- ☆56Updated 2 years ago
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆40Updated 6 years ago
- ollama-obsidian-indexer☆76Updated last year
- Containerized workflow automation tool☆21Updated this week
- ☆12Updated last year
- a python library for accessing the ClickUp api☆75Updated last year
- ☆25Updated 5 months ago
- Web page archive tool☆26Updated 3 months ago
- Automatically sync Omnivore pages to Raindrop.io☆21Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated this week
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Updated last year
- Yet another tool to search through your (exported) ChatGPT conversations☆12Updated last year
- Hook toolkit for Paperless-ngx with a REST API client in written Go☆13Updated 2 weeks ago
- Terraform project that deploys VSCode Server on Oracle Cloud Infrastructure (free tier) and protect the access with Cloudflare Zero Trust…☆28Updated last week
- Marp Editor for @standardnotes. Create presentations with Marp and Marpit Markdown | https://marpeditor.com☆34Updated 4 years ago
- ☆23Updated 3 years ago
- A browser user interface for manual labeling of record pairs.☆48Updated 2 years ago
- Parse government documents into well formed JSON☆75Updated this week
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 4 years ago
- Python Wrapper on top of Unofficial Medium API to quickly extract data from Medium's website.☆58Updated 5 months ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆16Updated 2 years ago
- ☆14Updated 2 months ago
- This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.☆30Updated 2 years ago
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)☆116Updated 3 months ago
- An open interface to GDELT APIs☆62Updated 2 years ago
- Python bindings for Upwork API (OAuth2)☆44Updated last year