rgriffogoes / scraper-notebook
Jupyter Docker stack image with pre-installer scraper tools and libraries
☆26Updated 2 years ago
Alternatives and similar repositories for scraper-notebook:
Users that are interested in scraper-notebook are comparing it to the libraries listed below
- Crawl a website to generate knowledge file for RAG☆28Updated 5 months ago
- The GitBook documentation site for OpenAlex☆18Updated last week
- A Python3, async interface to the linkding REST API☆17Updated this week
- A News Article Collection Library☆22Updated last year
- A financial disclosure data extraction tool.☆13Updated last year
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆17Updated last year
- this is a collections of good docker contaienrs iv collected over the time , if you have portainer installed i have an app template that …☆13Updated last year
- AI powered command line☆36Updated last year
- Daily TV News Summary using GPT☆22Updated 2 months ago
- Add browser pages to your local YACY index☆15Updated last year
- ReadablePDF streamlines the effort of turning a not so great PDF into a more easily readable PDF (or of course a pretty decent PDF into a…☆33Updated 3 years ago
- 🐳 ⛅ Your own private cloud services with Docker☆37Updated this week
- Human-in-the-loop document classification☆10Updated 3 years ago
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆22Updated 6 years ago
- Webinterface for administrating Ollama and model Quantization with public endpoints and automized OPENAI proxy☆52Updated 8 months ago
- ScrapingAnt API client for Python.☆36Updated 6 months ago
- Read files (pdf/png/jpg) with OCR and rename using AI.☆21Updated last year
- Use Readwise Reader 📖 in the Command-line (CLI) 💻☆21Updated last year
- Simple wrapper script for Joplin CLI, for those that want to speedily make a note from terminal☆23Updated 6 years ago
- A super simple and helpful way to add websites to monitor☆15Updated 10 months ago
- Hubcap is an autonomous AI agent in 25 lines of code: a small Autobot that you can't trust☆39Updated last year
- LLM plugin for embeddings using sentence-transformers☆44Updated 11 months ago
- This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.☆28Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 4 months ago
- Small and simple directory synchronizer (a BASH script)☆83Updated last year
- Datasette pre-configured with useful plugins. Experimental alpha.☆28Updated 7 months ago
- AppleScripts, services, and other utilities which make my life on macOS easier☆16Updated 4 months ago
- Marp Editor for @standardnotes. Create presentations with Marp and Marpit Markdown | https://marpeditor.com☆29Updated 4 years ago
- A markdown-supported command-line interface tool that connects to ChatGPT using OpenAI's API key.☆47Updated last year
- KonMari your Pocket tsundoku from the command line☆16Updated last year