rgriffogoes / scraper-notebook
Jupyter Docker stack image with pre-installed scraper tools and libraries
☆27Updated 2 years ago
Alternatives and similar repositories for scraper-notebook
Users who are interested in scraper-notebook are comparing it to the repositories listed below.
- Marp Editor for @standardnotes. Create presentations with Marp and Marpit Markdown | https://marpeditor.com ☆33Updated 4 years ago
- Human-in-the-loop document classification☆10Updated 3 years ago
- LLM plugin for embeddings using sentence-transformers☆66Updated 2 months ago
- Add browser pages to your local YACY index☆15Updated 2 years ago
- A News Article Collection Library☆22Updated 2 years ago
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆69Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆59Updated 2 weeks ago
- Tools for interactive visual exploration of semantic embeddings.☆34Updated 9 months ago
- A Python3, async interface to the linkding REST API☆21Updated 4 months ago
- A collection of PDF command line tools and wrappers for Linux☆104Updated 2 years ago
- ☆37Updated 4 months ago
- ☆26Updated 4 years ago
- Dockerized workflow automation tool☆20Updated this week
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Datasette pre-configured with useful plugins. Experimental alpha.☆28Updated last year
- Daily TV News Summary using GPT☆24Updated last month
- ☆12Updated last year
- ☆18Updated 3 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated 9 months ago
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆34Updated 6 years ago
- Yet another tool to search through your (exported) ChatGPT conversations☆12Updated 8 months ago
- Data API and micro orm for DuckDB and MotherDuck☆9Updated 6 months ago
- Hey is a powerful chatbot for the command line CLI that uses ChatGPT to generate commands based on natural language input☆41Updated 2 years ago
- This project is a wrapper for Leilex, a legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆24Updated 8 months ago
- AI powered command line☆36Updated last year
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆14Updated last year
- Docker image for WhisperX by Max Bain☆12Updated 10 months ago
- 🐳 ⛅ Your own private cloud services with Docker☆44Updated last month
- Docker image of an enhanced version of n8n☆16Updated this week
- Datasette plugin for uploading CSV files and converting them to database tables☆26Updated last year