rgriffogoes / scraper-notebook
Jupyter Docker stack image with pre-installer scraper tools and libraries
☆26Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for scraper-notebook
- A collection of PDF command line tools and wrappers for Linux☆92Updated last year
- Filter RSS Feed with GPT-4☆16Updated last year
- Crawl a website to generate knowledge file for RAG☆19Updated 3 months ago
- Daily TV News Summary using GPT☆21Updated 7 months ago
- ☆21Updated 3 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆52Updated 3 weeks ago
- A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file.☆22Updated 5 years ago
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 2 months ago
- Jurisdiction ID and abbreviation data files for using with Jurism and other projects.☆34Updated last year
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero.☆18Updated last year
- Multicolumn support for pandoc's markdown☆54Updated last year
- Read files (pdf/png/jpg) with OCR and rename using AI.☆20Updated last year
- Puppeteer automation through n8n☆16Updated 2 years ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆16Updated last year
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- Top 15K of GitHub's finest.☆55Updated this week
- Filter your current RSS feeds with AI customized recommendations.☆23Updated last month
- A streamlit component for graph visualization☆30Updated 2 years ago
- LLM plugin for embeddings using sentence-transformers☆43Updated 9 months ago
- Markdown text to a novel in ePub and PDF.☆41Updated 2 years ago
- Lets to use local llms in your Obsidian Vaults, create new texts from your prompts and crate texts based on your inputs☆37Updated 4 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.☆10Updated last year
- Webinterface for administrating Ollama and model Quantization with public endpoints and automized OPENAI proxy☆51Updated 6 months ago
- Sync GitHub starred repos to a Raindrop.io collection☆49Updated 5 months ago
- Podcast feed generator for existing tagged M4A or MP3 files☆14Updated 4 years ago
- Human-in-the-loop document classification☆10Updated 3 years ago
- Automatically sync Omnivore pages to Raindrop.io☆21Updated 3 months ago
- Python based Wikidata framework for easy dataframe extraction☆39Updated 11 months ago
- Shell wrapper for OpenAI's ChatGPT, DALL-E, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Mistral, Groq, Anthropic, and Novita AI i…☆69Updated this week
- Explore your activity on Google with R: How to Analyze and Visualize Your Personal Data Search History. Find out how and how much you hav…☆12Updated 3 years ago