A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
☆442Aug 13, 2024Updated last year
Alternatives and similar repositories for markdown-crawler
Users that are interested in markdown-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple and streamlined Python script to extract and filter links from a remote HTML resource.☆24Jan 12, 2025Updated last year
- A universal solution for web crawling lists. 抓取网页列表的通用解决方案☆107Jun 5, 2024Updated last year
- Browser automation for creating new pages in WordPress☆13Jun 7, 2025Updated 11 months ago
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai☆1,940Jul 21, 2024Updated last year
- An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate h…☆22Nov 21, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A tool to automatically create and run your Python scripts in a virtual environment with installed dependencies☆19Apr 9, 2026Updated last month
- Empower your script with auto_venv: Say Goodbye to Manual Setup or Install!☆21Jun 20, 2024Updated last year
- Cookbook for Crafting Good Code☆57Mar 19, 2024Updated 2 years ago
- word4num is a versatile tool for encoding numbers into words, applicable for geolocation, phone numbers, postcodes, IPv4 addresses, and m…☆12Oct 9, 2024Updated last year
- Plugin for Obsidian.md — Thesaurus, dictionary and more using the Datamuse API☆55May 22, 2024Updated 2 years ago
- Creating Intelligent Terminal Apps with ChatGPT and LLM Models☆30Jul 9, 2023Updated 2 years ago
- Search, modify, and parse messy HTML with ease.☆41Jan 17, 2026Updated 4 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Jul 9, 2024Updated last year
- ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on the scraped content. The bo…☆88Feb 17, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Nov 5, 2024Updated last year
- Wave Partial Differential Equation Solver in Python☆14Jun 5, 2024Updated last year
- Extract structured text from pdfs quickly☆686Jun 11, 2025Updated 11 months ago
- Examples of vector DB indexing and query with various vector databases.☆13May 20, 2026Updated last week
- This is a Telegram Bot 🤖 using Flowise API call giving a lot of posibilities with langchain tecnology.☆23Jun 27, 2024Updated last year
- An Obsidian plugin to set the Link Text using the document title☆23May 18, 2026Updated last week
- Use NavamAI to supercharge your productivity and workflow with personal, fast, and quality AI. Turn your Terminal into a configurable, in…☆26Oct 15, 2024Updated last year
- A modern shell☆355Nov 14, 2025Updated 6 months ago
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Rust implementation of Surya☆67Mar 1, 2025Updated last year
- Python scraper based on AI☆25,579May 17, 2026Updated last week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆10,879May 19, 2026Updated last week
- FinGPT is an AI language model designed to understand and generate financial content. Built upon the GPT (Generative Pre-trained Transfor…☆13Nov 14, 2025Updated 6 months ago
- Obsidian plugin to toggle between `lowercase` `UPPERCASE` and `Title Case`☆10Sep 10, 2024Updated last year
- Natural language browser automation☆637Dec 21, 2024Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69May 9, 2023Updated 3 years ago
- Convert PDF to markdown + JSON quickly with high accuracy☆35,381May 5, 2026Updated 3 weeks ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆66,299Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆24Feb 12, 2024Updated 2 years ago
- Personal AI search copilot, open-source Perplexity☆784Aug 7, 2025Updated 9 months ago
- Awesome list of awesome website from my bookmarks. Download bookmarks also.☆11Jul 29, 2023Updated 2 years ago
- HTML to Markdown converter and crawler.☆620Jan 9, 2024Updated 2 years ago
- 🔥 Search, scrape, and clean the web for AI agents.☆123,489Updated this week
- Turn any webpage into structured data using LLMs☆6,740Apr 13, 2026Updated last month
- Integrated LLM-based document and data Q&A with knowledge graph visualization☆24Dec 9, 2023Updated 2 years ago