A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
★441 · Aug 13, 2024 · Updated last year
Alternatives and similar repositories for markdown-crawler
Users that are interested in markdown-crawler are comparing it to the libraries listed below.
- A simple and streamlined Python script to extract and filter links from a remote HTML resource. ★24 · Jan 12, 2025 · Updated last year
- Browser automation for creating new pages in WordPress ★13 · Jun 7, 2025 · Updated 10 months ago
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai ★1,928 · Jul 21, 2024 · Updated last year
- An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate h… ★22 · Nov 21, 2025 · Updated 5 months ago
- A tool to automatically create and run your Python scripts in a virtual environment with installed dependencies ★19 · Apr 9, 2026 · Updated 3 weeks ago
- Empower your script with auto_venv: Say Goodbye to Manual Setup or Install! ★21 · Jun 20, 2024 · Updated last year
- Cookbook for Crafting Good Code ★57 · Mar 19, 2024 · Updated 2 years ago
- word4num is a versatile tool for encoding numbers into words, applicable for geolocation, phone numbers, postcodes, IPv4 addresses, and m… ★12 · Oct 9, 2024 · Updated last year
- Plugin for Obsidian.md - Thesaurus, dictionary and more using the Datamuse API ★55 · May 22, 2024 · Updated last year
- Creating Intelligent Terminal Apps with ChatGPT and LLM Models ★30 · Jul 9, 2023 · Updated 2 years ago
- Search, modify, and parse messy HTML with ease. ★41 · Jan 17, 2026 · Updated 3 months ago
- Containerized workflow automation tool ★22 · Apr 22, 2026 · Updated 2 weeks ago
- ★12 · Nov 5, 2024 · Updated last year
- Wave Partial Differential Equation Solver in Python ★14 · Jun 5, 2024 · Updated last year
- Extract structured text from pdfs quickly ★684 · Jun 11, 2025 · Updated 10 months ago
- An Obsidian plugin to set the Link Text using the document title ★23 · Jan 3, 2026 · Updated 4 months ago
- Use NavamAI to supercharge your productivity and workflow with personal, fast, and quality AI. Turn your Terminal into a configurable, in… ★26 · Oct 15, 2024 · Updated last year
- A modern shell ★355 · Nov 14, 2025 · Updated 5 months ago
- Incredibly descriptive audiovisual summaries for videos ★41 · Aug 2, 2024 · Updated last year
- Rust implementation of Surya ★66 · Mar 1, 2025 · Updated last year
- Convert HTML to Markdown ★2,164 · Nov 16, 2025 · Updated 5 months ago
- Python scraper based on AI ★23,444 · Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ★10,745 · Updated this week
- FinGPT is an AI language model designed to understand and generate financial content. Built upon the GPT (Generative Pre-trained Transfor… ★13 · Nov 14, 2025 · Updated 5 months ago
- Obsidian plugin to toggle between `lowercase` `UPPERCASE` and `Title Case` ★10 · Sep 10, 2024 · Updated last year
- Natural language browser automation ★631 · Dec 21, 2024 · Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. ★69 · May 9, 2023 · Updated 2 years ago
- Convert PDF to markdown + JSON quickly with high accuracy ★34,606 · Apr 24, 2026 · Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ★64,964 · Updated this week
- ★24 · Feb 12, 2024 · Updated 2 years ago
- Personal AI search copilot, open-source Perplexity ★785 · Aug 7, 2025 · Updated 9 months ago
- Live demo of shot-scraper ★41 · Mar 2, 2025 · Updated last year
- Add Google and Python documentation links to the bottom of exceptions. ★28 · Nov 4, 2023 · Updated 2 years ago
- 🔥 The API to search, scrape, and interact with the web for AI ★113,973 · Updated this week
- HTML to Markdown converter and crawler. ★618 · Jan 9, 2024 · Updated 2 years ago
- Simple frontend for Google Custom Search Engine ★13 · Apr 17, 2024 · Updated 2 years ago
- Turn any webpage into structured data using LLMs ★6,360 · Apr 13, 2026 · Updated 3 weeks ago
- Integrated LLM-based document and data Q&A with knowledge graph visualization ★24 · Dec 9, 2023 · Updated 2 years ago
- A Toolkit for Creating and Deploying LangChain Apps ★167 · May 3, 2023 · Updated 3 years ago