A multithreaded πΈοΈ web crawler that recursively crawls a website and creates a π½ markdown file for each page, designed for LLM RAG
β446Aug 13, 2024Updated last year
Alternatives and similar repositories for markdown-crawler
Users that are interested in markdown-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple and streamlined Python script to extract and filter links from a remote HTML resource.β24Jan 12, 2025Updated last year
- Browser automation for creating new pages in WordPressβ13Jun 7, 2025Updated last year
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.aiβ1,953Jul 21, 2024Updated last year
- An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate hβ¦β22Nov 21, 2025Updated 6 months ago
- A tool to automatically create and run your Python scripts in a virtual environment with installed dependenciesβ19Jun 2, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Empower your script with auto_venv: Say Goodbye to Manual Setup or Install!β21Jun 20, 2024Updated last year
- Cookbook for Crafting Good Codeβ57Mar 19, 2024Updated 2 years ago
- word4num is a versatile tool for encoding numbers into words, applicable for geolocation, phone numbers, postcodes, IPv4 addresses, and mβ¦β12Oct 9, 2024Updated last year
- Plugin for Obsidian.md β Thesaurus, dictionary and more using the Datamuse APIβ56May 22, 2024Updated 2 years ago
- Creating Intelligent Terminal Apps with ChatGPT and LLMΒ Modelsβ30Jul 9, 2023Updated 2 years ago
- Search, modify, and parse messy HTML with ease.β41Jan 17, 2026Updated 4 months ago
- ε¦ιδ½ιͺtextinζζ‘£θ§£ζοΌθ―·ηΉε»https://cc.co/16YSIyβ22Jul 9, 2024Updated last year
- Containerized workflow automation toolβ22Updated this week
- β12Nov 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Wave Partial Differential Equation Solver in Pythonβ14Jun 5, 2024Updated 2 years ago
- Extract structured text from pdfs quicklyβ695Updated this week
- Examples of vector DB indexing and query with various vector databases.β13May 20, 2026Updated 3 weeks ago
- This is a Telegram Bot π€ using Flowise API call giving a lot of posibilities with langchain tecnology.β23Jun 27, 2024Updated last year
- An Obsidian plugin to set the Link Text using the document titleβ23Updated this week
- Use NavamAI to supercharge your productivity and workflow with personal, fast, and quality AI. Turn your Terminal into a configurable, inβ¦β26Oct 15, 2024Updated last year
- A modern shellβ355Nov 14, 2025Updated 7 months ago
- Incredibly descriptive audiovisual summaries for videosβ41Aug 2, 2024Updated last year
- Rust implementation of Suryaβ67Mar 1, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Convert HTML to Markdownβ2,196Nov 16, 2025Updated 6 months ago
- Python scraper based on AIβ27,062Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β11,175May 22, 2026Updated 3 weeks ago
- Obsidian plugin to toggle between `lowercase` `UPPERCASE` and `Title Case`β10Sep 10, 2024Updated last year
- Natural language browser automationβ636Dec 21, 2024Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.β69May 9, 2023Updated 3 years ago
- Convert PDF to markdown + JSON quickly with high accuracyβ36,101Jun 6, 2026Updated last week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ68,181Jun 4, 2026Updated last week
- β24Feb 12, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Personal AI search copilot, open-source Perplexityβ783Aug 7, 2025Updated 10 months ago
- Awesome list of awesome website from my bookmarks. Download bookmarks also.β11Jul 29, 2023Updated 2 years ago
- Awesome TTSβ63Sep 16, 2021Updated 4 years ago
- HTML to Markdown converter and crawler.β620Jan 9, 2024Updated 2 years ago
- The API to search, scrape, and interact with the web at scale. π₯β132,865Updated this week
- Turn any webpage into structured data using LLMsβ6,815Updated this week
- Add Google and Python documentation links to the bottom of exceptions.β29Nov 4, 2023Updated 2 years ago