paulpierre / markdown-crawler
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
☆408 Updated last year
Alternatives and similar repositories for markdown-crawler
Users who are interested in markdown-crawler are comparing it to the libraries listed below.
- HTML to Markdown converter and crawler. ☆594 Updated last year
- Yet another open source Perplexity ☆456 Updated last year
- Parse PDFs into markdown using Vision LLMs ☆439 Updated 3 weeks ago
- 90% of what you need for LLM app development. Nothing you don't. ☆264 Updated 2 months ago
- Easily deployable API to convert PDF to markdown quickly with high accuracy. ☆917 Updated last year
- ☆238 Updated 4 months ago
- ☆89 Updated last year
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3 ☆499 Updated 2 months ago
- An experimental UI for text-to-knowledge-graph generation ☆781 Updated last year
- LLM for Long Text Summary (Comprehensive Bulleted Notes) ☆599 Updated 3 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you. ☆245 Updated last year
- Visualize Different Text Splitting Methods ☆300 Updated 10 months ago
- Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs ☆796 Updated 8 months ago
- A simple Python program to implement the search-extract-summarize flow. ☆273 Updated 4 months ago
- ⚡Chat with GitHub Repo Using 200k context window of Claude instead of RAG!⚡ ☆169 Updated last year
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content. ☆528 Updated this week
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, … ☆487 Updated this week
- This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp… ☆255 Updated 3 months ago
- Clone of https://r.jina.ai which is deployable locally ☆48 Updated last year
- Prompt optimization scratch ☆863 Updated 6 months ago
- An innovative open-source Code Interpreter with (GPT,Gemini,Claude,LLaMa) models. ☆275 Updated 5 months ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆384 Updated last year
- ☆274 Updated last year
- 💻💡 DoctorGPT provides advanced LLM prompting for PDFs and webpages. ☆245 Updated last year
- Your first AI prompt engineer ☆412 Updated 4 months ago
- ☆197 Updated this week
- clean & curate your data with LLMs. ☆490 Updated last year
- Social and customizable AI writing assistant! ☆253 Updated 4 months ago
- No-code ETL and data pipelines with AI and NLP ☆317 Updated 8 months ago
- ChatData brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milli… ☆178 Updated 11 months ago