paulpierre / markdown-crawler
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
★416 · Updated last year
Alternatives and similar repositories for markdown-crawler
Users that are interested in markdown-crawler are comparing it to the libraries listed below
- HTML to Markdown converter and crawler. ★597 · Updated last year
- Yet another open source Perplexity ★460 · Updated last year
- Chat with GitHub Repo Using 200k context window of Claude instead of RAG! ★169 · Updated last year
- Parse PDFs into markdown using Vision LLMs ★443 · Updated last month
- ★239 · Updated 5 months ago
- 90% of what you need for LLM app development. Nothing you don't. ★265 · Updated 2 months ago
- Easily deployable API to convert PDF to markdown quickly with high accuracy. ★923 · Updated last year
- This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp… ★260 · Updated 4 months ago
- Extract structured text from pdfs quickly ★624 · Updated 5 months ago
- Social and customizable AI writing assistant! ★254 · Updated 4 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3 ★500 · Updated 3 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you. ★245 · Updated last year
- Your first AI prompt engineer ★412 · Updated 4 months ago
- LLM for Long Text Summary (Comprehensive Bulleted Notes) ★601 · Updated 4 months ago
- ★89 · Updated last year
- Official implementation of the paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP '24] ★478 · Updated 10 months ago
- A Function Calls Proxy for Groq, the fastest AI alive! ★205 · Updated last year
- Auto generate MindMap with ChatGPT ★267 · Updated last year
- ★152 · Updated last year
- Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, … ★490 · Updated this week
- Detect and extract tables to markdown and csv ★755 · Updated 10 months ago
- The simplest open-source implementation of perplexity.ai ★322 · Updated 10 months ago
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin… ★214 · Updated last year
- Structured information extraction from documents ★319 · Updated last year
- Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs ★796 · Updated 9 months ago
- An experimental UI for text-to-knowledge-graph generation ★781 · Updated last year
- GPT based autonomous agent designed to create personalized newspapers tailored to user preferences. ★1,395 · Updated last year
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy. ★292 · Updated 4 months ago
- DoctorGPT provides advanced LLM prompting for PDFs and webpages. ★245 · Updated last year
- A simple Python program to implement the search-extract-summarize flow. ★275 · Updated 5 months ago