paulpierre / markdown-crawler
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
☆421 · Updated last year
Alternatives and similar repositories for markdown-crawler
Users who are interested in markdown-crawler are comparing it to the libraries listed below.
- Yet another open source Perplexity ☆460 · Updated last year
- HTML to Markdown converter and crawler. ☆603 · Updated last year
- Parse PDFs into markdown using Vision LLMs ☆452 · Updated 2 months ago
- Easily deployable API to convert PDF to markdown quickly with high accuracy. ☆925 · Updated last year
- This is an advanced Python tool that empowers you to effortlessly draft customizable PowerPoint slides using the Generative Pre-trained T… ☆145 · Updated last year
- SearchGPT / Perplexity Pages clone, but personalised for you. ☆245 · Updated last year
- Your first AI prompt engineer ☆411 · Updated 5 months ago
- 90% of what you need for LLM app development. Nothing you don't. ☆265 · Updated 3 months ago
- ⚡Chat with GitHub Repo Using 200k context window of Claude instead of RAG!⚡ ☆169 · Updated last year
- Clone of https://r.jina.ai which is deployable locally ☆49 · Updated last year
- Visualize Different Text Splitting Methods ☆309 · Updated 11 months ago
- This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simp… ☆268 · Updated 4 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3 ☆501 · Updated 4 months ago
- ☆241 · Updated 6 months ago
- openperplex is an opensource AI search engine ☆886 · Updated last year
- Octogen is an Open-Source Code Interpreter Agent Framework ☆257 · Updated last year
- Structured information extraction from documents ☆319 · Updated last year
- ☆90 · Updated last year
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆385 · Updated last year
- An innovative open-source Code Interpreter with (GPT,Gemini,Claude,LLaMa) models. ☆276 · Updated 6 months ago
- Command your browser with GPT ☆421 · Updated 3 weeks ago
- Co-create PowerPoint slide decks with AI ☆298 · Updated last week
- No-code ETL and data pipelines with AI and NLP ☆319 · Updated 9 months ago
- Prompt optimization scratch ☆875 · Updated 7 months ago
- Local semantic search. Stupidly simple. ☆435 · Updated last year
- Official implementation of the paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24'] ☆479 · Updated 11 months ago
- ☆235 · Updated last year
- The easiest, and fastest way to run AI-generated Python code safely ☆344 · Updated last year
- A tool for generating function arguments and choosing what function to call with local LLMs ☆433 · Updated last year
- The simplest open-source implementation of perplexity.ai ☆321 · Updated 10 months ago