Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc
☆514Mar 8, 2026Updated last month
Alternatives and similar repositories for wdoc
Users that are interested in wdoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆199Jun 2, 2025Updated 10 months ago
- OpenSource Production ready Customer service with built in Evals and monitoring☆1,437Jan 12, 2026Updated 3 months ago
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆651Feb 24, 2025Updated last year
- RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.☆2,957Updated this week
- PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation☆2,380Sep 10, 2025Updated 7 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,635Jan 20, 2025Updated last year
- 一个基于 AI 的 Hacker News 中文播客项目,每天自动抓取 Hacker News 热门文章,通过 AI 生成中文总结并转换为播客内容。☆2,495Feb 25, 2026Updated last month
- A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office…☆7,498Updated this week
- [NeurIPS '25] Knowledge Graph Generation from Any Text☆1,094Mar 24, 2026Updated 2 weeks ago
- ☆49Sep 11, 2025Updated 7 months ago
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆386Jul 4, 2025Updated 9 months ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,749Nov 19, 2025Updated 4 months ago
- Extract information from any website by chatting with AI - Fork of Vercel AI Chatbot w/ Firecrawl Integrated☆130Jan 24, 2025Updated last year
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆817Jan 30, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ACL 2025 Findings] Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors☆86Jun 2, 2025Updated 10 months ago
- TrendPublish: 全自动 AI 内容生成与发布系统 | 微信公众号自动化 | 多源数据抓取 (Twitter/X、网站) | DeepseekAI、千问、讯飞模型 | 智能内容分析排序 | 定时发布 | 多模板支持 | Node.js | TypeScript |…☆2,871Apr 1, 2026Updated last week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆235Mar 27, 2026Updated 2 weeks ago
- ☆449Sep 18, 2024Updated last year
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆92Mar 20, 2025Updated last year
- ContextGem: Effortless LLM extraction from documents☆1,821Mar 16, 2026Updated 3 weeks ago
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆361Mar 20, 2026Updated 3 weeks ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆651Jan 11, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,759Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,526Aug 27, 2025Updated 7 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆11,592Mar 20, 2026Updated 3 weeks ago
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,282Sep 8, 2024Updated last year
- A Model Context Protocol server for converting almost anything to Markdown☆2,565Apr 3, 2026Updated last week
- The first open-source agent skills builder. Define skills by vibe workflow, run on Claude Code, Cursor, Codex & more. Build Clawdbot 🦞· …☆7,198Mar 25, 2026Updated 2 weeks ago
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆510Mar 9, 2026Updated last month
- A NextJS based app that takes a user prompt, or a YouTube url, or a Website URL, and generates a beautiful Mindmap.☆125Mar 5, 2025Updated last year
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆82Dec 27, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆9,079Oct 16, 2025Updated 5 months ago
- ☆80Apr 15, 2025Updated 11 months ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,839Mar 30, 2026Updated last week
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆769Dec 16, 2025Updated 3 months ago
- Fetch arxiv data to LLM-friendly text☆130Feb 18, 2026Updated last month
- Full Stack application for retrieving Stock Data and News using LLM, LangChain and LangGraph☆740Dec 8, 2024Updated last year
- The most accurate document search and store for building AI apps☆3,568Apr 2, 2026Updated last week