opendatalab / MinerU
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
☆17,775 · Updated this week
Related projects
Alternatives and complementary repositories for MinerU
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆5,743 · Updated 3 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆14,240 · Updated this week
- Convert PDF to markdown quickly with high accuracy ☆17,845 · Updated this week
- Get your documents ready for gen AI ☆9,923 · Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆6,053 · Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆5,185 · Updated 2 weeks ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆6,985 · Updated this week
- Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accoun… ☆4,704 · Updated 2 weeks ago
- Using GPT to parse PDF ☆3,036 · Updated 3 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆13,436 · Updated 3 weeks ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆5,648 · Updated 2 weeks ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆18,840 · Updated this week
- ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. ☆12,690 · Updated this week
- Question and Answer based on Anything. ☆11,890 · Updated this week
- 🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / Azure … ☆44,745 · Updated this week
- PDF to Markdown with vision models ☆6,324 · Updated this week
- Open source real-time translation app for Android that runs locally ☆6,828 · Updated 2 weeks ago
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team… ☆6,791 · Updated this week
- Generate high-definition short videos with one click using AI large language models. ☆18,267 · Updated this week
- The Memory layer for your AI apps ☆22,875 · Updated this week
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede… ☆8,494 · Updated 7 months ago
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. ☆23,277 · Updated this week
- #1 Locally hosted web application that allows you to perform various operations on PDF files ☆46,387 · Updated this week
- Brand new TTS solution ☆14,572 · Updated this week
- Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI. ☆15,471 · Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud. ☆9,783 · Updated this week
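- OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. Can be used to manage keys for redistribution; ships as a single executable file… ☆19,242 · Updated last week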
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne… ☆5,422 · Updated this week
- Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… ☆4,624 · Updated this week
- Python scraper based on AI ☆15,802 · Updated this week