opendatalab / MinerU
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
☆11,253 Updated this week
Related projects:
- ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. ☆9,486 Updated this week
- Question and Answer based on Anything. ☆11,376 Updated this week
- Convert PDF to markdown quickly with high accuracy ☆16,438 Updated last week
- Generate high-definition short videos with one click using AI large language models. ☆16,112 Updated last month
- Brand new TTS solution ☆11,190 Updated this week
- A generative speech model for daily dialogue. ☆30,703 Updated 2 weeks ago
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. ☆17,176 Updated this week
- FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data process… ☆16,959 Updated this week
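- 🚀 A knowledge base question-answering system built on large language models and RAG. It works out of the box, is model-neutral and flexibly orchestrated, and can be quickly embedded into third-party business systems. ☆9,966 Updated this week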
- MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone ☆11,907 Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆10,156 Updated last week
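- An OpenAI API management and distribution system that supports Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage keys for redistribution and ships as a single executable, already… ☆17,930 Updated 3 weeks ago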
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆13,879 Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆4,727 Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆13,305 Updated 2 weeks ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆32,567 Updated this week
- Translate a video from one language to another and add dubbing. ☆9,955 Updated this week
- Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, m… ☆45,596 Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆4,988 Updated 3 weeks ago
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using MULTI LLMs, without GPU neede… ☆8,451 Updated 5 months ago
- Open-Sora: Democratizing Efficient Video Production for All ☆21,609 Updated last month
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆4,578 Updated last week
- Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accoun… ☆3,723 Updated 2 weeks ago
- Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud. ☆7,468 Updated this week
- Python scraper based on AI ☆14,399 Updated this week
- 🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / Azure … ☆40,737 Updated this week
- Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024) ☆30,812 Updated this week
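- Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments. ☆16,530 Updated this week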
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆6,363 Updated this week
- A voice cloning tool with a web interface that uses your voice or any sound to record audio. ☆7,202 Updated 3 weeks ago