NoEdgeAI / doc2x-docLinks
doc2x docs
☆73Updated last year
Alternatives and similar repositories for doc2x-doc
Users that are interested in doc2x-doc are comparing it to the libraries listed below
Sorting:
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆284Updated 6 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆207Updated last year
- Using GPT to parse PDF☆102Updated last year
- 通过paddle ocr实现pdf转markdown☆78Updated last year
- Convert different model APIs into the OpenAI API format out of the box.☆160Updated last year
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆179Updated last month
- 可能是免费中最好的搜索引擎API,支持Google,Bing,DuckDuckGo,Yahoo☆144Updated 2 years ago
- 利 用LLM+敏感词库,来自动判别是否涉及敏感词。☆136Updated 2 years ago
- ☆111Updated last year
- ChatPPT is powered by chatgpt/ollama, it could help you to generate PPT/slide. It supports output in English and Chinese☆306Updated 7 months ago
- Based on RapidOCR, extract the PDF content☆184Updated 7 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆124Updated 6 months ago
- MinerU API server☆84Updated last year
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆104Updated last year
- Fast pdf translate是一款pdf翻译软件,基于MinerU实现pdf转markdown的功能,接着对markdown进行分割, 送给大模型翻译,最后组装翻译结果并由pypandoc生成结果pdf。☆39Updated 9 months ago
- 一个可以验证和计算文本消耗 Token 的小工具,支持在浏览器中使用,汉化自 OpenAI Tokenizer。☆61Updated last year
- Analysis of Chinese and English layouts 中英文版面分析☆259Updated 4 months ago
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆74Updated 3 weeks ago
- 本数据集属于 根因分析与练习系统(Root Cause Analysis and Exercises for Mathematics, RCAE) 的基础子项目之一。旨在可以高效的发现学生数学、生物学业错误的根本原因。☆48Updated last year
- AI Q&A Search Engine ➡️ 基于LangChain和SearXNG打造的开源AI搜索引擎☆212Updated 7 months ago
- Create your own GPT intelligent assistants using Azure OpenAI, Ollama, and local models, build and manage local knowledge bases, and expa…☆102Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆242Updated last week
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆190Updated last year
- optimize your prompt like promptperfect|万能提示词|大语言模型提示词优化☆46Updated 2 years ago
- ☆509Updated 9 months ago
- ☆547Updated last year
- ☆70Updated last year
- Python 接入文多多AiPPT,通过主题/文件/网址等方式生成PPT,支持原生图表、动画、3D特效等复杂PPT的解析和渲染,支持用户自定义模板,支持智能添加动画。AI generates PowerPoint Presentation, Supports parsing…☆29Updated last year
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆144Updated last year
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 构建智能的多模态AI Agent。☆228Updated this week