NoEdgeAI / doc2x-doc
doc2x docs
☆43Updated 2 months ago
Alternatives and similar repositories for doc2x-doc:
Users that are interested in doc2x-doc are comparing it to the libraries listed below
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆219Updated last month
- 通过paddle ocr实现pdf转markdown☆60Updated 3 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆178Updated 2 months ago
- Using GPT to parse PDF☆84Updated 4 months ago
- Based on RapidOCR, extract the PDF content.☆140Updated 5 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆72Updated 2 months ago
- bge-large-zh api service☆20Updated last year
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆119Updated 2 months ago
- 第三方Doc2X桌面应用,支持Linux(X11,Wayland)/Windows☆33Updated 5 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆160Updated last month
- 可能是免费中最好的搜索引擎API,支持Google,Bing,DuckDuckGo,Yahoo☆84Updated last year
- ☆107Updated 5 months ago
- ☆99Updated last month
- TianMu: A modern AI tool with multi-platform support, markdown support, multimodal, continuous conversation, and customizable commands. 一…☆84Updated last year
- 😆 Generate PPT by LLM follow your template. 📢 Not only use llm to generate ppt, but also according to your favorite ppt template. Just…☆56Updated 7 months ago
- ☆69Updated last year
- 🚀 聆心智能 Emohaa情感陪伴大模型逆向API【特长:共情能力】,支持高速流式输出、多轮对话,零配置部署,多路token支持,自动清理会话痕迹,仅供测试,如需商用请前往官方开放平台。☆117Updated last month
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆48Updated this week
- 利用LLM+敏感词库,来自动判别是否涉及敏感词。☆109Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆26Updated 4 months ago
- 如需体验TextIn文档解析,请访问 https://cc.co/16YSIy☆115Updated last month
- This Python package provides a convenient and powerful interface to interact with the Dify API, enabling developers to integrate a wide r…☆27Updated this week
- Legal-Eagle-InternLM 是一个基于商汤科技和上海人工智能实验室推出的书生浦语大模型InternLM的法律问答机器人。旨在为用户提供符合3H(即Helpful、Honest、Harmless)原则的专业、智能、全面的法律服务的法律领域大模型。☆51Updated 11 months ago
- Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SO…☆55Updated last week
- AI Q&A Search Engine ➡️ 基于LangChain和SearXNG打造的开源AI搜索引擎☆125Updated 4 months ago
- 😜 表情包视觉数据集,使用glm-4v、step-1v的图像解析能力标注。☆108Updated 9 months ago
- ragflow中的ocr部分,非官方项目☆13Updated 5 months ago
- EZ-Work AI文档翻译,人人可用的开源AI文档翻译助手,可以快速低成本调用OpenAI等大语言模型api,帮助您实现txt/markdown/word/csv/excel/pdf/ppt的文档翻译。☆164Updated this week
- 中文论文、证券类、财报类PDF数据☆23Updated 7 months ago
- bisheng-unstructured library☆41Updated 2 months ago