NoEdgeAI / doc2x-doc
doc2x docs
☆51 Updated 4 months ago
Alternatives and similar repositories for doc2x-doc:
Users that are interested in doc2x-doc are comparing it to the libraries listed below
- A Python wrapper for the Doc2X API that also comes with local text processing (to improve PDF recall in RAG). ☆257 Updated 2 months ago
- Converts PDF to Markdown using PaddleOCR ☆67 Updated 6 months ago
- Possibly the best free search engine API, supporting Google, Bing, DuckDuckGo, and Yahoo ☆111 Updated last year
- ☆29 Updated last month
- Using GPT to parse PDF ☆95 Updated 7 months ago
- AM (Advanced Mathematics) Chat is a large language model that integrates advanced mathematical knowledge, exercises in higher mathematics… ☆182 Updated 8 months ago
- A merged version of GraphRAG-Ollama-UI and GraphRAG4OpenWebUI (with a gradio web UI for configuring and generating RAG indexes, and fastapi for serving the RAG API) ☆105 Updated 8 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! ☆152 Updated this week
- ☆69 Updated last year
- 😜 A visual dataset of memes, annotated using the image understanding capabilities of glm-4v and step-1v. ☆119 Updated 11 months ago
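- A desktop application based on MinerU. MinerU is an open-source, high-quality PDF parsing tool built on deep learning that automatically extracts text, tables, images, formulas, and other content from PDF documents, and offers analysis, statistics, and search features. This project provides a simplified WebUI for it, so users can upload PDF files and view the extraction results in real time. ☆77 Updated 6 months ago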
- ☆63 Updated 7 months ago
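- 📝 Extracts content from document images and outputs it one-to-one to Word or Txt for further use or processing. Planned future support: PDF/image input, with output in JSON, Txt, Word, and Markdown formats. ☆191 Updated 5 months ago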
- ☆39 Updated 10 months ago
- To try textin document parsing, visit https://cc.co/16YSIy ☆91 Updated 5 months ago
- MinerU API server ☆52 Updated 4 months ago
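- As the name suggests: a hand-rolled RAG ☆121 Updated last year
- (Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training scenario to guide readers through hands-on LLM fine-tuning. ☆33 Updated 8 months ago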
- Convert different model APIs into the OpenAI API format out of the box. ☆151 Updated last year
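- 🌞 CareLlama (关怀羊驼) is a medical large language model that also brings together dozens of publicly available medical fine-tuning datasets and openly available medical large language models to accelerate the development of medical LLMs. Medical LLM, Open Source Driven for a Healthy Future. ☆39 Updated last year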
- ChatGPT WebUI using gradio. Provides a simple, easy-to-use web UI for LLM chat and retrieval-based knowledge Q&A (RAG). ☆125 Updated 8 months ago
- Analysis of Chinese and English layouts ☆201 Updated 3 weeks ago
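- MultiBot Chat is a Streamlit-based multi-bot chat application that supports multiple large language model (LLM) APIs, including OpenAI, AzureOpenAI, ChatGLM, CoZe, Qwen, Ollama, XingHuo, DeepSeek, Moonshot, Yi, and G… ☆33 Updated last month
- gpt_server is an open-source framework for production-grade deployment of LLMs or embedding models. ☆167 Updated last week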
- [CICAI 2023] The official code for "Ivygpt: Interactive chinese pathway language model in medical domain" ☆59 Updated 6 months ago
- A useful PDF translation tool based on LLMs ☆66 Updated 8 months ago
- 360LayoutAnaylsis, a series of document analysis models and datasets developed by 360 AI Research Institute ☆279 Updated 7 months ago
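- Uses an LLM plus a sensitive-word lexicon to automatically detect whether text contains sensitive words. ☆118 Updated last year
- Full-document translator: English PDF/MD papers → (PDF recognition via Doc2X) → translation (GPT deepseek ollama google deepL deepLX) → Chinese documents (Markdown/Word) ☆87 Updated 2 months ago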
- lang2openai ☆68 Updated 5 months ago