GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能
☆182Nov 11, 2025Updated 3 months ago
Alternatives and similar repositories for GOT-OCR-2-GUI
Users that are interested in GOT-OCR-2-GUI are comparing it to the libraries listed below
Sorting:
- 研究GOT-OCR-项目落地加速,不限语言☆62Oct 24, 2024Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,089Feb 10, 2025Updated last year
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆86Sep 21, 2024Updated last year
- Using Llam.cpp and onnxruntime to accelerate inference of GOT-OCR2.0☆15Mar 6, 2025Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆23Sep 26, 2024Updated last year
- DeepSeek OCR WebUI for fast standalone operation☆167Oct 31, 2025Updated 4 months ago
- IMAGdressing在Windows环境下运行的webui界面☆22Jul 25, 2024Updated last year
- 在win10系统上使用Nintendo Switch Pro Controller / Joycon手柄☆11Jul 23, 2019Updated 6 years ago
- easyanimate generete videos with ExLlamaV2 quantization LLM prompt☆13Jun 26, 2024Updated last year
- ☆10Oct 23, 2024Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆260Apr 14, 2025Updated 10 months ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- 基于 RWKV_Role_Playing 项目接入GPT-SoVITS语音对话项目☆30Apr 8, 2024Updated last year
- 基于cnstd+cnocr作为基础,封装的一个ocr的web服务☆11Nov 21, 2021Updated 4 years ago
- ☆15Jun 21, 2022Updated 3 years ago
- maskrcnn分割、angle旋转方向、AdvanceEast定位文本框、dense识别,含整个后处理工程,serving部署☆13May 12, 2021Updated 4 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆268Feb 25, 2026Updated last week
- 生僻字OCR识别优化训练☆16Feb 16, 2023Updated 3 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆17Oct 12, 2024Updated last year
- Benchmark Large Language Models Reliably On Your Data☆18Dec 27, 2025Updated 2 months ago
- ☆15Apr 13, 2023Updated 2 years ago
- 从零构建了Agent中最重要的功能-function call☆18Oct 16, 2024Updated last year
- 卡证和文档检测和矫正☆80Sep 18, 2024Updated last year
- PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.☆65Nov 7, 2024Updated last year
- 一键扒谱☆18Nov 15, 2023Updated 2 years ago
- ☆11Feb 25, 2026Updated last week
- 微软开源的可视化ChatGPT改造为苹果M系列芯片基于MPS版本,减少内存占用☆15Mar 12, 2023Updated 2 years ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆90Mar 20, 2025Updated 11 months ago
- Text2Neo4j 是一个遍历文档、从文本中提取关系并将其保存到 Neo4j 数据库中以形成知识图谱的工具。本项目结合了 Dify 和 LLaMA3.1(8B 模型)来高效处理和提取复杂关系。☆24Aug 31, 2024Updated last year
- ☆18Oct 26, 2024Updated last year
- 如何让 dify工作流的 code 节点拿到图片的信息☆31Feb 24, 2025Updated last year
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆24Feb 9, 2026Updated 3 weeks ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Sep 17, 2024Updated last year
- MaxKB4j is an open-source LLMOps platform for LLM workflow applications and RAG developed based on the Java language. The project mainly …☆37Feb 28, 2026Updated last week
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆22Dec 11, 2024Updated last year
- ☆30May 9, 2025Updated 9 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Jul 10, 2024Updated last year
- Unofficial Python APIs for Crawling, concluding Tmall, JD, Taobao etc.非官方爬虫抓取信息接口,天猫京东优酷等,供python爬虫使用☆20Jun 10, 2015Updated 10 years ago
- 基于DCT-Net的图片/视频转绘gradio界面webui☆27Jun 24, 2024Updated last year