XJF2332/GOT-OCR-2-GUI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XJF2332/GOT-OCR-2-GUI)

XJF2332 / GOT-OCR-2-GUI

GOT-OCR的GUI版本，提供OCR、导出PDF、批处理等功能，但不提供训练功能

☆182

Alternatives and similar repositories for GOT-OCR-2-GUI

Users that are interested in GOT-OCR-2-GUI are comparing it to the libraries listed below

Sorting:

1694439208 / GOT-OCR-Inference
View on GitHub
研究GOT-OCR-项目落地加速，不限语言
☆62Oct 24, 2024Updated last year
Ucas-HaoranWei / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,089Feb 10, 2025Updated last year
Ucas-HaoranWei / Vary-tiny-600k
View on GitHub
Vary-tiny codebase upon LAVIS （for training from scratch）and a PDF image-text pairs data (about 600k including English/Chinese)
☆86Sep 21, 2024Updated last year
MosRat / got.cpp
View on GitHub
Using Llam.cpp and onnxruntime to accelerate inference of GOT-OCR2.0
☆15Mar 6, 2025Updated last year
ElvisClaros / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆23Sep 26, 2024Updated last year
newlxj / DeepSeek-OCR-Web-UI
View on GitHub
DeepSeek OCR WebUI for fast standalone operation
☆167Oct 31, 2025Updated 4 months ago
v3ucn / IMAGdressing_WebUi_For_Windows
View on GitHub
IMAGdressing在Windows环境下运行的webui界面
☆22Jul 25, 2024Updated last year
lihaochen910 / Use_Switch_Gamepad_On_Win10
View on GitHub
在win10系统上使用Nintendo Switch Pro Controller / Joycon手柄
☆11Jul 23, 2019Updated 6 years ago
frankchieng / ComfyUI_llm_easyanimiate
View on GitHub
easyanimate generete videos with ExLlamaV2 quantization LLM prompt
☆13Jun 26, 2024Updated last year
sugarforever / openai-swarm-tutorials
View on GitHub
☆10Oct 23, 2024Updated last year
LingyvKong / OneChart
View on GitHub
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
☆260Apr 14, 2025Updated 10 months ago
cronrpc / Audio-Speaker-Needle-In-Haystack
View on GitHub
Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。
☆13Jun 17, 2024Updated last year
v3ucn / RWKV_Role_Playing_with_GPT-SoVITS
View on GitHub
基于 RWKV_Role_Playing 项目接入GPT-SoVITS语音对话项目
☆30Apr 8, 2024Updated last year
cnhnkj / hn_ocr
View on GitHub
基于cnstd+cnocr作为基础，封装的一个ocr的web服务
☆11Nov 21, 2021Updated 4 years ago
1005568045 / kettle-scheduler-boot
View on GitHub
☆15Jun 21, 2022Updated 3 years ago
verarong / invoice_ocr
View on GitHub
maskrcnn分割、angle旋转方向、AdvanceEast定位文本框、dense识别，含整个后处理工程，serving部署
☆13May 12, 2021Updated 4 years ago
RapidAI / RapidLayout
View on GitHub
Analysis of Chinese and English layouts 中英文版面分析
☆268Feb 25, 2026Updated last week
jack139 / ocr-rare-chars
View on GitHub
生僻字OCR识别优化训练
☆16Feb 16, 2023Updated 3 years ago
QIN2DIM / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆17Oct 12, 2024Updated last year
sumukshashidhar / yourbench
View on GitHub
Benchmark Large Language Models Reliably On Your Data
☆18Dec 27, 2025Updated 2 months ago
EDDChang / TSRFormer-Transformer-Based-Two-stage-Refinement-for-Single-Image-Shadow-Removal
View on GitHub
☆15Apr 13, 2023Updated 2 years ago
astordu / agent_from_scratch
View on GitHub
从零构建了Agent中最重要的功能-function call
☆18Oct 16, 2024Updated last year
BADBADBADBOY / CardDetectRotate
View on GitHub
卡证和文档检测和矫正
☆80Sep 18, 2024Updated last year
liunian-Jay / MU-GOT
View on GitHub
PDF Parsing Tool: GOT's vLLM acceleration implementation, MinerU for layout recognition, and GOT for table formula parsing.
☆65Nov 7, 2024Updated last year
v3ucn / YiJianBaPu
View on GitHub
一键扒谱
☆18Nov 15, 2023Updated 2 years ago
rkuo2000 / GenAI
View on GitHub
☆11Feb 25, 2026Updated last week
v3ucn / visual_chatgpt_mps_cut
View on GitHub
微软开源的可视化ChatGPT改造为苹果M系列芯片基于MPS版本，减少内存占用
☆15Mar 12, 2023Updated 2 years ago
sheryc / arxiv-markdown-parser-plugin
View on GitHub
Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.
☆90Mar 20, 2025Updated 11 months ago
Samge0 / test2neo4j
View on GitHub
Text2Neo4j 是一个遍历文档、从文本中提取关系并将其保存到 Neo4j 数据库中以形成知识图谱的工具。本项目结合了 Dify 和 LLaMA3.1（8B 模型）来高效处理和提取复杂关系。
☆24Aug 31, 2024Updated last year
Manni1000 / OmniGen
View on GitHub
☆18Oct 26, 2024Updated last year
brightwang / dify_code_node_get_image
View on GitHub
如何让 dify工作流的 code 节点拿到图片的信息
☆31Feb 24, 2025Updated last year
PirateforFreedom / luann
View on GitHub
Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)
☆24Feb 9, 2026Updated 3 weeks ago
Theigrams / g1
View on GitHub
g1: Using GPT-4o to create o1-like reasoning chains
☆20Sep 17, 2024Updated last year
taishan666 / MaxKB4j
View on GitHub
MaxKB4j is an open-source LLMOps platform for LLM workflow applications and RAG developed based on the Java language. The project mainly …
☆37Feb 28, 2026Updated last week
RUC-NLPIR / FlashRAG-Paddle
View on GitHub
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
☆22Dec 11, 2024Updated last year
intsig-textin / textin-ocr-frontend
View on GitHub
☆30May 9, 2025Updated 9 months ago
v3ucn / ASR_TOOLS_SenseVoice_WebUI
View on GitHub
Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型
☆184Jul 10, 2024Updated last year
andyzsf / EC-Spider
View on GitHub
Unofficial Python APIs for Crawling, concluding Tmall, JD, Taobao etc.非官方爬虫抓取信息接口，天猫京东优酷等，供python爬虫使用
☆20Jun 10, 2015Updated 10 years ago
v3ucn / DCT-Net_Webui
View on GitHub
基于DCT-Net的图片/视频转绘gradio界面webui
☆27Jun 24, 2024Updated last year