XJF2332 / GOT-OCR-2-GUIView external linksLinks
GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能
☆182Nov 11, 2025Updated 3 months ago
Alternatives and similar repositories for GOT-OCR-2-GUI
Users that are interested in GOT-OCR-2-GUI are comparing it to the libraries listed below
Sorting:
- 研究GOT-OCR-项目落地加速,不限语言☆62Oct 24, 2024Updated last year
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆86Sep 21, 2024Updated last year
- Using Llam.cpp and onnxruntime to accelerate inference of GOT-OCR2.0☆15Mar 6, 2025Updated 11 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆23Sep 26, 2024Updated last year
- 在win10系统上使用Nintendo Switch Pro Controller / Joycon手柄☆11Jul 23, 2019Updated 6 years ago
- Accelerating GOT-OCRv2 with VLLM☆11Nov 15, 2024Updated last year
- ☆10Oct 23, 2024Updated last year
- This project provides an AI-driven test case generator using FastAPI. The application accepts a GitHub repository name and generates test…☆19Jun 7, 2024Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆259Apr 14, 2025Updated 9 months ago
- 基于通义千问大模型的智能 OCR 识别工具☆29Mar 31, 2025Updated 10 months ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- 基于 RWKV_Role_Playing 项目接入GPT-SoVITS语音对话项目☆30Apr 8, 2024Updated last year
- 基于cnstd+cnocr作为基础,封装的一个ocr的web服务☆11Nov 21, 2021Updated 4 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆261Updated this week
- ☆15Apr 13, 2023Updated 2 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆17Oct 12, 2024Updated last year
- 生僻字OCR识别优化训练☆16Feb 16, 2023Updated 2 years ago
- 卡证和文档检测和矫正☆79Sep 18, 2024Updated last year
- ☆11Feb 6, 2026Updated last week
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆90Mar 20, 2025Updated 10 months ago
- ☆18Oct 26, 2024Updated last year
- 如何让 dify工作流的 code 节点拿到图片的信息☆31Feb 24, 2025Updated 11 months ago
- Compute benchmark of table structure recognition.☆28Dec 2, 2025Updated 2 months ago
- MaxKB4j is an open-source LLMOps platform for LLM workflow applications and RAG developed based on the Java language. The project mainly …☆35Updated this week
- Google AI Translator extension, using Google Chrome's built-in AI.☆29Sep 15, 2025Updated 4 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Sep 17, 2024Updated last year
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Jul 10, 2024Updated last year
- 基于DCT-Net的图片/视频转绘gradio界面webui☆27Jun 24, 2024Updated last year
- Gradient Boosting Models on Real-Time Sensor Data for AI-Enhanced Vehicle Predictive Maintenance. By using a web-based interface to forec…☆19Nov 17, 2024Updated last year
- Official code implementation of Slow Perception:Let's Perceive Geometric Figures Step-by-step☆159Jul 28, 2025Updated 6 months ago
- AutoDev Workbench is an AI-native developer platform designed to accelerate, automate, and contextualize modern software development work…☆75Oct 22, 2025Updated 3 months ago
- A Next.js version of Claude Aritfacts , inspired by llamacoder☆27Sep 26, 2024Updated last year
- Build Neo4J Knowledge Graphs from Excel files☆22Nov 18, 2024Updated last year
- zlai☆23Sep 29, 2024Updated last year
- 实现使用开源的LangFlow框架,零代码实现大模型相关应用如流量包推荐智能客服、RAG应用等,并使用两种方式将创建的工作流集成到自己的项目中☆31Sep 9, 2024Updated last year
- Collection of projects / apps integrated with dify service API.☆19Oct 20, 2024Updated last year
- 基于Faster-whisper和modelscope一键生成双语字幕,双语字幕生成器,基于离线大模型,Generate bilingual subtitles with one click based on Faster-whisper and modelscope. O…☆414Dec 1, 2024Updated last year
- PANTERA is an open-source software for the simulation of nonequilibrium gas and plasma flows based on the Direct Simulation Monte Carlo a…☆18Dec 1, 2025Updated 2 months ago
- Unsloth框架在Windows平台微调训练Qwen2大模型,非WSL☆62Jun 19, 2024Updated last year