Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。
☆125Jan 26, 2026Updated last month
Alternatives and similar repositories for imgocr
Users that are interested in imgocr are comparing it to the libraries listed below
Sorting:
- 一个微博毒舌AI,疯狂 diss 微博博主☆15Jan 2, 2025Updated last year
- auto push daily news with ai☆13Updated this week
- A pipeline focused on the in-painting of text in images. For example the removal of subtitles in a screenshot of a movie.☆16Jun 30, 2022Updated 3 years ago
- 🚀 本代码仓致力于分享AI领域的核心知识,涵盖了AI-Agent、RAG(Retrieval-Augmented Generation)、GraphRAG、大模型、大模型微调以及多模态等多个热点话题。这里将提供丰富的代码实例、理论解析和实战技巧,帮助你更好地理解和应用人工…☆12Jul 28, 2024Updated last year
- AI Manga Editor capable of text recognition, translation, inpainting and editing.☆21Mar 25, 2025Updated 11 months ago
- 【今日头条】文本作者身份识别比赛☆10Aug 20, 2018Updated 7 years ago
- ☆10Jul 13, 2024Updated last year
- Automatic development for retrieval augmented generation system☆10Feb 2, 2025Updated last year
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…☆13Jan 5, 2025Updated last year
- only contain face detect 、5/81 points 、face recognization models☆11Jul 9, 2020Updated 5 years ago
- It is deep recommendation model with attribute-level co-attention, which has been accepted as a short paper in SIGIR2020.☆10Aug 13, 2020Updated 5 years ago
- AI大模型的基本开发框架,适合普通后端程序员,功能类似coze包括:fastapi后端接口,搜索,文档解析和向量化,RPA和爬虫,自定义agent,对接第三方数据接口,mongodb数据库,控制json返回,多模态理解和生成等等☆13Jul 18, 2024Updated last year
- 百度UIE抽取模型torch版训练预测框架☆12Nov 20, 2024Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆49Mar 22, 2025Updated 11 months ago
- 本demo使用ultralytics-YOLO8对水印位置进行模型训练&检测,然后使用IOPaint移除检测到的水印。☆37Oct 23, 2024Updated last year
- A serverless RSS feed for HuggingFace's Daily Papers☆20Sep 18, 2025Updated 5 months ago
- Describe images with local LLMs (Ollama)☆18Jan 7, 2026Updated 2 months ago
- Demo of building a flower image search using GNES Flow API☆14Mar 24, 2023Updated 2 years ago
- ☆12Jan 25, 2023Updated 3 years ago
- API server for VibeVoice☆27Sep 28, 2025Updated 5 months ago
- Code for "RUBIK:A Structured Benchmark for Image Matching across Geometric Challenges", CVPR 2025☆25Jun 15, 2025Updated 8 months ago
- A WebUI script that deduplicates images or clusters them by tags or WD14. 一个用于图像查重和基于tags或者WD14提取的特征进行聚类的WebUI脚本☆12Aug 8, 2023Updated 2 years ago
- Diffusers Image Fill v3 -- Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+…☆16Nov 9, 2024Updated last year
- This repo contains VPR models that have been fine-tuned for indoor usage.☆16May 15, 2024Updated last year
- 文档方向分类☆222Feb 3, 2026Updated last month
- This repository is about an APP to help lawyers to process law documents and suit cases using AI Agents trained with OpenAI and others LL…☆18Aug 14, 2023Updated 2 years ago
- Unified Alfred Web Search☆19Oct 20, 2024Updated last year
- A Unified Perspective-to-Equirectangular Visual Place Recognition Framework☆19Dec 19, 2025Updated 2 months ago
- In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Pro…☆11Aug 23, 2021Updated 4 years ago
- 使用labelImg对水印位置进行标注,ultralytics-YOLO8对水印位置进行模型训练&检测。☆20Feb 11, 2026Updated 3 weeks ago
- ☆11Updated this week
- Datasets for long-term visual localization with sequential images in large-scale spaces☆19Apr 20, 2023Updated 2 years ago
- ☆19Sep 19, 2024Updated last year
- SuperPoint features in endoscopy☆18Mar 20, 2023Updated 2 years ago
- Rotation equivariance meets local feature matching☆18Oct 20, 2022Updated 3 years ago
- Raw dataset combining precise GNSS and IMU information in a live railway application☆19Sep 13, 2023Updated 2 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated 11 months ago
- open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains☆116Jan 2, 2025Updated last year
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆24Feb 9, 2026Updated last month