Containerised version of tesseract v4 tools required for training a new font
☆13Feb 11, 2022Updated 4 years ago
Alternatives and similar repositories for tesseract-trainer
Users that are interested in tesseract-trainer are comparing it to the libraries listed below
Sorting:
- A lightweight data processing framework built on DuckDB and 3FS.☆21Mar 2, 2025Updated last year
- Extensible DL-based automatic Arabic diacritization tool allowing the restoration of different types of diacritics.☆21Jul 25, 2023Updated 2 years ago
- ☆11Oct 25, 2021Updated 4 years ago
- Stream IPTV to Discord☆11Sep 10, 2023Updated 2 years ago
- Botticelli is an open-source .NET Core framework for building universal chatbots. It enables seamless integration with databases, queue b…☆13Feb 25, 2026Updated last week
- Automate the KYC Process using OCR (Implemented from scratch)☆12May 23, 2020Updated 5 years ago
- ASP.NET Core Application provides APIs to detect and recognition face.☆12Mar 14, 2021Updated 4 years ago
- Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)☆13May 1, 2025Updated 10 months ago
- The Google Assistant on a rotary phone☆11May 1, 2021Updated 4 years ago
- end-to-end automated video generation pipeline designed to create engaging, TikTok-style viral short videos using AI.☆20Jun 7, 2025Updated 8 months ago
- Train Tesseract LSTM with make on Windows☆10Dec 24, 2023Updated 2 years ago
- This repository has a tool and an API for Saudi CERT alerts. Its goal is to help improve the level of cybersecurity awareness in Saudi Ar…☆13Nov 16, 2023Updated 2 years ago
- Undetectable fanfiction.net Downloader with Parallel Downloading☆10Oct 23, 2021Updated 4 years ago
- Digital Humanities project analyzing the language of fanfiction, pulled from the top 3.5k works on Archive of Our Own. Data scraped using…☆18Jul 2, 2021Updated 4 years ago
- Manage BMW iDrive backups locally. Create backups that iDrive will restore. Tidy up your music collection.☆15Sep 26, 2025Updated 5 months ago
- A graphical representation of relations between programming languages, technologies and skills in demand, based on tens of thousands of j…☆13Nov 25, 2023Updated 2 years ago
- This GitHub repository contains a web-based Facial Recognition Attendance System built with Python language and Streamlit framework. The …☆13Jan 26, 2024Updated 2 years ago
- Newsdata.io Official Python Client☆14Jan 14, 2026Updated last month
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- FaceSystem项目在会议场景中支持人脸识别的会议签到系统,实现了基本的会议管理功能,参会人信息可以预先通过人脸信息进行录入,录入成功后,参会人即可进行人脸识别签到。☆11Mar 4, 2023Updated 3 years ago
- yamaha wx-10 api☆14Nov 25, 2018Updated 7 years ago
- Information System for Media monitoring and analysis system Project under ПЦФ BR05236839☆10Oct 14, 2023Updated 2 years ago
- Official Repository of the Deep Diacritization Paper☆17Dec 16, 2020Updated 5 years ago
- Data generation, model training and inference for Visual Font Recognition using PyTorch☆17Dec 5, 2023Updated 2 years ago
- Extracting-Data-from-PDFs-with-Local-LLM☆16Nov 1, 2024Updated last year
- mono mkbundle unpacker and file replace tool.☆13Jul 7, 2015Updated 10 years ago
- Automatic Arabic diacritics restoration tool.☆18Aug 12, 2021Updated 4 years ago
- ☆11Sep 5, 2025Updated 5 months ago
- Altostratus Sample for MSDN Magazine articles☆15Jun 28, 2016Updated 9 years ago
- Unified-Multimodal Transformer Pipeline for Political Content Creation: TikTok Reel Generator (Highlight detection + visually tracked ver…☆16May 15, 2023Updated 2 years ago
- Coinlen is a cryptocurrency exchange tracking system. ♡ This project was built in Dumaguete City, Negros Oriental , Philippines. ♡☆14Jan 30, 2021Updated 5 years ago
- A Java toolkit to generate multi fonts Arabic text images☆11Sep 2, 2021Updated 4 years ago
- Train and finutune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated 11 months ago
- The Andy Timeline: An AI assistant's origin story, told week by week☆24Updated this week
- Converters for various file formats used for representing OCR☆12Apr 30, 2025Updated 10 months ago
- Credit Card PAN scanner.☆17Jun 12, 2024Updated last year
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆59Oct 6, 2025Updated 4 months ago
- CoWorks is a unified compositional serverless microservices framework over AWS, Flask and Airflow technologies.☆17Dec 18, 2025Updated 2 months ago
- Tesseract4 finetuned traineddata for Central Kurdish/Sorani☆11Apr 18, 2020Updated 5 years ago