Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This project only focused on variants of vanilla Transformer (Conformer) and Feature Extraction (CNN-based approach).
☆10Dec 27, 2021Updated 4 years ago
Alternatives and similar repositories for conformer_ocr
Users that are interested in conformer_ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Website nhận diện và trích xuất thông tin từ Chứng Minh Nhân Dân☆11Oct 6, 2022Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆11Nov 29, 2021Updated 4 years ago
- Zalo Text-To-Speech for python☆11May 10, 2021Updated 4 years ago
- python scripts for crawling original image from Google Images☆24May 5, 2022Updated 3 years ago
- Noise2Noise: Learning Image Restoration without Clean Data☆14Feb 22, 2020Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆46Jun 11, 2024Updated last year
- ☆24Nov 21, 2023Updated 2 years ago
- In this repo, I use encoder, decoder with attention mechanism to auto-correct output of vietnamese ocr model☆30Oct 12, 2021Updated 4 years ago
- Spell check for Arabic text using python☆14Mar 22, 2019Updated 7 years ago
- Simplified implementation for Domain Seperation Networks☆13Feb 11, 2023Updated 3 years ago
- Scripts to finetune the official implementation of OpenAI's Whisper model☆24Jul 6, 2025Updated 8 months ago
- Co:here-powered Slack App Starter Project☆13Apr 1, 2022Updated 3 years ago
- About MIMBCD-UI Project☆14Dec 4, 2025Updated 3 months ago
- Everyday Arabic-English Scene Text dataset☆16Oct 14, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Pytorch Implementation of Chargrid Paper (https://arxiv.org/abs/1809.08799)☆27Mar 11, 2022Updated 4 years ago
- A tool for translating text from source grammar to target grammar (context-free) with corresponding dictionary.☆21Apr 7, 2022Updated 3 years ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- ☆44Apr 29, 2022Updated 3 years ago
- Unofficial implementation of 3D Instance Segmentation via Multi-Task Metric Learning (MTML)☆15Jul 5, 2020Updated 5 years ago
- This is a modified version of Ankush's code for generating synthetic text images which support right-to-left languages such as Persian a…☆20Jul 3, 2019Updated 6 years ago
- Lecture Video Summarization by Extracting Handwritten Content from Whiteboards☆21Aug 22, 2019Updated 6 years ago
- A python package made to generate sequences (greedy and beam-search) from Pytorch (not necessarily HF transformers) models.☆18Dec 12, 2025Updated 3 months ago
- 一个快速实验你的idea的脚手架项目,同时也欢迎大家补充新鲜技术进来。☆19Jun 3, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Playground of Metric Learning with MNIST @pytorch. We provide ArcFace, CosFace, SphereFace, CircleLoss and visualization.☆24Jun 11, 2021Updated 4 years ago
- [AVI 2020] UTA4: Medical Imaging DICOM files dataset.☆17Jan 4, 2021Updated 5 years ago
- PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)☆313Apr 9, 2024Updated last year
- PyTorch Implementation of Temporal Shift Module for Jester☆13Nov 22, 2022Updated 3 years ago
- Web 应用开发的模板,后端 SpringBoot,前端 vue(Web app development template. Backend is SpringBoot. Frontend is vue)☆18Dec 10, 2022Updated 3 years ago
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.☆17Aug 29, 2024Updated last year
- The OWCA dataset is a polish translated dataset of instructions for fine-tuning the Alpaca model made by Stanford .☆21May 18, 2023Updated 2 years ago
- A tool using Keras models which is implementation of YOLOv4 (Tensorflow backend) for detection and VietOCR for recognizion.☆20Oct 3, 2023Updated 2 years ago
- ☆22Jun 15, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆18Mar 30, 2023Updated 2 years ago
- RASA based voice bot after 1 months jump in to AI ;)☆29Sep 3, 2019Updated 6 years ago
- 网易云音乐周杰伦Jay Chou歌曲评论情感分析,有趣的歌曲分类小实验☆14Aug 12, 2019Updated 6 years ago
- Testing BIGAN (Adversarial Feature Learning) for State Representation Learning☆18Mar 29, 2018Updated 7 years ago
- Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining☆353Nov 29, 2023Updated 2 years ago
- "It does not do to dwell on dreams and forget to live."― J.K. Rowling☆42Jun 5, 2021Updated 4 years ago
- Application for Math formula detection in image/pdf and then recognition☆12Jan 14, 2025Updated last year