DunnBC22 / Vision_Audio_and_Multimodal_ProjectsLinks
This repository includes all computer vision, audio, document AI, and multimodal projects.
☆44Updated last year
Alternatives and similar repositories for Vision_Audio_and_Multimodal_Projects
Users that are interested in Vision_Audio_and_Multimodal_Projects are comparing it to the libraries listed below
Sorting:
- An end-to-end signature verification system to extract, clean and verify signatures in documents. Signatures are detected using YOLOv5. N…☆180Updated last year
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆29Updated 4 years ago
- ☆41Updated last year
- Working codes for project☆23Updated last year
- Machine Learning Training Utilities (for TensorFlow and PyTorch)☆244Updated 3 months ago
- ☆12Updated last year
- Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).☆53Updated 2 years ago
- IAM dataset☆62Updated 2 years ago
- Key information extraction from invoice document with Graph Convolution Network☆56Updated 2 years ago
- ☆75Updated 2 years ago
- My personal implementation of SVTR model for handwritten OCR☆13Updated last year
- ☆20Updated 3 years ago
- YOLO for custom object detection and passing the detected objects to Tesseract☆62Updated last year
- ☆250Updated last year
- Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch☆174Updated 9 months ago
- Handwritten text recognition using transformers.☆158Updated 11 months ago
- Transformer OCR for Indian Languages☆11Updated last year
- Computer Vision Projects☆169Updated last year
- A collection for AI Engineer☆40Updated last week
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forcasting Challenge☆31Updated 2 years ago
- Proceed with text detection only in the selected area of the image☆223Updated last year
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆55Updated 9 months ago
- This is all my notebooks, lab solutions, and assignments for the DeepLearning.AI Natural Language Processing Specialization on Coursera.☆47Updated 2 years ago
- TensorFlow: Advanced Techniques Course material on cousera this repository is for learning purpose.☆61Updated 3 years ago
- A project where the license plate number is extracted from image of a vehicle using Object detection and Character recognition techniques…☆99Updated 4 years ago
- TableNet Implementation on Pytorch☆148Updated 2 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆80Updated 2 years ago
- Object Counting with the newest yolov7☆119Updated 2 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated 11 months ago
- Object Detection Web App Using YOLOv7 and Flask☆56Updated 2 years ago