DunnBC22 / Vision_Audio_and_Multimodal_ProjectsLinks
This repository includes all computer vision, audio, document AI, and multimodal projects.
☆49Updated last year
Alternatives and similar repositories for Vision_Audio_and_Multimodal_Projects
Users that are interested in Vision_Audio_and_Multimodal_Projects are comparing it to the libraries listed below
Sorting:
- An end-to-end signature verification system to extract, clean and verify signatures in documents. Signatures are detected using YOLOv5. N…☆190Updated last year
- ☆40Updated last year
- Working codes for project☆23Updated 2 years ago
- Machine Learning Training Utilities (for TensorFlow and PyTorch)☆248Updated 7 months ago
- Computer Vision Projects☆175Updated last year
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated last year
- ☆28Updated 3 years ago
- Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).☆53Updated 3 years ago
- Streamlit YOLOv5 deployment template☆28Updated 3 months ago
- This is all my notebooks, lab solutions, and assignments for the DeepLearning.AI Natural Language Processing Specialization on Coursera.☆48Updated 3 years ago
- ☆30Updated 2 years ago
- ☆44Updated 3 years ago
- A project where the license plate number is extracted from image of a vehicle using Object detection and Character recognition techniques…☆103Updated 4 years ago
- This project focuses on fine-tuning a BERT model for question answering using a limited dataset for illustration purposes.☆30Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆224Updated 9 months ago
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆29Updated 4 years ago
- This repo consists of the code as discussed in the Medium blog.☆16Updated 2 years ago
- ☆67Updated 2 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆59Updated last year
- A Great Collection of Deep Learning Tutorials and Repositories☆321Updated this week
- An SDK for Transformers + YOLO and other SSD family models☆64Updated 9 months ago
- MLOPs human pose estimation end-to-end.☆37Updated last year
- ☆76Updated 2 years ago
- ☆27Updated 3 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago
- This repository demonstrates the data preparation and fine-tuning the IDEFICS Vision Language Model.☆25Updated last year
- In this project we utilize OpenCV t in order to identify the license number plates and the python pytesseract for the characters and digi…☆86Updated 4 months ago
- A collection for AI Engineer☆41Updated 3 months ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆26Updated 2 years ago
- ☆32Updated 3 years ago