LeadingIndiaAI / -IMAGE-TO-SPEECH-CONVERTOR-View external linksLinks
The aim of the project was to convert an image to speech. An image is processed and segmented to identify the text in the image. Then the characters are combined to form words and save it as a text file. This text file is converted to speech. We use two tools for the completion of image to text to speech conversion. They are OCR (Optical Charact…
☆13Sep 12, 2018Updated 7 years ago
Alternatives and similar repositories for -IMAGE-TO-SPEECH-CONVERTOR-
Users that are interested in -IMAGE-TO-SPEECH-CONVERTOR- are comparing it to the libraries listed below
Sorting:
- This project implements a simple Instagram-like login form that connects to a MongoDB database. It uses Node.js, Express, and Mongoose fo…☆19May 3, 2025Updated 9 months ago
- This was our Final Project for Distributed Computing. In this we had to create a distributed system that will use the Brute Force Algorit…☆11Aug 14, 2021Updated 4 years ago
- Electrophysiology practicals for undergraduate students☆13Mar 8, 2021Updated 4 years ago
- ☆10Apr 2, 2024Updated last year
- Modelo de Inteligencia Artificial utilizando Computer Vision para la detección y segmentacion de plantas medicinales en la ciudad de Sucr…☆10Apr 10, 2024Updated last year
- A deep learning based application which is entitled to help the visually impaired people. The application automatically generates the tex…☆12Oct 2, 2020Updated 5 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆15Feb 3, 2026Updated last week
- Google Collab Notebooks☆13Dec 14, 2024Updated last year
- Python-based geospatial analysis codes related to: GOES-16 satellite data, national land cover database (NLCD), elevation maps, building …☆13Sep 5, 2020Updated 5 years ago
- Educational Android VR game for people with Neurodevelopmental Disorders. Developed with Unity, the game adapts its difficulty and aims t…☆14Mar 18, 2018Updated 7 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- This repository contains a diverse collection of case studies and use cases commonly asked in data science interviews across different co…☆13Apr 14, 2024Updated last year
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Mar 9, 2024Updated last year
- EC499: Major Project☆10Jun 25, 2023Updated 2 years ago
- A project on Image Processing, leveraging PyQt5 for a user-friendly GUI and implementing essential operations like Low Pass Filter, Downs…☆11Nov 24, 2024Updated last year
- [arXiv 2024] PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073☆13Dec 2, 2025Updated 2 months ago
- SpinVision is a computer vision project that analyzes the motion of a cricket ball in video footage. It detects the ball's trajectory, pr…☆14Jun 12, 2025Updated 8 months ago
- COVID-19: Face Mask Detector with OpenCV, Keras/TensorFlow, and Deep Learning☆12Dec 23, 2020Updated 5 years ago
- Deep Neural Networks for audio classification☆11Apr 11, 2024Updated last year
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- ☆10Aug 13, 2020Updated 5 years ago
- ☆13Oct 7, 2019Updated 6 years ago
- ☆10May 5, 2020Updated 5 years ago
- ☆16Dec 3, 2022Updated 3 years ago
- TensorRT implementation of the waifu2x super-resolution model for faster image and video upscaling.☆18Nov 24, 2024Updated last year
- What part of a song is better at determining it's music genre - the music (audio features) or the lyrics (NLP) ?☆14Jan 2, 2023Updated 3 years ago
- ☆18Dec 17, 2024Updated last year
- Fastest CUDA RGB to grayscale: 5-30x faster than OpenCV. For image processing/computer vision.☆16Mar 23, 2021Updated 4 years ago
- StammerClipper:: :A deep learning approach for automatic stutter detection☆12Mar 27, 2022Updated 3 years ago
- ☆13Aug 23, 2019Updated 6 years ago
- This is my CS 763 Computer Vision Course Project , Here we try to label Amazon Satelite Images. Here we try to implement the Show and Tel…☆13May 10, 2018Updated 7 years ago
- AUTOMATED TYPE CLASSIFICATION OF DIABETIC RETINOPATHY AND GLAUCOMA DETECTION USING DEEP LEARNING☆17Apr 15, 2019Updated 6 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆15Mar 4, 2022Updated 3 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 3 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Jun 6, 2023Updated 2 years ago
- A flight simulator in VR!☆12Aug 13, 2018Updated 7 years ago
- MathBot is a transformer-based Math Word Problem (MWP) solver made as the Lab project for CSE 4622: Machine Learning Lab.☆13Jul 11, 2022Updated 3 years ago
- ☆14May 28, 2020Updated 5 years ago
- Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models☆17Mar 24, 2025Updated 10 months ago