The aim of the project is to convert an image to speech. The image is processed and segmented to identify the text it contains. The recognized characters are then combined into words and saved as a text file, which is converted to speech. Two tools are used for the image-to-text-to-speech conversion: OCR (Optical Charact…
☆13 · Sep 12, 2018 · Updated 7 years ago
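
The step in which recognized characters are combined into words and saved as a text file could look roughly like the sketch below. It is only an illustration under stated assumptions, not the repository's code: the description is truncated, so the OCR and speech tools are not modeled, and `CharBox`, `group_into_words`, `save_words`, and the `max_gap` threshold are hypothetical names. It assumes each recognized character on a single text line comes with the horizontal extent of its bounding box.

```python
from dataclasses import dataclass
from pathlib import Path
import tempfile


@dataclass
class CharBox:
    """Hypothetical container for one recognized character on a text line."""
    char: str
    left: int   # x-coordinate of the left edge, in pixels
    right: int  # x-coordinate of the right edge, in pixels


def group_into_words(boxes, max_gap=5):
    """Join characters into words, starting a new word at each wide gap.

    max_gap is an assumed threshold in pixels; a real pipeline would
    probably derive it from the character size.
    """
    words = []
    current = []
    prev_right = None
    for box in sorted(boxes, key=lambda b: b.left):
        # A horizontal gap wider than max_gap marks a word boundary.
        if prev_right is not None and box.left - prev_right > max_gap:
            words.append("".join(current))
            current = []
        current.append(box.char)
        prev_right = box.right
    if current:
        words.append("".join(current))
    return words


def save_words(words, path):
    """Write the words to a UTF-8 text file, separated by single spaces."""
    Path(path).write_text(" ".join(words) + "\n", encoding="utf-8")


if __name__ == "__main__":
    line = [
        CharBox("H", 0, 8), CharBox("i", 10, 12),
        CharBox("y", 30, 38), CharBox("o", 40, 48), CharBox("u", 50, 58),
    ]
    out_path = Path(tempfile.gettempdir()) / "recognized.txt"
    save_words(group_into_words(line), out_path)
    print(out_path.read_text(encoding="utf-8"))
```

The resulting text file would then be handed to whichever text-to-speech tool the project uses.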
Alternatives and similar repositories for -IMAGE-TO-SPEECH-CONVERTOR-
Users that are interested in -IMAGE-TO-SPEECH-CONVERTOR- are comparing it to the libraries listed below.
- This project implements a simple Instagram-like login form that connects to a MongoDB database. It uses Node.js, Express, and Mongoose fo… ☆19 · Feb 22, 2026 · Updated 2 months ago
- PyTorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T… ☆12 · Apr 29, 2026 · Updated last week
- Generating captions for images using CNN & LSTM on the Flickr8K dataset. The generation of captions from images has various practical benefits… ☆10 · Mar 25, 2021 · Updated 5 years ago
- Python-based geospatial analysis codes related to: GOES-16 satellite data, national land cover database (NLCD), elevation maps, building … ☆13 · Sep 5, 2020 · Updated 5 years ago
- A project on Image Processing, leveraging PyQt5 for a user-friendly GUI and implementing essential operations like Low Pass Filter, Downs… ☆12 · Nov 24, 2024 · Updated last year
- Benchmarking Multidomain English-Indonesian Machine Translation ☆16 · Dec 19, 2020 · Updated 5 years ago
- TEI2S is a project which is really helpful for the visually impaired, in the sense that it takes an image containing text embedding as the … ☆15 · Nov 24, 2019 · Updated 6 years ago
- SMS where Students can apply for scholarships, Signatory can list their scholarship and validate the student applications, Admin can vali… ☆10 · Apr 14, 2021 · Updated 5 years ago
- A Python GUI Quiz Application built using Tkinter and Open Trivia DB ☆26 · Feb 22, 2023 · Updated 3 years ago
- ☆24 · Apr 6, 2024 · Updated 2 years ago
- A deep-learning-based application intended to help visually impaired people. The application automatically generates the tex… ☆12 · Oct 2, 2020 · Updated 5 years ago
- This was our Final Project for Distributed Computing. In this we had to create a distributed system that will use the Brute Force Algorit… ☆11 · Aug 14, 2021 · Updated 4 years ago
- This project was developed by me & my friend Veer for our Second Year Python Project. It uses Tkinter Module for GUI and MySQLDb for… ☆22 · Aug 16, 2019 · Updated 6 years ago
- Detect and locate lung nodules from CT images with deep learning ☆23 · Apr 23, 2024 · Updated 2 years ago
- TensorFlow implementation of image style transfer ☆23 · Jul 13, 2017 · Updated 8 years ago
- ☆10 · May 5, 2020 · Updated 6 years ago
- Educational Android VR game for people with Neurodevelopmental Disorders. Developed with Unity, the game adapts its difficulty and aims t… ☆14 · Mar 18, 2018 · Updated 8 years ago
- A free ASCII image filter API ☆22 · Mar 1, 2022 · Updated 4 years ago
- Fastest CUDA RGB to grayscale: 5-30x faster than OpenCV. For image processing/computer vision. ☆16 · Mar 23, 2021 · Updated 5 years ago
- Lung nodule detection using YOLO ☆17 · Dec 23, 2017 · Updated 8 years ago
- ☆13 · Oct 7, 2019 · Updated 6 years ago
- Artificial intelligence model using computer vision for the detection and segmentation of medicinal plants in the city of Sucr… ☆11 · Apr 10, 2024 · Updated 2 years ago
- TensorRT implementation of the waifu2x super-resolution model for faster image and video upscaling. ☆18 · Nov 24, 2024 · Updated last year
- COVID-19: Face Mask Detector with OpenCV, Keras/TensorFlow, and Deep Learning ☆12 · Dec 23, 2020 · Updated 5 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆10 · Nov 5, 2020 · Updated 5 years ago
- ☆11 · Aug 23, 2019 · Updated 6 years ago
- An Android Operating System based Emergency SOS app. ☆32 · Dec 21, 2017 · Updated 8 years ago
- ☆10 · Apr 2, 2024 · Updated 2 years ago
- Electrophysiology practicals for undergraduate students ☆13 · Mar 8, 2021 · Updated 5 years ago
- A repo to do interpretability of pre-trained acoustic models ☆15 · Oct 15, 2023 · Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆15 · Jun 6, 2023 · Updated 2 years ago
- Deep Neural Networks for audio classification ☆11 · Apr 11, 2024 · Updated 2 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course. ☆16 · Apr 1, 2026 · Updated last month
- Google Colab Notebooks ☆14 · Dec 14, 2024 · Updated last year
- AUTOMATED TYPE CLASSIFICATION OF DIABETIC RETINOPATHY AND GLAUCOMA DETECTION USING DEEP LEARNING ☆17 · Apr 15, 2019 · Updated 7 years ago
- PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073 ☆15 · Dec 2, 2025 · Updated 5 months ago
- A flight simulator in VR! ☆13 · Aug 13, 2018 · Updated 7 years ago
- EC499: Major Project ☆11 · Jun 25, 2023 · Updated 2 years ago
- This is my CS 763 Computer Vision course project. Here we try to label Amazon satellite images. Here we try to implement the Show and Tel… ☆12 · May 10, 2018 · Updated 7 years ago