A Text-To-Speech Model Developed Using 🐸STT
☆12Jun 22, 2022Updated 3 years ago
Alternatives and similar repositories for persian-stt
Users that are interested in persian-stt are comparing it to the libraries listed below
Sorting:
- Matplotlib Image labeller for classifying images☆11Jan 5, 2026Updated 2 months ago
- ☆10Apr 24, 2024Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Dedicated to Code in Place Spring 2021 with Stanford University or to those who are interested to learn Python for the first time☆12May 16, 2025Updated 9 months ago
- Implemented Unet++ models for medical image segmentation to detect and classify colorectal polyps.☆10Sep 5, 2021Updated 4 years ago
- Email OSINT and password breach hunting. Use h8mail to find passwords through different breach and reconnaissance services, or the infamo…☆10Jun 12, 2019Updated 6 years ago
- ☆10Aug 30, 2023Updated 2 years ago
- Slick video review note taking app 🎬☆12Dec 25, 2025Updated 2 months ago
- A Persian Word2Vec Model trained by Wikipedia articles☆10Jan 5, 2018Updated 8 years ago
- VIP Machine Learning Exercises and Practices☆10Dec 24, 2019Updated 6 years ago
- Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco…☆10Oct 1, 2024Updated last year
- [IJCAI'23] Semantic-aware Generation of Multi-view Portrait Drawings (SAGE)☆10Feb 25, 2024Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- The Oxford RobotCar Facade dataset.☆11Jun 4, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Built and trained a deep neural network to classify traffic signs, using TensorFlow.☆10Jun 23, 2017Updated 8 years ago
- Undergrad Major Project. Video to Video summary of full length football match.☆11Jun 3, 2019Updated 6 years ago
- Build an OpenAI Art Generator & Gallery - JavaScript Workshop☆17Jun 8, 2023Updated 2 years ago
- Voice activity detection (VAD) library and Go bindings based on WebRTC's VAD engine☆11Mar 1, 2018Updated 8 years ago
- A simple baseline for 3d human pose estimation in PyTorch☆12Jul 25, 2024Updated last year
- A research based project which uses steganography and ML/deep learning algorithm to reconstruct the lost audio signals from a corrupted f…☆12Dec 5, 2022Updated 3 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Python based voice assistant using the ChatGPT API☆11Sep 20, 2023Updated 2 years ago
- Minimalist Speech-to-Text toolkit for educational purposes☆13Feb 1, 2024Updated 2 years ago
- Cod ar gyfer 'Macsen' - prototeip o gynorthwyydd digidol Cymraeg i'r Raspberry Pi // Code for 'Macsen' - a prototype Welsh language digit…☆11Mar 29, 2018Updated 7 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)☆11Aug 12, 2020Updated 5 years ago
- Code to Implement the Smooth Euler Characteristic Transform (SECT)☆12Oct 22, 2019Updated 6 years ago
- Open Source Crimean Tatar Text-to-Speech datasets☆14Feb 23, 2025Updated last year
- The Hidden Markov Model Toolkit (HTK)☆14Apr 21, 2017Updated 8 years ago
- Realtime pose detection in Unity Engine with NatML.☆15Aug 12, 2023Updated 2 years ago
- This repository contains Python code for an age and gender detection project using the video stream from the camera. The model is trained…☆10May 28, 2023Updated 2 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- ☆10Aug 22, 2022Updated 3 years ago
- [AAAIW 2022] DADFNet: Dual Attention and Dual Frequency-Guided Dehazing Network for Video-Empowered Intelligent Transportation☆13Jul 21, 2024Updated last year
- A helpful application that uses your camera to turn sign language into written words, making communication easier for Deaf and Hard of He…☆17Oct 4, 2023Updated 2 years ago
- ☆11May 8, 2023Updated 2 years ago
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor…☆15Feb 27, 2024Updated 2 years ago
- [VISAPP 2022] MdVRNet: Deep Video Restoration under Multiple Distortions☆12Aug 7, 2024Updated last year