suraneti / real-time-speech-translator
Real-time speech to text with specific language translation.
☆48 Updated 4 years ago
Alternatives and similar repositories for real-time-speech-translator
Users that are interested in real-time-speech-translator are comparing it to the libraries listed below
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio… ☆14 Updated 4 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆168 Updated 10 months ago
- Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa. ☆34 Updated 3 years ago
- canvas-based talking head model using viseme data ☆31 Updated last year
- Speech Emotion Detection using SVM, Decision Tree, Random Forest, MLP, CNN with different architectures ☆35 Updated last year
- Predict the speaker's gender from an audio file (Flask API included) ☆20 Updated 2 years ago
- AI-generated talking head video of fake people responding to your input question text. ☆68 Updated 4 years ago
- Deep Learning model for lexical stress detection in spoken English ☆29 Updated 5 years ago
- Real-time human emotion detection and analysis through voice and speech pattern processing ☆27 Updated 6 years ago
- an improved version of Real-time-voice-cloning ☆50 Updated last year
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice. ☆98 Updated 2 years ago
- The application allows users to record speech, transcribe it using the Whisper ASR (Automatic Speech Recognition) model, translate the tr… ☆15 Updated last year
- Streamlit app to visualize and edit TTS datasets ☆14 Updated 3 years ago
- Identify the emotion of multiple speakers in an Audio Segment ☆171 Updated 2 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit… ☆169 Updated 4 years ago
- Speaker Identification System (up to 100% accuracy); built using Python 2.7 and python_speech_features library ☆208 Updated 4 years ago
- Voice clone application in flask, forked version of CorentinJ Voice Cloning ☆21 Updated 4 years ago
- end-to-end voicebot that answers open domain questions. ☆10 Updated 3 years ago
- This repository contains all the code used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ☆26 Updated 5 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 Updated 2 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models. ☆35 Updated 2 years ago
- Spoken Language assessment ☆43 Updated 4 years ago
- Simple real-time Sound Event Detector based on YAMNet and pyaudio. ☆23 Updated 5 years ago
- Speech Emotion Recognition ☆40 Updated last year
- A live walkthrough of leveraging real time speech to text with Watson STT. ☆56 Updated 3 years ago
- Creates video from TTS output and viseme images. ☆11 Updated 2 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio ☆13 Updated 2 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model. ☆36 Updated last year
- Predicting various emotions in human speech signals by detecting different speech components affected by human emotion. ☆46 Updated 9 months ago
- Computer Vision model to detect a face in the first frame of a video and to continue tracking it in the rest of the video. This is implemen… ☆29 Updated 7 years ago