prateekralhan / Speech2Text-for-Long-Audio-Files
Perform SOTA Speech2Text on Long Audio Files with/without diarization Using Google Cloud Speech API
☆14Updated 2 years ago
Alternatives and similar repositories for Speech2Text-for-Long-Audio-Files:
Users that are interested in Speech2Text-for-Long-Audio-Files are comparing it to the libraries listed below
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- Package to obtain image information for things like data collection, or searching for higher resolution images☆9Updated last year
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆91Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Minimalist Speech-to-Text toolkit for educational purposes☆12Updated last year
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆81Updated 8 months ago
- Speech Emotion Recognition☆40Updated last year
- Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset☆10Updated 10 months ago
- User-friendly implementation of a firefox based selenium client☆17Updated 7 months ago
- Creates video from TTS output and viseme images.☆11Updated 2 years ago
- Converts Youtube URLs to Text with Speech Recognition☆26Updated 2 years ago
- Web App Capable of Predicting Next Word Using BERT☆14Updated 2 years ago
- ☆35Updated 4 years ago
- Burn captions (.srt) into videos☆9Updated last year
- Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perfo…☆32Updated 2 months ago
- ☆32Updated 2 years ago
- Real-time speech to text with specific language translation.☆48Updated 4 years ago
- A minimalistic web app to generate transciption for audio built using Python☆32Updated last year
- Accent Classification in Speech☆25Updated 5 years ago
- Python ffmpeg wrapper for audio and video editing (trim, subtitles/overlay, concat, merge, & more!)☆23Updated 5 years ago
- A project about learning how to synchronize subtitles in movies using machine learning.☆9Updated 2 years ago
- We will build a Flask web app that can input any long piece of information such as a blog or news article and summarize it into just five…☆16Updated 2 years ago
- Tools to create your own voice dataset for TTS training☆66Updated 4 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- A web app built with Streamlit that summarizes input text☆13Updated 4 years ago
- This project is about performing Speaker diarization for Hindi Language.☆48Updated 3 years ago
- Spoken Language assessment☆42Updated 4 years ago
- Simple synthetic audio feature extractor☆32Updated last month