prateekralhan / Speech2Text-for-Long-Audio-Files
Perform SOTA Speech2Text on Long Audio Files with/without diarization Using Google Cloud Speech API
☆14Updated 2 years ago
Alternatives and similar repositories for Speech2Text-for-Long-Audio-Files:
Users that are interested in Speech2Text-for-Long-Audio-Files are comparing it to the libraries listed below
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- generates transcript for video from link☆87Updated last year
- Burn captions (.srt) into videos☆9Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Converts Youtube URLs to Text with Speech Recognition☆24Updated 2 years ago
- Speech Emotion Detection using SVM, Decision Tree, Random Forest, MLP, CNN with different architectures☆33Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆144Updated 8 months ago
- A minimalistic web app to generate transciption for audio built using Python☆30Updated last year
- ☆32Updated 2 years ago
- Package to obtain image information for things like data collection, or searching for higher resolution images☆9Updated 11 months ago
- 📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!☆19Updated 2 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆10Updated 3 weeks ago
- ☆35Updated 4 years ago
- Scraping YouTube Video Description and Video Likes and Comments and Times and Replies! It's Automatically Extracting Data from Video.☆23Updated 3 years ago
- A deep learning model is developed which can predict the native country on the basis of the spoken english accent☆47Updated 4 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆91Updated last year
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most s…☆18Updated 5 years ago
- Open source project to integrate AI and create automated videos for YouTube☆15Updated 2 years ago
- Tools to create your own voice dataset for TTS training☆64Updated 4 years ago
- This project is aimed at obtaining highlights from the full match video, without using computer vision and NLP.☆28Updated last year
- FlaskGPT is a minimal ChatGPT clone that uses Python, Flask, langchain and Chroma with realtime token output using SSE.☆11Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Copy the voice of anyone☆50Updated 7 years ago
- A collection of YouTube videos transcripts : Podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT l…☆78Updated last month
- Pybot can change the way learners try to learn python programming language in a more interactive way. This chatbot will try to solve or p…☆89Updated 4 years ago
- Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)☆30Updated this week
- A simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 …☆67Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- ☆11Updated 2 years ago