prateekralhan / Speech2Text-for-Long-Audio-Files
Perform SOTA Speech2Text on Long Audio Files with/without diarization Using Google Cloud Speech API
☆14Updated 3 years ago
Alternatives and similar repositories for Speech2Text-for-Long-Audio-Files:
Users that are interested in Speech2Text-for-Long-Audio-Files are comparing it to the libraries listed below
- Package to obtain image information for things like data collection, or searching for higher resolution images☆9Updated last year
- Converts Youtube URLs to Text with Speech Recognition☆27Updated 2 years ago
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug…☆11Updated last year
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Downloads and clips videos from youtube, rumble, bitchute (using yt-dlp) and clips the video using ffmpeg.☆22Updated last month
- Scraping YouTube Video Description and Video Likes and Comments and Times and Replies! It's Automatically Extracting Data from Video.☆24Updated 4 years ago
- Python project for Speech-to-Text and Sentiment Analysis. Supports English and German language!☆17Updated 3 years ago
- A deep learning model is developed which can predict the native country on the basis of the spoken english accent☆47Updated 5 years ago
- A Streamlit app to extract keywords using KeyBert☆36Updated 3 years ago
- This Python script is used to scrape all the video links from a youtube channel.☆54Updated 9 months ago
- ☆31Updated 2 years ago
- A minimalistic web app to generate transciption for audio built using Python☆33Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 8 months ago
- A simple machine learning package to cluster keywords in higher-level groups.☆17Updated 2 years ago
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆47Updated 8 months ago
- ☆23Updated 9 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Python ffmpeg wrapper for audio and video editing (trim, subtitles/overlay, concat, merge, & more!)☆23Updated 5 years ago
- Summarize text content into a Tweet-sized statement using OpenAI's GPT-3 based Davinci model☆23Updated last year
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- Minute Meeting Bot☆18Updated 2 years ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆19Updated last week
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆28Updated 4 years ago
- ☆23Updated 4 years ago
- Transcription and diarization (speaker identification)☆34Updated last year
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input☆28Updated 6 months ago
- Creates video from TTS output and viseme images.☆11Updated 2 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆60Updated 3 years ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆60Updated 7 years ago