prateekralhan / Speech2Text-for-Long-Audio-Files
Perform SOTA Speech2Text on Long Audio Files with/without diarization Using Google Cloud Speech API
☆14 Updated 2 years ago
Alternatives and similar repositories for Speech2Text-for-Long-Audio-Files:
Users who are interested in Speech2Text-for-Long-Audio-Files are comparing it to the libraries listed below
- Live transcription with OpenAI Whisper ☆50 Updated 2 years ago
- Converts YouTube URLs to Text with Speech Recognition ☆26 Updated 2 years ago
- generates transcript for video from link ☆88 Updated last year
- A simple streamlit based webapp to process text and correct punctuation built using "fullstop-punctuation-multilang-large" Model from Hug… ☆11 Updated last year
- Example python scripts to evaluate various ASR methods ☆12 Updated 3 years ago
- Transcription and diarization (speaker identification) ☆31 Updated last year
- Real-time speech to text with specific language translation. ☆48 Updated 4 years ago
- Powered by OpenAI Whisper & Gradio ☆30 Updated 2 years ago
- Scraping YouTube Video Description and Video Likes and Comments and Times and Replies! It's Automatically Extracting Data from Video. ☆24 Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆145 Updated 9 months ago
- EC499: Major Project ☆9 Updated last year
- ☆32 Updated 2 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 Updated last year
- Python project for Speech-to-Text and Sentiment Analysis. Supports English and German language! ☆17 Updated 3 years ago
- Extractive automatic multi-document news article summarization ☆16 Updated 6 years ago
- Python ffmpeg wrapper for audio and video editing (trim, subtitles/overlay, concat, merge, & more!) ☆23 Updated 5 years ago
- This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) mode… ☆23 Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models ☆66 Updated 2 years ago
- Summarize text content into a Tweet-sized statement using OpenAI's GPT-3 based Davinci model ☆24 Updated last year
- A simple machine learning package to cluster keywords in higher-level groups. ☆16 Updated 2 years ago
- Contains colab files for making audio and video with deep fakes ☆53 Updated 4 years ago
- Creates video from TTS output and viseme images. ☆11 Updated 2 years ago
- Downloads and clips videos from youtube, rumble, bitchute (using yt-dlp) and clips the video using ffmpeg. ☆21 Updated this week
- Video Audio Translation Tool - automatically subtitles and dubs videos ☆14 Updated 4 years ago
- ☆15 Updated 3 years ago
- Package to obtain image information for things like data collection, or searching for higher resolution images ☆9 Updated last year
- This repository consists of work done to analyse sentiment of a customer in a conversation with a call center agent using various machine… ☆84 Updated 5 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2 ☆119 Updated last year
- Spoken Language assessment ☆42 Updated 4 years ago
- Speaker Identification using Neural Net. ☆19 Updated 6 months ago