fizamusthafa / whisper-appLinks
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆32Updated 3 months ago
Alternatives and similar repositories for whisper-app
Users that are interested in whisper-app are comparing it to the libraries listed below
Sorting:
- Input a YouTube video link or upload a video file and get a video with subtitles.☆124Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆68Updated 2 years ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆41Updated 11 months ago
- streaming speech to text server using Whisper☆101Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆249Updated 5 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- Takes a youtube video, clones the voice and re-creates that video in a different language☆110Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆74Updated last year
- Voice models for Mimic 3 text to speech system☆161Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆168Updated 2 weeks ago
- The program for automatic dubbing any video file for a lot of languages.☆86Updated 2 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Voice cloning AI (deepfake for voice). Using cloned voice from only 5-10 seconds of targeted voice.☆70Updated 4 years ago
- A gradio interface for making transcribed and translated subtitles for videos☆42Updated 11 months ago
- Talking head video AI generator☆82Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, a…☆163Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- ☆75Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆353Updated 6 months ago
- An automatic speech recognition API☆78Updated last week
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆87Updated 2 years ago
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability☆44Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Play.ht's Text to Speech API☆94Updated 5 months ago