fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆24Updated last year
Alternatives and similar repositories for whisper-app:
Users that are interested in whisper-app are comparing it to the libraries listed below
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆63Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆59Updated 6 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆54Updated 8 months ago
- Whisper2Summarize is an application that uses Whisper for audio processing and GPT for summarization. It generates summaries of audio tra…☆51Updated 2 years ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆35Updated 8 months ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆45Updated 8 months ago
- Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cutting…☆49Updated last month
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆65Updated 3 weeks ago
- WhisperX Repository Modified to run on Mac☆14Updated last year
- ☆36Updated last year
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆37Updated last year
- ☆36Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆91Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- canvas-based talking head model using viseme data☆31Updated last year
- ☆63Updated last month
- A gradio interface for making transcribed and translated subtitles for videos☆39Updated 2 months ago
- Adds a web API to RVC to infer via json requests☆23Updated 9 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆35Updated 2 years ago
- A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Spe…☆35Updated 2 months ago
- ☆68Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆44Updated 4 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆107Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆56Updated 5 months ago
- Translated vocal synthesis - Clone a voice and output speech in another language☆25Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆66Updated last year
- This project presents a comprehensive study on video dubbing techniques and the development of a specialized video dubbing system.☆11Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Updated 7 months ago
- An experimental proof-of-concept script to automatically dub videos to English with the help of local TTS, voice cloning, audio separatio…☆13Updated 11 months ago