fizamusthafa / whisper-app
This repository contains a web application for multi-lingual transcription using OpenAI's Whisper Automatic Speech Recognition (ASR) model. Users can upload audio files in WAV, MP3, or M4A formats and get transcriptions in various languages. The application is designed with accessibility and data privacy in mind.
☆20Updated last year
Alternatives and similar repositories for whisper-app:
Users that are interested in whisper-app are comparing it to the libraries listed below
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆84Updated last week
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆47Updated 5 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆123Updated this week
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆54Updated 3 months ago
- Burn captions (.srt) into videos☆9Updated last year
- A gradio interface for making transcribed and translated subtitles for videos☆33Updated last year
- This project use the Meta NLLB-200 translation model through the Hugging Face transformers library.☆61Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆42Updated last year
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper☆62Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆174Updated 3 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- web based editor for subtitles and transcripts☆119Updated 5 months ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... Fast!!☆28Updated this week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆83Updated 8 months ago
- Takes a youtube video, clones the voice and re-creates that video in a different language☆101Updated 10 months ago
- A simple script to prepare dataset for training with TTS Tortoise model via https://git.ecker.tech/mrq/ai-voice-cloning☆12Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆19Updated 3 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆57Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆61Updated 7 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆18Updated 3 months ago
- Simulates talk with an AI that can express emotions☆43Updated 5 months ago
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player☆132Updated 2 months ago
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆47Updated 5 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆33Updated last week
- Speak (speech-to-text) to LLMs (Ollama) in any lanaguage - Streamlit app☆41Updated 10 months ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆25Updated 5 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆45Updated 2 years ago
- Tools for making LJSpeech datasets☆22Updated 11 months ago