BTawaifi / Audio-DeSilencer
An audio processing tool for detecting and removing silence in audio recordings. Create text files for video silence removal using custom-defined thresholds.
☆23 · Updated 2 weeks ago
Alternatives and similar repositories for Audio-DeSilencer
Users who are interested in Audio-DeSilencer are comparing it to the repositories listed below.
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Updated 7 months ago
- ☆39 · Updated last year
- Generate transcriptions and subtitles using OpenAI Whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models… ☆18 · Updated 2 years ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆25 · Updated last week
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for … ☆12 · Updated 8 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆62 · Updated 6 months ago
- Misc. tools/scripts that I made to use for tortoise ☆21 · Updated 9 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place. ☆72 · Updated 11 months ago
- Web-based editor for subtitles and transcripts ☆133 · Updated 9 months ago
- ☆16 · Updated last year
- ☆54 · Updated last year
- A lightweight end-to-end text-to-speech model ☆115 · Updated 3 months ago
- Site for sharing Bark voices ☆51 · Updated 2 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆61 · Updated 8 months ago
- Face Swap ☆12 · Updated 2 years ago
- A lightweight, efficient variation of the StyleTTS 2 text-to-speech model. ☆19 · Updated 2 weeks ago
- ☆27 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆95 · Updated last year
- ☆83 · Updated 11 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild ☆59 · Updated last year
- ☆40 · Updated last year
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview. ☆11 · Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 · Updated last year
- An improved version of Real-time-voice-cloning ☆50 · Updated last year
- ☆38 · Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆26 · Updated this week
- A minimalistic automatic speech recognition Streamlit-based webapp powered by OpenAI's Whisper "State of the Art" models ☆66 · Updated 2 years ago
- ☆14 · Updated 6 months ago
- SeamlessM4t-Translator: Utilizing the powerful Seamless M4t Facebook model in the backend, this project facilitates seamless translation … ☆12 · Updated last year
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp ☆20 · Updated last month