BBC-Esq / WhisperS2T-transcriber
Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files
☆17 · Updated last month
Related projects
Alternatives and complementary repositories for WhisperS2T-transcriber
- Real-time end-to-end singing voice conversion ☆18 · Updated last week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆39 · Updated 2 weeks ago
- Text-to-Music Generation with Rectified Flow Transformer ☆45 · Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆45 · Updated this week
- ☆19 · Updated 2 weeks ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆23 · Updated last year
- AudioLDM text to audio colab ☆19 · Updated last year
- ☆9 · Updated last month
- ☆26 · Updated 10 months ago
- ☆22 · Updated 10 months ago
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning ☆12 · Updated this week
- Heteronym to Phoneme Parser ☆15 · Updated last year
- VALL-E 2 reproduction ☆83 · Updated 3 months ago
- Easy tool that splits given audio based on speaker. ☆11 · Updated 10 months ago
- Text-To-Speech for NotebookLM ☆16 · Updated last week
- Simple PyTorch Denoisers for Waveform Audio ☆32 · Updated 3 weeks ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆24 · Updated last year
- ☆87 · Updated 6 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆38 · Updated last month
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆26 · Updated this week
- Production-ready vocoder using BigVSAN ☆11 · Updated 8 months ago
- Create an LJSpeech structured voice dataset on wave input ☆19 · Updated last month
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆81 · Updated last month
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆39 · Updated 4 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger. ☆32 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 · Updated last year
- DeepFloyd IF web UI ☆29 · Updated last year
- A simple framework for using a local Koboldcpp LLM to help with story-writing ☆18 · Updated 11 months ago
- On-device speaker diarization powered by deep learning ☆25 · Updated last month