justinjohn0306 / SpeedScribeLinks
High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback.
☆10Updated 8 months ago
Alternatives and similar repositories for SpeedScribe
Users that are interested in SpeedScribe are comparing it to the libraries listed below
Sorting:
- ☆39Updated last year
- ☆14Updated last year
- ☆13Updated 9 months ago
- ☆21Updated 10 months ago
- ☆12Updated last year
- ☆32Updated last year
- ☆23Updated 8 months ago
- ☆16Updated last year
- ☆18Updated last year
- ☆19Updated last year
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆30Updated last year
- ☆27Updated last year
- ☆19Updated 10 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated last month
- ☆22Updated last year
- ☆12Updated last year
- ☆40Updated last year
- ☆20Updated last year
- ☆37Updated last year
- ☆24Updated last year
- ☆22Updated last year
- ☆11Updated 9 months ago
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆17Updated 6 months ago
- ☆24Updated last year
- ☆14Updated last year
- ☆22Updated last year
- Generate images from an initial frame and text☆37Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- AudioLDM text to audio colab☆19Updated last year
- ☆43Updated last year