Tera2Space / AudioAE
Simple audio AE
☆13Updated last year
Alternatives and similar repositories for AudioAE
Users who are interested in AudioAE are comparing it to the libraries listed below
- T5-based (Russian) text normalization☆24Updated last year
- ☆13Updated 3 years ago
- 🎵 muse: Music Separation☆10Updated last year
- Normalize Text in Russian☆28Updated 2 years ago
- ☆60Updated 3 weeks ago
- ☆43Updated 6 months ago
- Russian accentuator and IPA transcriber☆16Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 7 months ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆34Updated last year
- Simple IPA phonemizer based on ruaccent-encoder☆24Updated 8 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆19Updated last year
- Fast CosyVoice3 inference with TensorRT and TensorRT-LLM☆20Updated last week
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated 11 months ago
- ☆13Updated 4 years ago
- Neural model for prediction of stress position in Russian words☆11Updated 6 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 9 months ago
- ☆21Updated 6 years ago
- Use quantized versions of Whisper to speed up inference☆12Updated last year
- ☆49Updated 5 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 6 months ago
- Forced alignment decoder for Whisper.☆14Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Updated 7 months ago
- Python client for the Облако ЦРТ speech recognition and synthesis API☆11Updated 3 years ago
- ☆16Updated 8 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 7 months ago
- ☆25Updated last year
- Simple text normalizer for use before speech synthesis☆41Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆16Updated last year
- A repository for trainable multi-speaker TTS☆14Updated 4 years ago