☆31 · Jun 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for EmoGator
Users who are interested in EmoGator are comparing it to the libraries listed below
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech (☆11 · Jun 30, 2023 · Updated 2 years ago)
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes (☆11 · Oct 19, 2023 · Updated 2 years ago)
- Hume AI ML Competitions (☆27 · Oct 28, 2022 · Updated 3 years ago)
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach (☆96 · Apr 5, 2024 · Updated last year)
- Code for the winning solution in the SE&R 2022 Challenge - SER track. (☆16 · Mar 28, 2023 · Updated 2 years ago)
- Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin… (☆32 · Updated this week)
- Dataset and baseline code for the VocalSound dataset (ICASSP2022). (☆161 · Nov 12, 2022 · Updated 3 years ago)
- ☆41 · Jan 13, 2022 · Updated 4 years ago
- A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks. (☆23 · Updated this week)
- 60k hours of phoneme-aligned audio from audio books (☆19 · Jul 27, 2024 · Updated last year)
- Baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift (☆18 · Mar 16, 2024 · Updated last year)
- PyTorch implementation of BigVSAN (☆203 · Dec 9, 2025 · Updated 2 months ago)
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one (☆26 · Aug 5, 2024 · Updated last year)
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders. (☆63 · Sep 8, 2025 · Updated 5 months ago)
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation (☆28 · Mar 10, 2024 · Updated last year)
- SSL Layerwise analysis for speech deepfake detection (☆32 · Aug 5, 2025 · Updated 7 months ago)
- A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK (☆61 · Sep 24, 2021 · Updated 4 years ago)
- A differentiable version of SPTK (☆193 · Feb 26, 2026 · Updated last week)
- A toolset for easy formant extraction and visualization from wav files and TTS models (☆33 · Sep 2, 2022 · Updated 3 years ago)
- This is the GitHub page for publicly available emotional speech data. (☆381 · Jan 6, 2022 · Updated 4 years ago)
- The official implementation of TokenSynth (ICASSP 2025) (☆79 · Oct 27, 2025 · Updated 4 months ago)
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective". (☆64 · Nov 5, 2025 · Updated 4 months ago)
- Chinese polyphone disambiguation for Text-to-Speech application (☆42 · Jun 11, 2024 · Updated last year)
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … (☆33 · Apr 22, 2024 · Updated last year)
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge. (☆32 · Jan 26, 2024 · Updated 2 years ago)
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",… (☆80 · May 29, 2023 · Updated 2 years ago)
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" (☆134 · Nov 29, 2023 · Updated 2 years ago)
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems (☆39 · Nov 1, 2023 · Updated 2 years ago)
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation … (☆92 · Dec 28, 2024 · Updated last year)
- ☆12 · Jul 10, 2018 · Updated 7 years ago
- VS Code Extension for Multipass (☆10 · Sep 25, 2024 · Updated last year)
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" (☆108 · Jan 17, 2025 · Updated last year)
- Zero-shot realtime TTS system, fully offline, free and open source (☆51 · Apr 18, 2025 · Updated 10 months ago)
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit… (☆83 · Jan 7, 2023 · Updated 3 years ago)
- ☆88 · Nov 1, 2022 · Updated 3 years ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions (☆84 · Oct 11, 2024 · Updated last year)
- The electronic Holy Quran browser Elforkane (☆11 · Nov 14, 2021 · Updated 4 years ago)
- A tool to easily reimport waves in a Wwise project under P4V (☆11 · Dec 18, 2024 · Updated last year)
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer (☆38 · Feb 17, 2025 · Updated last year)