☆23Jan 29, 2026Updated last month
Alternatives and similar repositories for TELEVAL
Users that are interested in TELEVAL are comparing it to the libraries listed below
Sorting:
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- ☆11May 7, 2022Updated 3 years ago
- ☆36Sep 6, 2025Updated 5 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆36Dec 24, 2025Updated 2 months ago
- ☆23Oct 17, 2024Updated last year
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 5 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 3 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ☆11Nov 7, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- Self-supervised Generative LM-based Voice Conversion☆54Apr 24, 2025Updated 10 months ago
- Llasa Speed Up☆60Jan 18, 2026Updated last month
- ☆40Jul 15, 2025Updated 7 months ago
- ☆14Aug 16, 2023Updated 2 years ago
- ☆15Nov 11, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ☆23Dec 6, 2025Updated 2 months ago
- ☆22Jul 30, 2025Updated 6 months ago
- ☆14Jun 16, 2023Updated 2 years ago
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.☆105May 5, 2025Updated 9 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated last year
- Official Code for ParrotTTS☆58Oct 13, 2024Updated last year
- 基于vits fastspeech2 visinger的tts模型☆24Mar 9, 2023Updated 2 years ago
- The baselines of ARC-Challenge-Interspeech2026☆56Dec 1, 2025Updated 2 months ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆41Jun 12, 2025Updated 8 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 3 months ago