Tele-AI / TELEVALLinks
☆20Updated 3 months ago
Alternatives and similar repositories for TELEVAL
Users that are interested in TELEVAL are comparing it to the libraries listed below
Sorting:
- faster inference☆28Updated 10 months ago
- ☆33Updated 2 months ago
- Streaming Vocos☆29Updated 5 months ago
- (WIP)long form speech generatoins☆31Updated 7 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated last year
- ☆23Updated last year
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆52Updated 2 months ago
- Streaming Text to Speech Web UI☆22Updated last year
- ☆25Updated 5 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆58Updated 2 months ago
- The official repo of BridgeVoC, which explores using the Schrödinger Bridge framework for neural vocoding.☆74Updated this week
- Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities☆70Updated 2 months ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆42Updated 8 months ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Updated 10 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆40Updated last year
- ☆81Updated 4 months ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆48Updated 2 months ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models☆20Updated this week
- Llasa Speed Up☆54Updated 5 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆26Updated 2 weeks ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆33Updated 6 months ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Updated last year
- In-car multi-channel speech transcription system of AISHELL-5.☆36Updated 5 months ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Updated last year
- Official code of SenSE.☆64Updated 3 weeks ago
- Official Repository of UltraVoice☆46Updated 3 weeks ago
- ☆58Updated last month
- ☆58Updated last month