tony10101105 / HEAR-2021-NeurIPS-Challenge---NTU-GURALinks
☆13Updated 3 years ago
Alternatives and similar repositories for HEAR-2021-NeurIPS-Challenge---NTU-GURA
Users that are interested in HEAR-2021-NeurIPS-Challenge---NTU-GURA are comparing it to the libraries listed below
Sorting:
- ☆32Updated 2 years ago
- ARCH: Audio Representations benCHmark☆46Updated 11 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆57Updated last month
- ☆24Updated last year
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆53Updated last month
- ☆25Updated 8 months ago
- ☆32Updated 8 months ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Updated last year
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Updated 2 years ago
- Official implementation of MelHuBERT☆66Updated 9 months ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- ☆44Updated last year
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- experiments about AudioSet☆44Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated 2 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Updated 4 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Updated 2 years ago
- ☆56Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆37Updated last year
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆65Updated 3 years ago
- ☆37Updated 3 years ago
- ☆25Updated 2 years ago
- ☆61Updated 2 years ago
- ☆17Updated 3 years ago
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆81Updated last year