unilight / sheet
Speech Human Evaluation Estimation Toolkit (SHEET)
☆33Updated this week
Related projects ⓘ
Alternatives and complementary repositories for sheet
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated 9 months ago
- ☆47Updated 4 months ago
- ☆27Updated last year
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆82Updated last month
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 2 weeks ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆44Updated last week
- Source code of APNet2, a vocoder☆51Updated 11 months ago
- ☆21Updated 5 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆22Updated last year
- ☆22Updated 7 months ago
- ☆30Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆62Updated last month
- Implementation of SpatialCodec.☆51Updated last year
- ☆46Updated last week
- ☆40Updated 3 weeks ago
- ☆18Updated 2 months ago
- ☆34Updated 4 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆16Updated last year
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆29Updated 11 months ago
- ☆41Updated 3 weeks ago
- The open source code for SimpleSpeech series☆108Updated last month
- PAM is a no-reference audio quality metric for audio generation tasks☆48Updated 3 months ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆78Updated 4 months ago
- ☆53Updated 10 months ago
- ☆48Updated this week
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆17Updated last month
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆60Updated last week
- Unofficial implementation of NANSY++ in Pytorch Lightning☆48Updated 8 months ago
- ARCH: Audio Representations benCHmark☆36Updated 2 months ago