xjchenGit / SingGraph
Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).
☆22Updated last month
Related projects ⓘ
Alternatives and complementary repositories for SingGraph
- ☆47Updated last week
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 3 weeks ago
- ☆34Updated 5 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated 9 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆39Updated last week
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- This repository presents a subset of our proposed FSD dataset for song deepfake detection.☆19Updated 2 months ago
- ☆20Updated 10 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆38Updated last month
- Spherical residual vector quantization (SRVQ)☆26Updated 2 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆37Updated last month
- ☆29Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks☆49Updated 4 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆22Updated 7 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 7 months ago
- ☆15Updated 4 months ago
- A neural speech codec based on discrete WavLM representations☆21Updated 2 months ago
- ☆42Updated last month
- ☆19Updated 2 months ago
- ☆59Updated last year
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆22Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆82Updated 2 months ago
- ☆40Updated 5 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆49Updated this week
- ☆45Updated last month
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆11Updated this week
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆29Updated 11 months ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated last year