The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.
☆25Apr 14, 2022Updated 3 years ago
Alternatives and similar repositories for FMFCC-A
Users that are interested in FMFCC-A are comparing it to the libraries listed below
Sorting:
- Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150…☆83Oct 21, 2021Updated 4 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆16Feb 1, 2026Updated last month
- Time-domain synthetic speech detection net (TSSDNet), having the classic ResNet and Inception Net style structures (Res-TSSDNet and Inc-T…☆69Oct 27, 2021Updated 4 years ago
- Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"☆135Aug 30, 2024Updated last year
- ASVspoof 2021 Baseline Systems☆243Jun 6, 2024Updated last year
- CFAD: A Chinese Dataset for Fake Audio Detection☆23Jul 3, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆11May 7, 2022Updated 3 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 11 months ago
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- ☆13Mar 11, 2025Updated 11 months ago
- ☆17Jan 20, 2025Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆17Jul 31, 2025Updated 7 months ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"☆266Jun 25, 2023Updated 2 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- his code is a pytorch version for CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"☆15Jan 14, 2022Updated 4 years ago
- TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking☆21Apr 18, 2025Updated 10 months ago
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- ☆15Aug 22, 2025Updated 6 months ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 7 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…☆16Nov 6, 2025Updated 4 months ago
- ☆19Jan 8, 2025Updated last year
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- 一个第三方的泠鸢yousa歌声数据集☆17Nov 28, 2023Updated 2 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year