Ashigarg123 / ShiftySpeechView external linksLinks
☆14Jul 24, 2025Updated 6 months ago
Alternatives and similar repositories for ShiftySpeech
Users that are interested in ShiftySpeech are comparing it to the libraries listed below
Sorting:
- text to speech☆10Mar 19, 2024Updated last year
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- SSL Layerwise analysis for speech deepfake detection☆32Aug 5, 2025Updated 6 months ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- ☆36Oct 15, 2024Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- The pytorch implementation of BAM for Partialspoof Audio Localization.☆28Aug 16, 2024Updated last year
- [T-IFS'24] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations☆30Jul 31, 2024Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- ☆11Nov 7, 2024Updated last year
- ☆10Apr 17, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- Self-supervised Generative LM-based Voice Conversion☆54Apr 24, 2025Updated 9 months ago
- ☆103Nov 14, 2025Updated 3 months ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Sep 6, 2023Updated 2 years ago
- [ICLR 2025] SONICS: Synthetic Or Not - Identifying Counterfeit Songs☆43May 23, 2025Updated 8 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Feb 5, 2025Updated last year
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 4 months ago
- DysfluentWFST☆17Nov 13, 2025Updated 3 months ago
- ☆14Aug 1, 2025Updated 6 months ago
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆16Jul 23, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 5 months ago
- ☆15Nov 10, 2025Updated 3 months ago
- Baselines for IS25 Source Tracing Special Session☆33Jan 3, 2025Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- ☆15Jun 22, 2025Updated 7 months ago
- Implementation of Google's USM speech model in Pytorch☆34Feb 7, 2026Updated last week
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- ☆14Aug 19, 2024Updated last year