grtzsohalf/SpeechNet-codebase

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/grtzsohalf/SpeechNet-codebase)

grtzsohalf / SpeechNet-codebase

☆21

Alternatives and similar repositories for SpeechNet-codebase

Users that are interested in SpeechNet-codebase are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

grtzsohalf / buy_vs_rent_and_invest
View on GitHub
☆15Sep 9, 2021Updated 4 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
ga642381 / Taiwanese-Speech-Synthesis
View on GitHub
Taiwanese Speech Synthesis with Tacotron2
☆26Oct 2, 2022Updated 3 years ago
howard1337 / S2VC
View on GitHub
☆100Jul 22, 2021Updated 4 years ago
cyhuang-tw / AutoVC
View on GitHub
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆34Apr 26, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
pohanchi / huggingface_albert
View on GitHub
hugginface albert model and its tokenizer
☆15Mar 12, 2020Updated 6 years ago
v-manhlt3 / Disentangle-VAE-for-VC
View on GitHub
☆23Dec 10, 2024Updated last year
TeaPoly / CE-OptimizedLoss
View on GitHub
Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…
☆25Oct 11, 2024Updated last year
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago
ga642381 / AudioCodec-Hub
View on GitHub
AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models
☆25Sep 26, 2023Updated 2 years ago
nwpuaslp / kws_mia
View on GitHub
☆11Apr 20, 2020Updated 6 years ago
yistLin / universal-vocoder
View on GitHub
A PyTorch implementation of the universal neural vocoder
☆68Nov 6, 2020Updated 5 years ago
cylin-cmlab / GCT-Prediction
View on GitHub
This is the official supplementary document for the GCT data and its prediction task.
☆10Feb 19, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
bajibabu / GlottGAN
View on GitHub
This repository contains the files used for our Interspeech 2017 paper.
☆16May 30, 2017Updated 9 years ago
ga642381 / Spoken-Dialogue-Model-Survey
View on GitHub
A survey of spoken dialogue models (SDMs) with speech input and speech output. Focus on their Intermediate Representation and Generation …
☆30Mar 24, 2026Updated 3 months ago
shashankshirol / GeneratingNoisySpeechData
View on GitHub
A repository comprising of code for generation of noisy speech data from clean data using deep learning methods
☆16Jul 12, 2021Updated 5 years ago
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
SolomidHero / real-time-voice-conversion
View on GitHub
Toolbox for easy and qualitative one-shot voice conversion
☆48Dec 5, 2021Updated 4 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
Top34051 / stargan-zsvc
View on GitHub
Unofficial PyTorch Implementation of StarGAN-ZSVC
☆14Aug 5, 2021Updated 4 years ago
NeelayS / speech_spike_signatures
View on GitHub
Spiking neural networks (SNNs) for speech classification
☆12Mar 14, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
tihbe / python-ebdataset
View on GitHub
An event based dataset loader under one common python API.
☆10Mar 22, 2022Updated 4 years ago
deepspike / tandem_learning
View on GitHub
The source code for the paper entitled "A Tandem Learning Rule for Effective Training and Rapid Inference of Deep Spiking Neural Networks…
☆14Jul 4, 2021Updated 5 years ago
xrenaa / Retriever
View on GitHub
[ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"
☆54Oct 19, 2022Updated 3 years ago
exeex / maps-dataset
View on GitHub
MAPS ( MIDI Aligned Piano Sounds ) dataset python api for machine learning
☆11Jun 26, 2018Updated 8 years ago
Delver-of-Squeakrets / LISNN
View on GitHub
Code for the model presented in the paper "LISNN: Improving Spiking Neural Networks with Lateral Interactions for Robust Object Recogniti…
☆13Nov 20, 2020Updated 5 years ago
BogiHsu / WG-WaveNet
View on GitHub
Real-Time High-Fidelity Speech Synthesis without GPU
☆73Jul 29, 2024Updated last year
pvili / SpikingTimeDependentPlasticity
View on GitHub
The code to simulate spiking neural networks as used in the paper "Spiking Time-Dependent Plasticity Leads to Efficient Coding of Predict…
☆10Nov 24, 2019Updated 6 years ago
voidful / FTA
View on GitHub
Technical Analysis on Cryptocurrency
☆25Oct 14, 2025Updated 9 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
bajibabu / postfilt_gan
View on GitHub
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16Jun 27, 2018Updated 8 years ago
seongsikpark / SNN-neural-coding
View on GitHub
Deep SNNs with various neural coding methods (rate, phase, burst, TTFS)
☆12Feb 15, 2022Updated 4 years ago
Shellbye / hanzi2pinyin
View on GitHub
C++版本的汉字转拼音 Transfer chinese character to pinyin
☆14Aug 31, 2018Updated 7 years ago
sky1456723 / Pytorch-MBNet
View on GitHub
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆62Sep 24, 2021Updated 4 years ago
zhai-lw / L3AC
View on GitHub
A lightweight audio codec based on a single quantizer
☆35Sep 4, 2025Updated 10 months ago
huckiyang / awesome-neural-reprogramming-prompting
View on GitHub
A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022
☆40Nov 30, 2023Updated 2 years ago
thuhcsi / FlatTN
View on GitHub
Chinese Text Normalization and Dataset
☆91May 14, 2022Updated 4 years ago