pohanchi/AALBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pohanchi/AALBERT)

pohanchi / AALBERT

The official repository for Audio ALBERT

☆68

Alternatives and similar repositories for AALBERT

Users that are interested in AALBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

grtzsohalf / buy_vs_rent_and_invest
View on GitHub
☆15Sep 9, 2021Updated 4 years ago
yistLin / human-evaluation
View on GitHub
☆23Aug 21, 2020Updated 5 years ago
tzuhsien / Voice-conversion-evaluation
View on GitHub
An evaluation toolkit for voice conversion models.
☆42Jul 11, 2021Updated 5 years ago
yistLin / universal-vocoder
View on GitHub
A PyTorch implementation of the universal neural vocoder
☆68Nov 6, 2020Updated 5 years ago
d223302 / albert-embryology
View on GitHub
☆13Oct 28, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yistLin / beamertheme-yeast
View on GitHub
Yeast, a lite and light beamer theme
☆18Dec 6, 2020Updated 5 years ago
howard1337 / S2VC
View on GitHub
☆100Jul 22, 2021Updated 5 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
pohanchi / huggingface_albert
View on GitHub
hugginface albert model and its tokenizer
☆15Mar 12, 2020Updated 6 years ago
yistLin / FragmentVC
View on GitHub
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
☆204Nov 30, 2020Updated 5 years ago
Alexander-H-Liu / NPC
View on GitHub
Non-Autoregressive Predictive Coding
☆51Nov 3, 2020Updated 5 years ago
ga642381 / FlappyBird
View on GitHub
Super Flappy Bird in p5.js
☆10Mar 8, 2021Updated 5 years ago
ga642381 / SpeechPrompt
View on GitHub
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102Apr 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vectominist / spin
View on GitHub
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…
☆65May 19, 2023Updated 3 years ago
Splend1d / T5lephone
View on GitHub
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Nov 29, 2022Updated 3 years ago
BogiHsu / WG-WaveNet
View on GitHub
Real-Time High-Fidelity Speech Synthesis without GPU
☆73Jul 29, 2024Updated last year
JSALT-2022-SSL / superb-prosody
View on GitHub
☆31Jul 13, 2023Updated 3 years ago
yistLin / dvector
View on GitHub
Speaker embedding (d-vector) trained with GE2E loss
☆289Jan 8, 2024Updated 2 years ago
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
cyhuang-tw / AutoVC
View on GitHub
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆34Apr 26, 2021Updated 5 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lingjzhu / spoken_sent_embedding
View on GitHub
Unsupervised spoken sentence embeddings
☆14Dec 14, 2022Updated 3 years ago
MiuLab / TaylorGAN
View on GitHub
☆31Apr 24, 2021Updated 5 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
voidful / asrp
View on GitHub
ASR text preprocessing utility
☆21Aug 5, 2024Updated last year
andi611 / ZeroSpeech-TTS-without-T
View on GitHub
A Pytorch implementation for the ZeroSpeech 2019 challenge.
☆112Nov 12, 2019Updated 6 years ago
KimythAnly / qqdm
View on GitHub
A lightweight, fast and pretty progress bar for Python
☆47Dec 19, 2025Updated 7 months ago
ga642381 / Taiwanese-Translation
View on GitHub
Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus
☆13Oct 15, 2022Updated 3 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
Wendison / FCL-taco2
View on GitHub
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
☆41Jul 17, 2021Updated 5 years ago
sky1456723 / Pytorch-MBNet
View on GitHub
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆62Sep 24, 2021Updated 4 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557Mar 12, 2026Updated 4 months ago
chaitanya100100 / Relative-Attributes-Zero-Shot-Learning
View on GitHub
Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning
☆22Jun 14, 2018Updated 8 years ago