shinjiwlab / versa

☆51

Related projects ⓘ

Alternatives and complementary repositories for versa

soumimaiti / speechlmscore_tool
☆27Updated last year
unilight / sheet
Speech Human Evaluation Estimation Toolkit (SHEET)
☆39Updated last week
xinjli / alqalign
multilingual speech aligner
☆72Updated last year
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆27Updated last year
haoxiangsnr / llm-tse
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
☆32Updated last year
haiciyang / LaDiffCodec
☆47Updated last week
Aria-K-Alethia / BigCodec
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
☆82Updated 2 months ago
ftshijt / Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32Updated 9 months ago
Takaaki-Saeki / DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
☆109Updated 9 months ago
sky1456723 / Pytorch-MBNet
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆61Updated 3 years ago
unilight / LDNet
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
☆61Updated 2 years ago
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆59Updated last year
kan-bayashi / LibriTTSLabel
Alignment files of LibriTTS.
☆60Updated 4 years ago
yzyouzhang / Audio_Research_in_US
For students who would like to apply for RA, PhD, postdoc in audio research.
☆24Updated 3 weeks ago
kamperh / vqwordseg
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆35Updated 8 months ago
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆71Updated last year
JasonSWFu / VQscore
☆36Updated 5 months ago
ga642381 / AudioCodec-Hub
AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models
☆22Updated last year
ftshijt / speech_evaluation
A toolkit dedicate for speech evaluation.
☆18Updated last month
shinjiwlab / cmu_multilingual_speech
CMU multilingual speech repository
☆31Updated 2 years ago
Alexander-H-Liu / dinosr
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
☆47Updated 10 months ago
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆63Updated 2 months ago
NTIA / alignnet
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆11Updated last month
3loi / NaturalVoices
☆47Updated 2 weeks ago
Dapwner / CVAE-Tacotron
☆23Updated 5 months ago
fgnt / mms_msg
Multipurpose Multi Speaker Mixture Signal Generator
☆43Updated last month
cheoljun95 / sdhubert
☆17Updated this week
cpdu / unicats
☆62Updated 10 months ago
BUTSpeechFIT / EEND_dataprep
☆49Updated 6 months ago