mct10 / CoBERT

Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning

☆47

Alternatives and similar repositories for CoBERT

Users who are interested in CoBERT are comparing it to the repositories listed below.

mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74 Updated 2 years ago
Hertin / WavPrompt
☆37 Updated 3 years ago
ga642381 / SpeechGen
《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》
☆74 Updated 2 years ago
ga642381 / SpeechPrompt-v2
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》 Speech processing with the prompting paradigm
☆81 Updated last year
nervjack2 / MelHuBERT
Official implementation of MelHuBERT
☆66 Updated 9 months ago
sungnyun / ARMHuBERT
(Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT
☆40 Updated 11 months ago
Sreyan88 / LipGER
Code for the Interspeech 2024 paper "LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition"
☆17 Updated last year
vectominist / spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…
☆56 Updated 2 years ago
ddlBoJack / MT4SSL
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…
☆44 Updated last year
0nutation / SLMTokBench
SLMTokBench for the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"
☆37 Updated last year
JSALT-2022-SSL / superb-prosody
☆32 Updated 2 years ago
mutiann / speech_rankings
A CSRankings-like index for speech researchers
☆34 Updated 9 months ago
AbrahamSanders / codec-bpe
Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs
☆66 Updated 2 weeks ago
WangHelin1997 / SpecAugment-plus
A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"
☆34 Updated 4 years ago
ga642381 / SpeechPrompt
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102 Updated 3 months ago
xinjli / alqalign
Multilingual speech aligner
☆75 Updated last year
karthikbhamidipati / multi-task-speech-classification
Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset
☆27 Updated 2 months ago
NKU-HLT / KNN-CTC
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆39 Updated last year
WangHelin1997 / SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆77 Updated last year
jishengpeng / WavReward
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators
☆50 Updated 2 months ago
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in PyTorch.
☆82 Updated 2 years ago
tts-tutorial / icassp2022
☆64 Updated 3 years ago
walker-hyf / NCSSD
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61 Updated 9 months ago
Alexander-H-Liu / dinosr
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
☆49 Updated last year
fengpeng-yue / speech-to-speech-translation
☆25 Updated 2 years ago
youngsheen / GPST
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆60 Updated 9 months ago
AlanBaade / SyllableLM
Official code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆57 Updated last month
DanielLin94144 / StyleTalk
Official release of the StyleTalk dataset.
☆67 Updated last year
yanghaha0908 / FastHuBERT
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆94 Updated 8 months ago
zeyuxie29 / AudioTime
☆33 Updated last year