jefflai108 / PARP-wav2vec-PyTorchLinks

☆9

Alternatives and similar repositories for PARP-wav2vec-PyTorch

Users that are interested in PARP-wav2vec-PyTorch are comparing it to the libraries listed below

Sorting:

B06901052 / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆13Updated 2 years ago
iamyuanchung / VQ-APC
Vector Quantized Autoregressive Predictive Coding (VQ-APC)
☆37Updated 4 years ago
huckiyang / awesome-neural-reprogramming-prompting
A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022
☆36Updated last year
Alexander-H-Liu / NPC
Non-Autoregressive Predictive Coding
☆51Updated 4 years ago
jasonppy / FaST-VGS-Family
Transformer-based visually grounded speech models
☆19Updated 2 years ago
andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆55Updated 2 years ago
Hertin / WavPrompt
☆37Updated 3 years ago
grtzsohalf / SpeechNet-codebase
☆20Updated 4 years ago
mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74Updated 2 years ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60Updated 2 years ago
kamperh / vqwordseg
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆37Updated last year
asappresearch / wav2seq
Official code for Wav2Seq
☆95Updated 3 years ago
wnhsu / ResDAVEnet-VQ
Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"
☆27Updated 3 years ago
GATECH-EIC / S3-Router
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17Updated last year
audio-captioning / audio-captioning-resources
A list of resources that can help in research for automated audio captioning
☆34Updated 4 years ago
Alexander-H-Liu / dinosr
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
☆49Updated last year
yistLin / human-evaluation
☆22Updated 4 years ago
akhilmathurs / libriadapt
Instructions on downloading and using the LibriAdapt dataset
☆46Updated 3 years ago
bshall / VectorQuantizedCPC
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142Updated 4 years ago
Ydkwim / CTAL
Pre-training Cross-modal Transformer for Audio-and-Language Representations
☆39Updated 4 years ago
ga642381 / SpeechPrompt
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102Updated 4 months ago
pohanchi / AALBERT
The official repository for Audio ALBERT
☆66Updated 3 years ago
jasonppy / word-discovery
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆26Updated last year
felixgontier / dcase-2023-baseline
☆14Updated 2 years ago
asappresearch / slue-toolkit
A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…
☆65Updated last year
felixkreuk / UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆141Updated 3 years ago
LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆51Updated 3 years ago
WangHelin1997 / AT-GCN
Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
☆13Updated 4 years ago
audioku / cross-accent-maml-asr
Meta-learning model agnostic (MAML) implementation for cross-accented ASR
☆44Updated last year
Splend1d / T5lephone
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Updated 2 years ago