my-yy / s2v_rc
Speech2Vec Reality Check
☆81Updated 2 years ago
Alternatives and similar repositories for s2v_rc:
Users that are interested in s2v_rc are comparing it to the libraries listed below
- ☆124Updated last year
- Keras implement of Finite Scalar Quantization☆70Updated last year
- ICLR2023 statistics☆60Updated last year
- A Pytorch Implementation of Finite Scalar Quantization☆112Updated last year
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆59Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆16Updated last year
- Representation learning for NLP @ JSALT19☆38Updated 4 years ago
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆186Updated 2 years ago
- Beyond Straight-Through☆93Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 2 years ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆44Updated 11 months ago
- SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer☆213Updated 2 months ago
- Contrastively Disentangled Sequential Variational Audoencoder☆46Updated 4 months ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆35Updated 4 years ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆47Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Updated last year
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆47Updated 3 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- ☆35Updated last year
- code for paper "Cross-modal Contrastive Learning for Speech Translation" (NAACL 2022)☆64Updated 2 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆37Updated last year
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Updated last year
- ☆75Updated 4 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆36Updated 10 months ago
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation☆20Updated last year
- [ICCV 2023] Online Clustered Codebook☆160Updated 5 months ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆87Updated 2 years ago
- ☆30Updated last year
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆106Updated 2 years ago