my-yy / s2v_rc
Speech2Vec Reality Check
☆82Updated 2 years ago
Alternatives and similar repositories for s2v_rc:
Users that are interested in s2v_rc are comparing it to the libraries listed below
- ICLR2023 statistics☆60Updated last year
- ☆127Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 2 years ago
- Keras implement of Finite Scalar Quantization☆71Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆16Updated last year
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆187Updated 2 years ago
- Contrastively Disentangled Sequential Variational Audoencoder☆46Updated 5 months ago
- Can audio-visual integration strengthen robustness under multimodal attacks?☆28Updated 3 years ago
- ☆36Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Updated 2 years ago
- SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer☆235Updated 3 months ago
- A Pytorch Implementation of Finite Scalar Quantization☆118Updated last year
- Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>☆61Updated last year
- Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information☆329Updated 11 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆24Updated last month
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆49Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- ☆11Updated last year
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆35Updated 4 years ago
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆60Updated 2 years ago
- Beyond Straight-Through☆94Updated last year
- ☆30Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆87Updated 2 years ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆44Updated last year
- Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"☆19Updated 5 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆45Updated 5 months ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆36Updated last year
- Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".☆36Updated last year
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆41Updated 11 months ago