my-yy / s2v_rc
Speech2Vec Reality Check
☆84Updated 2 years ago
Alternatives and similar repositories for s2v_rc
Users who are interested in s2v_rc are comparing it to the libraries listed below.
- ☆143Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Updated 2 years ago
- Keras implementation of Finite Scalar Quantization☆83Updated 2 years ago
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆193Updated 3 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 3 years ago
- Contrastively Disentangled Sequential Variational Autoencoder☆48Updated last year
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆37Updated 2 years ago
- ICLR2023 statistics☆59Updated 2 years ago
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆37Updated 5 years ago
- [ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer☆309Updated 11 months ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Updated 2 years ago
- Representation learning for NLP @ JSALT19☆40Updated 5 years ago
- PyTorch implementation of FiLM: Visual Reasoning with a General Conditioning Layer☆64Updated 6 years ago
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆64Updated 3 years ago
- ChatGPT - Review & Rebuttal: A browser extension for generating reviews and rebuttals, powered by ChatGPT. 利用 ChatGPT 生成审稿意见和回复的浏览器插件☆251Updated 2 years ago
- A Pytorch Implementation of Finite Scalar Quantization☆165Updated 2 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆20Updated 3 years ago
- Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information☆351Updated last year
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆51Updated last year
- [NeurIPS 2023] Official Implementation: "Consistent Diffusion Models"☆58Updated 2 years ago
- [ICCV 2023] Online Clustered Codebook☆182Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 3 years ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Updated 2 years ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆31Updated 9 months ago
- A minimal pytorch package implementing a gradient reversal layer.☆158Updated last year
- Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>☆17Updated last year
- Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy M…☆36Updated last year
- Can audio-visual integration strengthen robustness under multimodal attacks?☆28Updated 3 years ago
- Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation☆26Updated 4 years ago
- Non-Autoregressive Predictive Coding☆51Updated 5 years ago