my-yy / s2v_rc
Speech2Vec Reality Check
☆77Updated last year
Related projects ⓘ
Alternatives and complementary repositories for s2v_rc
- ☆118Updated 8 months ago
- ICLR2023 statistics☆60Updated last year
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆53Updated 2 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆35Updated 11 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated 2 years ago
- Can audio-visual integration strengthen robustness under multimodal attacks?☆26Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆15Updated last year
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆35Updated 4 years ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆47Updated 10 months ago
- Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)☆181Updated 2 years ago
- This repo contains script to download MUSIC dataset from youtube☆8Updated 10 months ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆43Updated 7 months ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Updated last year
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆49Updated 7 months ago
- Representation learning for NLP @ JSALT19☆36Updated 4 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆46Updated 2 years ago
- Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information☆312Updated 6 months ago
- A Pytorch Implementation of Finite Scalar Quantization☆88Updated 11 months ago
- This package aims at simplifying the download of the AudioCaps dataset.☆30Updated 11 months ago
- Source code for the paper 'Audio Captioning Transformer'☆50Updated 2 years ago
- Keras implement of Finite Scalar Quantization☆64Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆37Updated 3 months ago
- Contrastively Disentangled Sequential Variational Audoencoder☆45Updated last month
- [ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks☆46Updated 9 months ago
- Beyond Straight-Through☆90Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- ☆29Updated last year
- ☆33Updated 10 months ago
- ☆10Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆83Updated 2 years ago