earthspecies / unsupervised-speech-translation
☆9Updated 3 years ago
Alternatives and similar repositories for unsupervised-speech-translation:
Users that are interested in unsupervised-speech-translation are comparing it to the libraries listed below
- Script and models for clustering LAION-400m CLIP embeddings.☆25Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- ☆15Updated 2 years ago
- Code for making #GANterpretations☆23Updated 4 years ago
- Describe the format of image/text datasets☆11Updated 2 years ago
- ☆15Updated 2 years ago
- AdaCat☆49Updated 2 years ago
- ☆31Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Local Attention - Flax module for Jax☆20Updated 3 years ago
- Baseline systems for the FSD50K dataset☆67Updated 3 years ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Updated this week
- Speech in Flax/JAX☆15Updated 2 years ago
- Contrastive Language-Audio Pretraining☆87Updated 2 years ago
- VIsually-Pivoted Audio and(N) Text☆22Updated 2 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated last year
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆66Updated 2 years ago
- ☆32Updated 4 years ago
- A generative modelling toolkit for PyTorch.☆70Updated 3 years ago
- High performance pytorch modules☆18Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆22Updated last year
- A Pytorch Implementations for Various Vector Quantization Methods☆27Updated 3 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- Aggregating embeddings over time☆31Updated 2 years ago
- Audio captioning baseline system for DCASE 2020 challenge.☆38Updated last year
- bumble bee transformer☆14Updated 3 years ago
- ☆32Updated 3 years ago
- ☆23Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year