PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition (https://arxiv.org/abs/1904.08779)
★24 · Aug 7, 2019 · Updated 6 years ago
Alternatives and similar repositories for sparse_image_warp_pytorch
Users that are interested in sparse_image_warp_pytorch are comparing it to the libraries listed below.
- A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ★500 · Jun 11, 2021 · Updated 4 years ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus) ★26 · May 2, 2020 · Updated 6 years ago
- PyTorch utilities for ML, specifically speech ★13 · Jan 30, 2024 · Updated 2 years ago
- MelNet-Tensorflow implementation ★41 · Dec 1, 2020 · Updated 5 years ago
- Momentum Contrast for Unsupervised Visual Representation Learning ★16 · Mar 24, 2023 · Updated 3 years ago
- Official PyTorch implementation of the paper: ProbAct: A Probabilistic Activation Function for Deep Neural Networks. ★13 · Jun 10, 2019 · Updated 6 years ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning ★23 · Aug 20, 2023 · Updated 2 years ago
- Analytic signal-based source information analysis for YANGstraight and real-time interactive tools ★34 · Aug 20, 2019 · Updated 6 years ago
- Bayesian spEEch Recognizer ★55 · Jan 11, 2021 · Updated 5 years ago
- This is the pytorch implementation of "Adaptively Connected Neural Networks" for the currently popular EfficientNet and the efficient DNA… ★10 · Dec 13, 2019 · Updated 6 years ago
- [FG 2026] Official implementation of the paper "NullFace: Training-Free Localized Face Anonymization" ★26 · Updated this week
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition ★12 · Mar 20, 2022 · Updated 4 years ago
- Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search ★17 · Jul 25, 2024 · Updated last year
- CMU multilingual speech repository ★30 · Apr 15, 2022 · Updated 4 years ago
- Autoencoder and t-SNE dim-reduction to visualize the MNIST dataset (and others) ★17 · Apr 30, 2018 · Updated 8 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech" ★28 · Feb 22, 2022 · Updated 4 years ago
- The official implementation of paper "Unsupervised Few-Shot Learning via Distribution Shift-based Augmentation" ★27 · Apr 23, 2022 · Updated 4 years ago
- ★13 · Mar 25, 2021 · Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ★28 · Sep 17, 2019 · Updated 6 years ago
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption" ★13 · Aug 2, 2024 · Updated last year
- ★24 · Nov 25, 2025 · Updated 5 months ago
- "Brian Hears" auditory modelling toolbox for the brian2 simulator ★26 · Jan 26, 2021 · Updated 5 years ago
- Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization at CVPR'19 ★48 · Jun 13, 2019 · Updated 6 years ago
- ★30 · Jun 30, 2025 · Updated 10 months ago
- OptKeras: wrapper around Keras and Optuna for hyperparameter optimization ★29 · Apr 1, 2020 · Updated 6 years ago
- The Pytorch code of "Asymmetric Distribution Measure for Few-shot Learning", IJCAI 2020. ★15 · Oct 9, 2020 · Updated 5 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ★10 · Feb 22, 2022 · Updated 4 years ago
- Python package to compute metrics on an NLU intent parsing pipeline ★13 · Mar 10, 2020 · Updated 6 years ago
- Unsupervised spoken sentence embeddings ★14 · Dec 14, 2022 · Updated 3 years ago
- Visual Question Answering in PyTorch ★10 · Oct 22, 2025 · Updated 6 months ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models". ★57 · Apr 20, 2023 · Updated 3 years ago
- PyTorch model inference on Android GPU using MACE library ★10 · Dec 11, 2020 · Updated 5 years ago
- Code for the NeurIPS 2019 paper: "Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning" ★33 · Jun 27, 2023 · Updated 2 years ago
- ★12 · Jan 4, 2022 · Updated 4 years ago
- Vim plugin for Bluespec SystemVerilog (BSV) ★11 · Nov 8, 2020 · Updated 5 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ★17 · Sep 19, 2023 · Updated 2 years ago
- Implicit Motion Function - (unofficial) Microsoft recreation ★29 · Nov 19, 2024 · Updated last year
- The code used to create the ARCA23K and ARCA23K-FSD datasets ★16 · Nov 9, 2021 · Updated 4 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning ★231 · Mar 23, 2021 · Updated 5 years ago