PyTorch implementation of sparse_image_warp, with an example of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition (https://arxiv.org/abs/1904.08779)
☆24 · Aug 7, 2019 · Updated 6 years ago
Alternatives and similar repositories for sparse_image_warp_pytorch
Users that are interested in sparse_image_warp_pytorch are comparing it to the libraries listed below.
- A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆501 · Jun 11, 2021 · Updated 4 years ago
- DEPRECATED version of SoundFile ☆14 · May 26, 2020 · Updated 5 years ago
- SWIG bindings for Kaldi I/O, built with Conda ☆15 · Dec 15, 2024 · Updated last year
- MelNet-Tensorflow implementation ☆41 · Dec 1, 2020 · Updated 5 years ago
- Pytorch Bindings for warp-ctc maintained by ESPnet ☆17 · Feb 20, 2021 · Updated 5 years ago
- An attempt at genre classification with convolutional neural networks and spectrograms ☆15 · Nov 25, 2017 · Updated 8 years ago
- Momentum Contrast for Unsupervised Visual Representation Learning ☆16 · Mar 24, 2023 · Updated 3 years ago
- Official PyTorch implementation of the paper: ProbAct: A Probabilistic Activation Function for Deep Neural Networks. ☆13 · Jun 10, 2019 · Updated 6 years ago
- ☆32 · Jul 17, 2025 · Updated 10 months ago
- This is the pytorch implementation of "Adaptively Connected Neural Networks" for the currently popular EfficientNet and the efficient DNA… ☆10 · Dec 13, 2019 · Updated 6 years ago
- ☆29 · Apr 8, 2025 · Updated last year
- Generative Refinement Networks for Visual Synthesis ☆101 · May 13, 2026 · Updated last week
- multilingual speech aligner ☆77 · Nov 19, 2023 · Updated 2 years ago
- Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search ☆17 · Jul 25, 2024 · Updated last year
- 1,440 audio files (.wav), i.e. speech files, from 24 actors that are categorized into 8 separate emotions. ☆15 · Feb 11, 2019 · Updated 7 years ago
- CMU multilingual speech repository ☆30 · Apr 15, 2022 · Updated 4 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech" ☆28 · Feb 22, 2022 · Updated 4 years ago
- The official implementation of paper "Unsupervised Few-Shot Learning via Distribution Shift-based Augmentation" ☆26 · Apr 23, 2022 · Updated 4 years ago
- ☆10 · Apr 27, 2021 · Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp… ☆28 · Sep 17, 2019 · Updated 6 years ago
- [NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption" ☆13 · Aug 2, 2024 · Updated last year
- ☆25 · Nov 25, 2025 · Updated 6 months ago
- ☆21 · Jan 19, 2026 · Updated 4 months ago
- PyTorch implementation of Res2Net ☆114 · Apr 28, 2019 · Updated 7 years ago
- ☆13 · Jun 14, 2024 · Updated last year
- Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization at CVPR'19 ☆48 · Jun 13, 2019 · Updated 6 years ago
- ☆23 · Jul 5, 2025 · Updated 10 months ago
- The Pytorch code of "Asymmetric Distribution Measure for Few-shot Learning", IJCAI 2020. ☆15 · Oct 9, 2020 · Updated 5 years ago
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs ☆13 · Feb 13, 2024 · Updated 2 years ago
- Unsupervised spoken sentence embeddings ☆14 · Dec 14, 2022 · Updated 3 years ago
- A Pytorch implementation of https://arxiv.org/abs/1810.12348. ☆37 · Feb 19, 2019 · Updated 7 years ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models". ☆57 · Apr 20, 2023 · Updated 3 years ago
- Webtoons.com downloader ☆11 · May 24, 2022 · Updated 4 years ago
- "What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (AC… ☆12 · Dec 30, 2021 · Updated 4 years ago
- Baidu AI competition: click anti-fraud prediction, ranked 7/424 ☆14 · Jul 19, 2020 · Updated 5 years ago
- DNI (Decoupled Neural Interfaces using Synthetic Gradients) Implementation with Tensorflow. ☆28 · Jan 26, 2018 · Updated 8 years ago
- ☆12 · Apr 26, 2018 · Updated 8 years ago
- Code for the NeurIPS 2019 paper: "Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning" ☆33 · Jun 27, 2023 · Updated 2 years ago
- Vim plugin for Bluespec SystemVerilog (BSV) ☆12 · Nov 8, 2020 · Updated 5 years ago