A PyTorch implementation of WaveNet ASR (Automatic Speech Recognition)
☆12Sep 22, 2021Updated 4 years ago
Alternatives and similar repositories for Pytorch-ASR-WaveNet
Users who are interested in Pytorch-ASR-WaveNet are comparing it to the libraries listed below
- 💻 🐈 Added a self-attention layer to the CycleGAN implementation (PyTorch).☆13May 31, 2024Updated last year
- Emotional Speech Conversion using Nonparallel Data☆17Apr 10, 2019Updated 6 years ago
- Codes, datasets, and synthetic dataset generator about the paper "LiCamPose: Combining Multi-View LiDAR and RGB Cameras for Robust Single…☆16Feb 28, 2026Updated 3 weeks ago
- ☆11Oct 6, 2025Updated 5 months ago
- [ICCV 2025] A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data☆16Dec 17, 2025Updated 3 months ago
- Voice conversion using deep adversarial learning☆17Oct 29, 2021Updated 4 years ago
- Diffusion Model for Voice Conversion☆17Oct 11, 2022Updated 3 years ago
- Git mirror of ImageStack☆11Aug 15, 2012Updated 13 years ago
- An implementation of the TRACLUS algorithm, A Partition-and-Group Framework (http://hanj.cs.illinois.edu/pdf/sigmod07_jglee.pdf).☆25Apr 17, 2023Updated 2 years ago
- brings autocomplete to Quill Placeholder module☆12Sep 28, 2018Updated 7 years ago
- Anhui University computer vision course project: human keypoint detection and its application in AI sports☆10Dec 11, 2022Updated 3 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 4 months ago
- Squad Mortar Overlay: Overlays Squad map with the SquadCalc map☆12Jan 29, 2025Updated last year
- Find how to pronounce words by breaking them up into their phones.☆24Jul 7, 2017Updated 8 years ago
- WaveNet Introduction☆37May 10, 2019Updated 6 years ago
- JavaScript libraries to interact with the Ispikit pronunciation assessment server☆11Nov 16, 2016Updated 9 years ago
- Tactical Observation of RF GNSS Interference☆14Jun 25, 2020Updated 5 years ago
- VAE with Attention Mechanism for a more powerful representation of interactions☆21Jun 29, 2019Updated 6 years ago
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆11Jun 20, 2025Updated 9 months ago
- ☆10Mar 13, 2024Updated 2 years ago
- In this repository, I have developed a CycleGAN architecture with embedded Self-Attention Layers, that could solve three different comple…☆25Jan 26, 2022Updated 4 years ago
- Some Top-Down 2D Pose Estimation☆17Nov 7, 2020Updated 5 years ago
- ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition☆12Oct 25, 2020Updated 5 years ago
- ☆20Jan 26, 2021Updated 5 years ago
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking☆13May 3, 2024Updated last year
- ☆35Aug 31, 2025Updated 6 months ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- ☆16Jul 23, 2023Updated 2 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in TensorFlow.☆27Oct 30, 2018Updated 7 years ago
- [AAAI 2025] Official Implementation of "HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting"☆16Feb 17, 2025Updated last year
- Multiple instance learning bag generation code using data from the ECOSTRESS Spectral Library V1.0.☆13Mar 25, 2020Updated 5 years ago
- ☆12Mar 24, 2021Updated 4 years ago
- Learning Diffusion Models for Multi-View Anomaly Detection [ECCV2024]☆15Oct 16, 2024Updated last year
- DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactu…☆10Oct 9, 2024Updated last year
- Detection of abnormal patterns in electricity usage via time series forecasting☆11Apr 15, 2018Updated 7 years ago
- Y. Wu, L. Jiao, X. Liu, F. Liu, S. Yang and L. Li, Domain Adaptation-aware Transformer for Hyperspectral Object Tracking. IEEE Transactio…☆12Jul 15, 2024Updated last year
- [IEEE GRSL 2017] Group Lasso-Based Band Selection for Hyperspectral Image Classification☆11Dec 25, 2017Updated 8 years ago
- Some PyTorch code for the Kaggle Speech Recognition Challenge☆12Feb 7, 2019Updated 7 years ago