An optional way to extract audio features
☆13 · Jun 10, 2017 · Updated 8 years ago
Alternatives and similar repositories for MRCG
Users that are interested in MRCG are comparing it to the libraries listed below.
- ☆10 · Mar 21, 2018 · Updated 8 years ago
- ☆10 · Sep 19, 2018 · Updated 7 years ago
- Deep Complex UNet for speech enhancement, init from "https://github.com/chanil1218/DCUnet.pytorch" ☆13 · Feb 21, 2020 · Updated 6 years ago
- Torch implementation for Robust convolutional neural networks under adversarial noise ☆13 · Mar 8, 2016 · Updated 10 years ago
- Fast parallel RNN-Transducer. ☆10 · Nov 1, 2019 · Updated 6 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch ☆42 · Feb 8, 2020 · Updated 6 years ago
- macOS audio loopback driver ☆16 · Oct 16, 2020 · Updated 5 years ago
- Voice Activity Detection ☆29 · Nov 13, 2017 · Updated 8 years ago
- An Attention-based Neural Network Approach for Single Channel Speech Enhancement ☆25 · Dec 1, 2019 · Updated 6 years ago
- Diffusion Net TensorFlow implementation ☆10 · Nov 10, 2017 · Updated 8 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset. ☆869 · Jun 9, 2021 · Updated 4 years ago
- Awesome Quantization Paper lists with Codes ☆10 · Feb 24, 2021 · Updated 5 years ago
- A python wrapper for kaldi-online-decoder using Cython ☆12 · Sep 1, 2017 · Updated 8 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258) ☆45 · Jan 6, 2026 · Updated 2 months ago
- Pytorch implementation of "spectro-temporal attention-based voice activity detection" ☆13 · Jun 4, 2024 · Updated last year
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models ☆12 · Nov 13, 2021 · Updated 4 years ago
- Code for "Hierarchical Diffusion Attention Network" (IJCAI 2019) ☆14 · Apr 23, 2020 · Updated 5 years ago
- These are various scripts to manipulate and/or measure the acoustic properties of speech sounds ☆15 · Oct 18, 2024 · Updated last year
- An implementation of Neural Style Transfer for Audio using Pytorch. ☆10 · Dec 14, 2017 · Updated 8 years ago
- ☆10 · May 15, 2021 · Updated 4 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification ☆14 · Jul 19, 2022 · Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆53 · Dec 6, 2022 · Updated 3 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition" ☆10 · Dec 15, 2022 · Updated 3 years ago
- Echo aware source separation ☆13 · May 29, 2018 · Updated 7 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated last year
- J-Net is aimed for audio separation with randomly weighted encoder. ☆12 · Oct 23, 2019 · Updated 6 years ago
- Simple sinc interpolation in PyTorch. ☆15 · Jul 8, 2023 · Updated 2 years ago
- Projects for the Peking University course "Techniques and Applications of Deep Learning" ☆13 · May 3, 2017 · Updated 8 years ago
- Rainbow Keywords - Official PyTorch Implementation ☆13 · Jun 27, 2024 · Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆15 · Jun 23, 2024 · Updated last year
- Pretrained spoken language classifiers from audio. ☆10 · Jan 21, 2021 · Updated 5 years ago
- ☆27 · Apr 21, 2017 · Updated 8 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack ☆10 · Feb 19, 2018 · Updated 8 years ago
- Four neural network architectures to classify sound source direction ☆11 · Oct 3, 2020 · Updated 5 years ago
- A WeChat (and Weixin) chatbot skeleton in Python with queue/delayed messages support. ☆12 · Jan 12, 2026 · Updated 2 months ago
- DNN-for-speech-enhancement ☆176 · Feb 23, 2023 · Updated 3 years ago
- ☆14 · Sep 20, 2023 · Updated 2 years ago