An optional way to extract audio features
☆13Jun 10, 2017Updated 8 years ago
Alternatives and similar repositories for MRCG
Users that are interested in MRCG are comparing it to the libraries listed below
- ☆10Mar 21, 2018Updated 7 years ago
- ☆10Sep 19, 2018Updated 7 years ago
- PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.☆28May 1, 2024Updated last year
- Voice Activity Detection☆29Nov 13, 2017Updated 8 years ago
- Transfer Learning using state-of-the-art CNN architectures (ResNet34 and Xception). Class engineering, learning rate/weight decay tuning …☆11Jun 11, 2019Updated 6 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- A WeChat (and Weixin) chatbot skeleton in Python with queue/delayed messages support.☆12Jan 12, 2026Updated last month
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- Exploratory notebook. Techniques used: FFT, ARIMA, GARCH, Monte Carlo Simulations, fbprophet, LSTM, WaveNet.☆11Jul 11, 2022Updated 3 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- Four neural network architectures to classify sound source direction☆11Oct 3, 2020Updated 5 years ago
- These are various scripts to manipulate and/or measure the acoustic properties of speech sounds☆15Oct 18, 2024Updated last year
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- The MAFAT challenge, by the Israeli Department of Defense. Deep Learning based approach to classify radar signatures of humans and animal…☆10Nov 10, 2020Updated 5 years ago
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- Detect Duplicate Images Blazingly Fast☆12Dec 3, 2021Updated 4 years ago
- Metric Learning Library for Keras☆10Apr 24, 2019Updated 6 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- Echo aware source separation☆13May 29, 2018Updated 7 years ago
- Stock/ETF auto-trading code in R for IBAPI☆10Aug 1, 2018Updated 7 years ago
- A tool for calculating WER (Word Error Rate) in python.☆14Sep 18, 2024Updated last year
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Jan 25, 2021Updated 5 years ago
- Using k-means clustering for unsupervised CNN deep learning.☆11Oct 26, 2017Updated 8 years ago
- Pytorch implementation of "spectro-temporal attention-based voice activity detection"☆13Jun 4, 2024Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- 4 Mic Circular Array Support for Jetson - "EXPERIMENTAL"☆11Jul 20, 2021Updated 4 years ago
- Music Ontology tools and specifications☆13Dec 18, 2011Updated 14 years ago
- A python wrapper for kaldi-online-decoder using Cython☆12Sep 1, 2017Updated 8 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- J-Net is aimed at audio separation with a randomly weighted encoder.☆12Oct 23, 2019Updated 6 years ago
- Official Implementation of Interpretable Convolutional Neural Networks via Feedforward Design Arxiv: https://arxiv.org/abs/1810.02786☆12Aug 16, 2019Updated 6 years ago
- ☆13Jul 16, 2021Updated 4 years ago
- Implementation of Sorghum 3D reconstruction and skeletonization.☆13May 26, 2022Updated 3 years ago
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models☆12Nov 13, 2021Updated 4 years ago