Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/
☆15Oct 15, 2018Updated 7 years ago
Alternatives and similar repositories for gammatonegram
Users that are interested in gammatonegram are comparing it to the libraries listed below
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Nov 30, 2020Updated 5 years ago
- ☆14Apr 18, 2019Updated 6 years ago
- PyTorch implementation of the paper: Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)☆12Apr 24, 2020Updated 5 years ago
- SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019☆18Feb 20, 2019Updated 7 years ago
- ☆21Jul 15, 2024Updated last year
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- GUI tools for WORLD vocoder☆22Dec 19, 2024Updated last year
- Template that combines PyTorch Lightning and Hydra☆15Aug 15, 2023Updated 2 years ago
- Some useful features for speech processing, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplit…☆129Aug 12, 2020Updated 5 years ago
- A PyTorch implementation of the paper: Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- ☆20May 13, 2019Updated 6 years ago
- ☆28Sep 5, 2024Updated last year
- Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.☆227Jun 29, 2023Updated 2 years ago
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆22Feb 20, 2019Updated 7 years ago
- Python implementation of Gammatone filter☆25Jun 7, 2022Updated 3 years ago
- ☆32Jan 9, 2024Updated 2 years ago
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- Glottal Flow Model-based Iterative Adaptive Inverse Filtering☆27Sep 28, 2020Updated 5 years ago
- ☆25Feb 13, 2026Updated 3 weeks ago
- Vocal Tract Area Estimation by Gradient Descent☆38Jul 16, 2023Updated 2 years ago
- ☆36Jan 6, 2026Updated 2 months ago
- Our DCASE 2019 challenge task 3 method☆32Jan 17, 2023Updated 3 years ago
- ☆11Sep 4, 2023Updated 2 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆10Mar 19, 2019Updated 6 years ago
- MG top-down beam parsing☆13Jul 2, 2018Updated 7 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆45Sep 6, 2024Updated last year
- ☆14Sep 18, 2025Updated 5 months ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated 2 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆27Feb 13, 2026Updated 3 weeks ago
- Differentiable dynamic range controller in PyTorch.☆52Feb 10, 2026Updated 3 weeks ago
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.☆12Sep 30, 2019Updated 6 years ago
- Simple LPC vocoder in Python☆13Jan 7, 2022Updated 4 years ago
- Fetch and parse the American Presidency Project's press-briefing and presidential-news-conference transcripts.☆11Aug 18, 2016Updated 9 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a building block for Indic text-to-speech (TTS) systems☆12Nov 15, 2017Updated 8 years ago
- ☆10Dec 16, 2022Updated 3 years ago