Spectra extraction tutorials based on torch and torchaudio.
☆41Aug 8, 2023Updated 2 years ago
Alternatives and similar repositories for spectra
Users that are interested in spectra are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jun 14, 2022Updated 3 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- Voice Activity Detection (VAD) using deep learning.☆204Oct 14, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- A pipeline to isolate and transcribe one language in mixed-language speech☆20Oct 25, 2022Updated 3 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- Voice Activity Detection in speech signals using short time energy and zero-crossings rate☆20Jun 3, 2022Updated 3 years ago
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆79Aug 19, 2022Updated 3 years ago
- Latest PyTorch Implementation of DeltaGRU & DeltaLSTM that Exploits Temporal Sparsity in Sequential Data☆17Sep 30, 2023Updated 2 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- This is my PyTorch implementation of the "Very Deep Convolutional Neural Networks For Raw Waveforms" research paper published in 2016.☆17Aug 24, 2021Updated 4 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆160Jan 9, 2023Updated 3 years ago
- Core digital signal processing function library☆23May 20, 2025Updated 10 months ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆78Nov 9, 2019Updated 6 years ago
- Voice Activity Detector in Python☆480Nov 17, 2020Updated 5 years ago
- 方言分类,pytorch☆43Sep 25, 2018Updated 7 years ago
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…☆12Mar 25, 2025Updated last year
- ☆17Dec 17, 2025Updated 3 months ago
- ☆18Mar 25, 2023Updated 3 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An Adaptive Line Enhancer (ALE) based on Least Mean Square (LMS) algorithm to eliminate broadband noise from a narrowband signal☆24Mar 31, 2019Updated 7 years ago
- Text Classification model deployment using FastAPI, Streamlit and Docker Compose☆14Feb 12, 2021Updated 5 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- ☆12Jun 5, 2018Updated 7 years ago
- Social previews generator as a microservice.☆11Apr 9, 2022Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 4 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- This repo is for Audio Processing Techniques and the Silence Remove using Python☆17Jul 13, 2020Updated 5 years ago
- Basing on Adaptive Line Enhancer/Canceler technique to reduce tonal noise by using LMS, RLS, NLMS and Kalman adaptive filter.☆20May 17, 2018Updated 7 years ago
- ☆13Apr 14, 2024Updated 2 years ago
- A simple example of running a MongoDB instance to query a database☆10Aug 31, 2022Updated 3 years ago
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆11Jan 29, 2022Updated 4 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago