Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks.
☆18Apr 16, 2022Updated 4 years ago
Alternatives and similar repositories for DeepSpectrumLite
Users that are interested in DeepSpectrumLite are comparing it to the libraries listed below.
- ☆138Aug 29, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆14Oct 14, 2023Updated 2 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆22Dec 21, 2024Updated last year
- RespireNet is an innovative web-based application that harnesses the capabilities of deep learning and Mel-frequency cepstral coefficient…☆10Aug 2, 2023Updated 2 years ago
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆59Jul 1, 2024Updated last year
- ☆17Aug 10, 2021Updated 4 years ago
- recent audio generation papers (including speech, music and general audios)☆13Mar 14, 2023Updated 3 years ago
- ☆15Jul 4, 2024Updated last year
- Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"☆10Dec 19, 2021Updated 4 years ago
- ☆18Jul 22, 2024Updated last year
- Depression-Detection represents a machine learning algorithm to classify audio using acoustic features in human speech, thus detecting de…☆14Jul 10, 2020Updated 5 years ago
- AI-SAM: Automatic and Interactive Segment Anything Model☆21Feb 25, 2025Updated last year
- MATLAB + Python implementations of real-time median-filtering Harmonic-Percussive Source Separation☆22Sep 9, 2021Updated 4 years ago
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"☆36Jul 16, 2024Updated last year
- Perform three types of feature extraction: STFT, MFCC and MelSpectrogram. Apply CNN/VGG with or without RNN architecture. Able to achieve…☆15Jun 28, 2020Updated 5 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 5 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- ☆13Jan 14, 2025Updated last year
- Speech emotion recognition using LSTM, SVM and MLP | 语音情感识别☆10Jul 1, 2019Updated 6 years ago
- ☆19Mar 2, 2024Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- ☆23Jun 24, 2024Updated last year
- Song Plays Workshop Tutorial☆13Nov 19, 2020Updated 5 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Apr 15, 2026Updated last month
- Cough detection with Log Mel Spectrogram, Wavelet Transform, Deep learning and Transfer learning techniques☆17Dec 12, 2020Updated 5 years ago
- ☆65Jun 28, 2023Updated 2 years ago
- ☆11Oct 20, 2022Updated 3 years ago
- Uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Implementation of Google's USM speech model in Pytorch☆36May 11, 2026Updated last week
- ☆14Mar 24, 2023Updated 3 years ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Sep 7, 2025Updated 8 months ago
- 80s FM video game music dataset (ISMIR 2022)☆30Jan 10, 2023Updated 3 years ago
- Detecting depressed Patient based on Speech Activity, Pauses in Speech and Using Deep learning Approach☆20Jan 5, 2023Updated 3 years ago
- ☆14Jun 3, 2024Updated last year