DeepSpectrum/DeepSpectrumLite

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DeepSpectrum/DeepSpectrumLite)

DeepSpectrum / DeepSpectrumLite

Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks.

☆18

Alternatives and similar repositories for DeepSpectrumLite

Users that are interested in DeepSpectrumLite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DeepSpectrum / DeepSpectrum
View on GitHub
☆138Aug 29, 2024Updated last year
K-GOKULAPPADURAI / RespireNet-Respiratory-Disease-Prediction-Web-Application-Using-Deep-Learning
View on GitHub
RespireNet is an innovative web-based application that harnesses the capabilities of deep learning and Mel-frequency cepstral coefficient…
☆10Aug 2, 2023Updated 2 years ago
thelahunginjeet / pyica
View on GitHub
python code for Independent Component Analysis
☆14Jan 8, 2018Updated 8 years ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
lucadellalib / ts-asr
View on GitHub
Target speaker automatic speech recognition (TS-ASR)
☆14Oct 14, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
VKW2021 / kaldi-baseline
View on GitHub
kaldi cnn-tdnnf baseline
☆13Aug 31, 2021Updated 4 years ago
johnmartinsson / differentiable-mel-spectrogram
View on GitHub
The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …
☆24Dec 21, 2024Updated last year
usc-sail / peft-ser
View on GitHub
[ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…
☆60Jul 1, 2024Updated 2 years ago
yichen14 / FastAdaSP
View on GitHub
Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)
☆17Nov 14, 2024Updated last year
stijani / elastic-weight-consolidation-tf2
View on GitHub
☆17Aug 10, 2021Updated 4 years ago
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
timjaya / lastfm
View on GitHub
A music recommender system using Last.fm data
☆12Oct 17, 2019Updated 6 years ago
belal981 / depression-detection
View on GitHub
Depression-Detection represents a machine learning algorithm to classify audio using acoustic features in human speech, thus detecting de…
☆14Jul 10, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jackgle / YAMNet-transfer-learning
View on GitHub
Transfer learning and fine-tuning with YAMNet
☆21Jan 20, 2026Updated 6 months ago
cadia-lvl / samromur-asr
View on GitHub
Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
☆12Sep 30, 2022Updated 3 years ago
hhhaaahhhaa / ASR-TTA
View on GitHub
☆16Nov 4, 2025Updated 8 months ago
yukimasano / single-img-extrapolating
View on GitHub
Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"
☆36Jul 16, 2024Updated 2 years ago
linjac / GenDARA
View on GitHub
☆13Jan 14, 2025Updated last year
YUCHEN005 / RATS-Channel-A-Speech-Data
View on GitHub
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16Oct 22, 2022Updated 3 years ago
KosminD / YAMNet_transfer
View on GitHub
☆21Sep 2, 2020Updated 5 years ago
sahilsharma884 / Music-Genre-Classification
View on GitHub
Perform three types of feature extraction: STFT, MFCC and MelSpectrogram. Apply CNN/VGG with or without RNN architecture. Able to achieve…
☆15Jun 28, 2020Updated 6 years ago
EIHW / MuSe2022
View on GitHub
☆28May 13, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Neclow / SERAB
View on GitHub
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28Dec 16, 2022Updated 3 years ago
chimechallenge / chime-utils
View on GitHub
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆26Feb 25, 2025Updated last year
YuanGongND / llm_speech_emotion_challenge
View on GitHub
☆23Jun 24, 2024Updated 2 years ago
AmirHoseein99 / Depression-Engine
View on GitHub
Detecting depressed Patient based on Speech Activity, Pauses in Speech and Using Deep learning Approach
☆20Jan 5, 2023Updated 3 years ago
Keerthiraj-Nagaraj / cough-detection-with-transfer-learning
View on GitHub
Cough detection with Log Mel Spectrogram, Wavelet Transform, Deep learning and Transfer learning techniques
☆17Dec 12, 2020Updated 5 years ago
jymh / SAP2-ASR
View on GitHub
☆26Jan 23, 2026Updated 5 months ago
lin9x / AV-Sepformer
View on GitHub
☆65Jun 28, 2023Updated 3 years ago
nafiuny / ICRCycleGAN-VC
View on GitHub
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆15Apr 15, 2026Updated 3 months ago
SarthakYadav / axlstm-official
View on GitHub
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21Sep 7, 2025Updated 10 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
azmat21 / UyghurTextResource
View on GitHub
uyghur text resource crawled from website
☆12Dec 25, 2015Updated 10 years ago
zqs01 / data2vecnoisy
View on GitHub
☆11Oct 20, 2022Updated 3 years ago
FederatedML / FedSTAR
View on GitHub
Federated Self-Training for Data-Efficient Audio Recognition
☆10May 7, 2022Updated 4 years ago
google-research / pactran_metrics
View on GitHub
☆14Mar 24, 2023Updated 3 years ago
ECNU-Cross-Innovation-Lab / ShiftSER
View on GitHub
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
☆39Dec 18, 2023Updated 2 years ago
kyegomez / USM
View on GitHub
Implementation of Google's USM speech model in Pytorch
☆35Jul 13, 2026Updated last week
yikun-baio / sliced_opt
View on GitHub
☆14Jun 3, 2024Updated 2 years ago