alireza-nasiri/SoundCLR

PyTorch implementation of "SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification."

☆29

Alternatives and similar repositories for SoundCLR

Users that are interested in SoundCLR are comparing it to the libraries listed below.

mohaimenz / acdnet
Official repository: Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrain…
☆43 · Jul 19, 2023 · Updated 2 years ago
liuxubo717 / SimPFs
Code for "Simple Pooling Front-ends for Efficient Audio Classification", ICASSP 2023
☆57 · Mar 3, 2023 · Updated 3 years ago
swagshaw / ASC-CL
Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14 · Jul 19, 2022 · Updated 3 years ago
EZ-VC / EZ-VC
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆41 · Sep 9, 2025 · Updated 10 months ago
haoheliu / diffres-python
Learning differentiable temporal resolution on time-series data.
☆36 · Nov 12, 2022 · Updated 3 years ago
amazon-science / unsupervised-melody-to-lyrics-generation
This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…
☆11 · Jul 6, 2023 · Updated 3 years ago
huaidanquede / Dense-TSNet
Official code for Dense-TSNet
☆12 · Sep 17, 2024 · Updated last year
cyhuang-tw / robust-vc
☆11 · May 7, 2022 · Updated 4 years ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆16 · Dec 3, 2024 · Updated last year
ristea / septr
☆29 · Sep 29, 2022 · Updated 3 years ago
arnabdas8901 / StarGAN-VC_PlusPlus
☆11 · Aug 11, 2023 · Updated 2 years ago
YuanGongND / psla
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150 · Jul 13, 2023 · Updated 2 years ago
bagustris / ssl-ser
Repository for reproducing the results of the journal paper "Self-supervised learning for Speech Emotion Recognition"
☆10 · Mar 15, 2023 · Updated 3 years ago
motazsaad / ara-pronunciation-tool
A Python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15 · Sep 5, 2017 · Updated 8 years ago
andrebola / contrastive-mir-learning
This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"
☆15 · Jun 22, 2023 · Updated 3 years ago
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
Alibaba-MIIL / AudioClassfication
☆90 · May 27, 2023 · Updated 3 years ago
declare-lab / HyperTTS
☆40 · Apr 15, 2024 · Updated 2 years ago
jfainberg / sincnet_adapt
Raw waveform adaptation with SincNet
☆12 · Mar 19, 2024 · Updated 2 years ago
janson9192 / autokws2021
☆13 · Mar 25, 2021 · Updated 5 years ago
RBenita / DIFFAR
Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation
☆32 · Mar 8, 2024 · Updated 2 years ago
nethermanpro / ComSL
☆11 · Oct 14, 2023 · Updated 2 years ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17 · Oct 2, 2024 · Updated last year
TAMS-Group / tams_glass_reconstruction
Detection and Reconstruction of Transparent Objects with Infrared Projection-based RGB-D Cameras
☆13 · Jan 17, 2021 · Updated 5 years ago
WangHelin1997 / MaskSpec
The PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
☆51 · Dec 17, 2024 · Updated last year
streichgeorg / autosing
☆18 · Jan 20, 2025 · Updated last year
ttslr / MonTTS
☆16 · Dec 23, 2021 · Updated 4 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.
☆16 · Jun 17, 2022 · Updated 4 years ago
daanzu / py-silero-vad-lite
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no Python package dependencies
☆17 · Nov 25, 2024 · Updated last year
Ydkwim / CTAL
Pre-training Cross-modal Transformer for Audio-and-Language Representations
☆39 · Apr 20, 2021 · Updated 5 years ago
deegy666 / ADD-RSC
Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’
☆22 · Dec 19, 2025 · Updated 6 months ago
zhai-lw / L3AC
A lightweight audio codec based on a single quantizer
☆34 · Sep 4, 2025 · Updated 10 months ago
tstafylakis / Speaker-Embeddings-Correlation-Pooling
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
☆11 · Sep 20, 2021 · Updated 4 years ago
aask1357 / hilcodec
High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec
☆119 · Jun 23, 2025 · Updated last year
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆19 · Jun 5, 2023 · Updated 3 years ago
wngh1187 / RawNeXt
PyTorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…
☆25 · Jun 22, 2022 · Updated 4 years ago
NTRLab / MediaSpeech
☆22 · Jul 22, 2022 · Updated 3 years ago
fubel / stmc
Code for the paper "Spatial-Temporal Multi-Cuts for Online Multiple-Camera Vehicle Tracking"
☆15 · Apr 12, 2025 · Updated last year