muthissar/diffstm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/muthissar/diffstm)

muthissar / diffstm

☆10

Alternatives and similar repositories for diffstm

Users that are interested in diffstm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SonyCSLParis / audioic
View on GitHub
Estimating musical surprisal/information content in Audio
☆34Apr 9, 2026Updated 3 months ago
yjlolo / dSEQ-VAE
View on GitHub
BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data
☆12May 25, 2022Updated 4 years ago
SonyCSLParis / audio-metrics
View on GitHub
Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.
☆47Jan 15, 2026Updated 6 months ago
QosmoInc / NeuralBeatbox_ML_Examples
View on GitHub
☆14Sep 13, 2022Updated 3 years ago
SonyCSLParis / pesto-full
View on GitHub
Full models and training code for PESTO
☆83Jun 12, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
diegocarrera89 / quantTree
View on GitHub
☆11Jul 25, 2023Updated 2 years ago
OFAI / lrn2
View on GitHub
Python deep learning framework including [Convolutional] Restricted Boltzmann Machines (RBMs), [Convolutional] Neural Networks and Auto-E…
☆14Jan 10, 2017Updated 9 years ago
rsprouse / xray_microbeam_database
View on GitHub
Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)
☆14Oct 8, 2020Updated 5 years ago
CoEDL / kaldi_helpers
View on GitHub
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15May 19, 2020Updated 6 years ago
duongtrung / time-series-temporal-saliency-patterns
View on GitHub
☆15Jul 25, 2023Updated 2 years ago
limengbinggz / CUSUM-RL
View on GitHub
Implementation of "Reinforcement Learning in Possibly Nonstationary Environments"
☆10Mar 10, 2025Updated last year
ligmg / ligmg
View on GitHub
Parallel solver for graph Laplacians from large social networks.
☆11Jan 5, 2022Updated 4 years ago
winlinvip / srs-k2
View on GitHub
Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC
☆20Apr 16, 2023Updated 3 years ago
SonyCSLParis / ssl-singer-identity
View on GitHub
☆69Nov 6, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
danijel3 / ClarinStudioKaldi
View on GitHub
A baseline Automatic Speech Recognition system for Polish based on Kaldi.
☆18Dec 21, 2021Updated 4 years ago
llbxg / hundun
View on GitHub
🦋 hundun is a python library for the exploration of chaos.
☆13May 19, 2023Updated 3 years ago
morgan76 / HE
View on GitHub
PyTorch implementation of the paper Learning Multi-Level Representations for Hierarchical Music Structure Analysis presented at ISMIR 202…
☆16Jan 2, 2023Updated 3 years ago
fundwotsai2001 / Text-to-music-dataset-preparation
View on GitHub
A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]
☆28May 20, 2025Updated last year
qcri / dialectal_arabic_tools
View on GitHub
☆14Nov 13, 2022Updated 3 years ago
collectivat / cmusphinx-models
View on GitHub
Acoustic and language models for minorised languages.
☆26Updated this week
SonyCSLParis / codicodec
View on GitHub
Encode and decode audio samples to/from continuous and discrete compressed representations!
☆121Nov 25, 2025Updated 7 months ago
xinyiguan / py2lispIDyOM
View on GitHub
A Python package for IDyOM
☆14Mar 31, 2023Updated 3 years ago
UW-Madison-Lee-Lab / score-wasserstein
View on GitHub
Code for "Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance", NeurIPS 2022
☆17Feb 11, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
RicherMans / UIT_Mobile
View on GitHub
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24Mar 6, 2023Updated 3 years ago
MSD-IRIMAS / CF-4-TSC
View on GitHub
Hand-Crafted Convolutional Filters 4 Time Series
☆17May 27, 2025Updated last year
SonyCSLParis / cgae-invar
View on GitHub
Convolutional gated autoencoder for learning transposition-invariant features from audio
☆24Mar 20, 2019Updated 7 years ago
guozixunnicolas / DENT_DDSP
View on GitHub
☆24Jun 30, 2023Updated 3 years ago
GT-RIPL / Geometric-Sensitivity-Decomposition
View on GitHub
☆18Oct 29, 2021Updated 4 years ago
vanessa-silva / NetF
View on GitHub
NetF, an alternative set of features, incorporating several representative topological measures of different complex networks mappings of…
☆18Jun 14, 2022Updated 4 years ago
ShoukanLabs / VoPho
View on GitHub
A collection of all our phonemeizers for dataset construction and inference
☆30Feb 21, 2025Updated last year
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
hmartelb / avlit
View on GitHub
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20Sep 1, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
f0k / minimp3py
View on GitHub
Python bindings for minimp3
☆17Sep 11, 2023Updated 2 years ago
apmcleod / met-align
View on GitHub
A model for meter detection and alignment from live performance MIDI data.
☆17Aug 4, 2020Updated 5 years ago
deezer / musicFPaugment
View on GitHub
Code for reproducting the paper Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
☆17Oct 31, 2023Updated 2 years ago
AMAAI-Lab / JamendoMaxCaps
View on GitHub
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53May 24, 2025Updated last year
bregaldo / pywst
View on GitHub
(Reduced) Wavelet Scattering Transform on Images
☆13Jan 28, 2025Updated last year
tandav / musiclib
View on GitHub
Set of tools to work with scales, modes, modulations, chord progressions, voice leading, rhythm and more
☆18Jan 19, 2025Updated last year
STherese / NA-MEMD-for-EEG
View on GitHub
Material related to "Unmixing oscillatory brain activity by EEG source localization and empirical mode decomposition". Implementation of …
☆16Jan 25, 2019Updated 7 years ago