☆10Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for diffstm
Users that are interested in diffstm are comparing it to the libraries listed below
Sorting:
- ☆14Sep 13, 2022Updated 3 years ago
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data☆12May 25, 2022Updated 3 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Oct 8, 2020Updated 5 years ago
- ☆14Nov 13, 2022Updated 3 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 5 years ago
- Estimating musical surprisal/information content in Audio☆23Jan 19, 2026Updated last month
- ☆23Jun 30, 2023Updated 2 years ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated 9 months ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆20Apr 16, 2023Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- Encode and decode audio samples to/from continuous and discrete compressed representations!☆104Nov 25, 2025Updated 3 months ago
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Sep 1, 2023Updated 2 years ago
- Acoustic and language models for minorised languages.☆26Sep 30, 2020Updated 5 years ago
- Fast and differentiable time domain all-pole filter in PyTorch.☆68Feb 5, 2026Updated 3 weeks ago
- ☆22Apr 8, 2022Updated 3 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Digital Signals Theory book and source materials☆36Jan 7, 2026Updated last month
- fiwGAN/ciwGAN (Featural and Categorical InfoWaveGAN): Generative Adversarial Phonology and Semantics☆26May 24, 2023Updated 2 years ago
- Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner☆29May 11, 2019Updated 6 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆31Dec 15, 2014Updated 11 years ago
- A Python implementation of the InverSynth method (Barkan, Tsiris, Koenigstein, Katz)☆32Dec 26, 2022Updated 3 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Nov 16, 2018Updated 7 years ago
- Full models and training code for PESTO☆75Jun 12, 2024Updated last year
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 9 months ago
- Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification☆34Feb 8, 2020Updated 6 years ago
- ☆87May 21, 2023Updated 2 years ago
- Book event tickets securely using blockchain technology! Our decentralized application leverages Ethereum for transparent ticket transact…☆12Aug 3, 2023Updated 2 years ago
- Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.☆43Jan 15, 2026Updated last month
- Python binding for SRI Language Modeling Toolkit implemented in Cython☆30Jan 24, 2022Updated 4 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆10Mar 19, 2019Updated 6 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34May 25, 2024Updated last year
- Official Implementation of Jointist☆37Jul 26, 2023Updated 2 years ago
- Rhythm GAN/CAN player for Ableton Live / Max for Live☆37Sep 2, 2021Updated 4 years ago
- Steer OpenAI's Jukebox with Music Taggers☆42Apr 21, 2022Updated 3 years ago
- The training code for the 4th place model at MDX 2021 leaderboard A.☆36Sep 1, 2021Updated 4 years ago
- Source code for "FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control"☆165Oct 15, 2024Updated last year