diegotg2000 / PitchFlower
Official implementation of the paper PitchFlower: A flow-based neural audio codec with pitch controllability
☆29 · Nov 3, 2025 · Updated 3 months ago
Alternatives and similar repositories for PitchFlower
Users who are interested in PitchFlower are comparing it to the libraries listed below.
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil… ☆39 · Jun 17, 2025 · Updated 7 months ago
- Multi-speaker separation, identification, diarization ALL-IN-ONE. It can isolate the target speaker from a conversation audio and do ASR. ☆61 · Oct 13, 2025 · Updated 4 months ago
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025. ☆47 · Apr 14, 2025 · Updated 9 months ago
- Where is the "main theme" in an orchestral score? ☆12 · Oct 25, 2025 · Updated 3 months ago
- Report various statistics stemming from a confusion matrix in a tidy fashion. 🎯 ☆12 · Jul 10, 2020 · Updated 5 years ago
- ☆11 · Sep 23, 2022 · Updated 3 years ago
- UDP/TCP Networking for Max/MSP 8+ (nodejs) ☆14 · Nov 27, 2021 · Updated 4 years ago
- Interactive Performance, Analysis and Visualization of RAVE Latent Spaces via PCA and OSC Integration ☆13 · Jul 15, 2025 · Updated 6 months ago
- MIR conference deadline countdowns ☆10 · Feb 4, 2026 · Updated last week
- ☆11 · Mar 28, 2024 · Updated last year
- ☆16 · Sep 29, 2025 · Updated 4 months ago
- Spatial active noise control based on kernel interpolation of sound field ☆13 · Mar 30, 2023 · Updated 2 years ago
- Scala tuning files (*.scl) reader for Max/MSP. ☆11 · Oct 25, 2018 · Updated 7 years ago
- ☆13 · Nov 2, 2020 · Updated 5 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆10 · Sep 30, 2024 · Updated last year
- Active Noise Control Headphone Implementation using Deep Learning-based GFANC ☆12 · Aug 28, 2025 · Updated 5 months ago
- Morphē's TouchDesigner Palette for techniques & tools :) ☆11 · Aug 17, 2022 · Updated 3 years ago
- ☆10 · Mar 20, 2024 · Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 7 months ago
- [ICASSP 2025] Official implementation of "ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning". ☆14 · Feb 2, 2025 · Updated last year
- This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation. ☆12 · Sep 25, 2024 · Updated last year
- Run Qwen3-TTS text-to-speech locally on Mac (M1/M2/M3/M4). Voice cloning, voice design, custom voices. 100% offline using MLX. ☆31 · Jan 28, 2026 · Updated 2 weeks ago
- Run Retrieval-based Voice Conversion training and inference with ease. ☆11 · Jan 24, 2025 · Updated last year
- A real-time voice conversion model based on VITS. ☆14 · Aug 1, 2024 · Updated last year
- ☆11 · Aug 1, 2025 · Updated 6 months ago
- ☆13 · Apr 3, 2022 · Updated 3 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization” ☆56 · Jun 1, 2025 · Updated 8 months ago
- A minimal RP2350 board that can be used as a security key or a rubber ducky ☆20 · Updated this week
- The official repo of the paper "Cal-SFDA: Source-Free Domain-adaptive Semantic Segmentation with Differentiable Expected Calibration Erro… ☆10 · Oct 29, 2023 · Updated 2 years ago
- Encode and decode audio samples to/from continuous and discrete compressed representations! ☆101 · Nov 25, 2025 · Updated 2 months ago
- https://wavelandspeech.github.io/ ☆10 · Jan 12, 2024 · Updated 2 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-… ☆17 · Feb 1, 2026 · Updated last week
- Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks" ☆20 · Aug 18, 2025 · Updated 5 months ago
- Sound field reconstruction using neural processes with dynamic kernels ☆15 · Mar 25, 2025 · Updated 10 months ago
- Translation layer that converts textured/custom server-side Polymer blocks and items to native Bedrock representations. ☆23 · Feb 1, 2026 · Updated last week
- ☆13 · Apr 21, 2022 · Updated 3 years ago
- Perceived Music Quality Dataset ☆12 · Jul 1, 2024 · Updated last year
- Firmata Client for Max/MSP ☆11 · Jan 19, 2022 · Updated 4 years ago
- [F]inding [T]hings [I]n [S]tuff ☆14 · Jul 28, 2022 · Updated 3 years ago