Implementation of the paper "Can Large Language Models Predict Audio Effects Parameters from Natural Language?"
☆28 · May 27, 2025 · Updated 9 months ago
Alternatives and similar repositories for LLM2Fx
Users who are interested in LLM2Fx are comparing it to the repositories listed below.
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆38 · Oct 28, 2025 · Updated 4 months ago
- PyTorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", … ☆13 · Aug 26, 2022 · Updated 3 years ago
- Multitrack music mixing style transfer given a reference song, using a differentiable mixing console. ☆58 · Jul 7, 2025 · Updated 8 months ago
- ☆12 · Nov 7, 2024 · Updated last year
- ☆28 · Jul 7, 2025 · Updated 8 months ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil… ☆42 · Updated this week
- "Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures" ☆49 · Aug 23, 2025 · Updated 7 months ago
- ☆37 · Nov 18, 2025 · Updated 4 months ago
- ☆32 · Nov 24, 2024 · Updated last year
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems". ☆22 · Jun 10, 2024 · Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆18 · Aug 16, 2024 · Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter to edit and control the target language's phoneme… ☆23 · Aug 14, 2025 · Updated 7 months ago
- ☆13 · Sep 12, 2024 · Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 · Oct 20, 2024 · Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition" ☆10 · Dec 15, 2022 · Updated 3 years ago
- Source code repository for the SMC paper "Musical Tempo and Key Estimation using Convolutional Neural Networks with Directional Filters". ☆34 · Mar 24, 2023 · Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database. ☆12 · Mar 15, 2025 · Updated last year
- ☆11 · Mar 22, 2023 · Updated 3 years ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation ☆60 · Jul 2, 2025 · Updated 8 months ago
- BachDuet enables a human performer to improvise a duet counterpoint with a computer agent in real time. ☆14 · Aug 8, 2022 · Updated 3 years ago
- A list of datasets made available by members of the Aalto Acoustics Lab ☆29 · Sep 6, 2024 · Updated last year
- ☆12 · Oct 9, 2023 · Updated 2 years ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper. ☆39 · Jul 8, 2024 · Updated last year
- Whisper Speech Quality Assessment (WhiSQA) ☆16 · Oct 14, 2025 · Updated 5 months ago
- ☆14 · Sep 13, 2022 · Updated 3 years ago
- ☆19 · Mar 22, 2024 · Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 · Feb 14, 2024 · Updated 2 years ago
- Learnable STRF, from Riad et al. 2021 JASA ☆13 · Aug 21, 2021 · Updated 4 years ago
- ☆26 · Mar 23, 2024 · Updated last year
- Unofficial implementation of "CPTNN: Cross-Parallel Transformer Neural Network for Time-Domain Speech Enhancement" ☆15 · Nov 14, 2023 · Updated 2 years ago
- Code for ChordSync, a conformer-based audio-to-chord synchroniser ☆13 · Oct 17, 2025 · Updated 5 months ago
- ☆21 · Jul 15, 2024 · Updated last year
- OpenFLAM: Framewise Language Audio Model ☆101 · Jan 14, 2026 · Updated 2 months ago
- Swarah: Indian-English speech dataset collected across the country ☆37 · Jul 3, 2025 · Updated 8 months ago
- ☆30 · Feb 4, 2021 · Updated 5 years ago
- Guqin performance analysis ☆12 · Aug 31, 2020 · Updated 5 years ago
- Python code used to analyze and process symbolic drum patterns ☆14 · May 8, 2023 · Updated 2 years ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks ☆18 · May 12, 2025 · Updated 10 months ago
- Implementation of vocoders built with PyTorch Lightning ☆18 · Jan 27, 2024 · Updated 2 years ago