Codebase for the ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval"
☆36Jun 13, 2026Updated 2 weeks ago
Alternatives and similar repositories for wav2tok
Users that are interested in wav2tok are comparing it to the libraries listed below.
- ☆18Jan 20, 2025Updated last year
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆38Sep 11, 2025Updated 9 months ago
- ☆19Feb 2, 2023Updated 3 years ago
- The ArtificialSongGenerator automatically composes and compiles the Artificial Audio Multitrack dataset (AAM).☆27Nov 17, 2025Updated 7 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆29Sep 12, 2024Updated last year
- ☆265Feb 14, 2024Updated 2 years ago
- Neural Network Audio FingerPrint☆65Mar 5, 2023Updated 3 years ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆44Jun 13, 2024Updated 2 years ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- ☆64Apr 28, 2026Updated 2 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆40Feb 24, 2025Updated last year
- ☆12Mar 11, 2025Updated last year
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆76Aug 30, 2021Updated 4 years ago
- A text-conditional diffusion probabilistic model capable of generating high-fidelity audio.☆198May 29, 2024Updated 2 years ago
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆166Dec 8, 2025Updated 6 months ago
- Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)☆84Dec 3, 2024Updated last year
- Geometry features for block window cover song identification (a continuation of my ISMIR 2015 paper)☆24Jul 6, 2023Updated 2 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- music semantic understanding evaluation benchmark☆24Aug 12, 2023Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- ☆51Mar 5, 2026Updated 3 months ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆68Mar 5, 2026Updated 3 months ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆41Mar 10, 2025Updated last year
- ISMIR 24 Supplementary Material☆14Oct 28, 2024Updated last year
- ☆18Jun 24, 2025Updated last year
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the speaker ID to implement voice cloning.☆30Sep 16, 2022Updated 3 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Oct 11, 2024Updated last year
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations".☆21Dec 3, 2021Updated 4 years ago
- List of academic resources on Multimodal ML for Music☆301Mar 25, 2023Updated 3 years ago
- ☆13Jul 14, 2024Updated last year
- Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation☆143Mar 8, 2026Updated 3 months ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆36Aug 1, 2025Updated 10 months ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Unofficial implementation of NANSY++ in PyTorch Lightning☆50Mar 11, 2024Updated 2 years ago
- ☆32Nov 25, 2023Updated 2 years ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆66Nov 5, 2025Updated 7 months ago
- Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.…☆55Jan 18, 2024Updated 2 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year