A Pytorch Implementations for Various Vector Quantization Methods
☆36Sep 14, 2021Updated 4 years ago
Alternatives and similar repositories for pytorch-vector-quantization
Users that are interested in pytorch-vector-quantization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- A Web Application for Baroque-style Human/Computer Musical Jamming.☆15May 31, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official implemtation of UniverSR (ICASSP 2026)☆55Apr 9, 2026Updated 2 months ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 3 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Dec 3, 2024Updated last year
- ☆10Apr 8, 2024Updated 2 years ago
- An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/☆28Feb 12, 2021Updated 5 years ago
- Subband Adaptive System with Crossterms for aliasing reduction☆18Jul 31, 2022Updated 3 years ago
- ☆16Feb 10, 2026Updated 4 months ago
- Implementation of SoundStorm built upon SpeechTokenizer.☆116Nov 2, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆37Feb 10, 2024Updated 2 years ago
- A benchmark corpus for ASR hypothesis revising task☆21Sep 26, 2023Updated 2 years ago
- A framework for Bayesian optimization of composite functions.☆15Dec 8, 2022Updated 3 years ago
- ☆31Jul 13, 2023Updated 2 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Jun 9, 2023Updated 3 years ago
- Deep clustering for seismic signals (icequakes and earthquakes)☆15Dec 25, 2021Updated 4 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Implementation of a simple BPE tokenizer, but in Nim☆22Jul 2, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆37May 8, 2021Updated 5 years ago
- Information on GrIMP tools with links to other repositories☆21Aug 20, 2025Updated 10 months ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆80Mar 17, 2025Updated last year
- Processing functions for Fiber Optic Distributed Sensing (FODS) data.☆23May 9, 2023Updated 3 years ago
- ☆10Mar 29, 2021Updated 5 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Nov 29, 2022Updated 3 years ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆68Mar 5, 2026Updated 3 months ago
- This is the code for the EMNLP2020 Finding paper "BERT for Monolingual and Cross-Lingual Reverse Dictionary"☆19Sep 27, 2020Updated 5 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A Multi-tasking and Multi-stage Chinese Minority Pre-Trained Language Model☆12Jul 24, 2023Updated 2 years ago
- This repo includes beat and bar annotations for the ballroom dataset.☆25Sep 6, 2023Updated 2 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Oct 19, 2022Updated 3 years ago
- A research of Manchu hypothesis of Voynich manuscript. It's an Oracle database with tabes, DML scripts, PLSQL functions and queries.☆16Jun 11, 2014Updated 12 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 4 years ago
- The official repo of our research work "Interactive Editing for Text Summarization".☆23Jun 3, 2023Updated 3 years ago
- TPSE-GST Tacotron2☆14May 1, 2019Updated 7 years ago