GoGoDuck912/pytorch-vector-quantization

A PyTorch implementation of various vector quantization methods

☆36

Alternatives and similar repositories for pytorch-vector-quantization

Users interested in pytorch-vector-quantization often compare it to the repositories listed below.

mct10 / CoBERT
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆48 · Nov 8, 2023 · Updated 2 years ago
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 · Sep 4, 2023 · Updated 2 years ago
kimsunwiub / BLOOM-Net
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14 · Feb 13, 2022 · Updated 4 years ago
xinan-chen / AP_BWE
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
☆13 · Jul 22, 2024 · Updated 2 years ago
revsic / torch-whisper-guided-vc
Torch implementation of Whisper-guided DDPM based Voice Conversion
☆49 · Mar 7, 2023 · Updated 3 years ago
lingjzhu / spoken_sent_embedding
Unsupervised spoken sentence embeddings
☆14 · Dec 14, 2022 · Updated 3 years ago
hcy71o / LPC_Speech_Synthesis
Speech synthesis using LPC
☆25 · Jun 5, 2021 · Updated 5 years ago
ilaria-manco / song-describer
Song Describer is a data collection platform for annotating music with textual descriptions.
☆61 · Dec 3, 2024 · Updated last year
viduzz84 / SubbandAdaptiveX
Subband Adaptive System with Crossterms for aliasing reduction
☆18 · Jul 31, 2022 · Updated 3 years ago
ffaisal93 / SD-QA
☆16 · Feb 10, 2026 · Updated 5 months ago
cschaefer26 / StyleMelGAN
☆10 · Apr 8, 2024 · Updated 2 years ago
George0828Zhang / torch_cif
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…
☆37 · Feb 10, 2024 · Updated 2 years ago
ZhangXInFD / soundstorm-speechtokenizer
Implementation of SoundStorm built upon SpeechTokenizer.
☆116 · Nov 2, 2023 · Updated 2 years ago
yongyizang / BachDuet-WebGUI
A Web Application for Baroque-style Human/Computer Musical Jamming.
☆15 · May 31, 2023 · Updated 3 years ago
Alfred0622 / HypR
A benchmark corpus for ASR hypothesis revising task
☆21 · Sep 26, 2023 · Updated 2 years ago
RUCAIBox / LSVCR
☆14 · Apr 1, 2024 · Updated 2 years ago
JSALT-2022-SSL / superb-prosody
☆31 · Jul 13, 2023 · Updated 3 years ago
ga642381 / SpeechGen
"SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts"
☆77 · Jun 9, 2023 · Updated 3 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59 · Jan 12, 2023 · Updated 3 years ago
NeptuneProjects / RISCluster
Deep clustering for seismic signals (icequakes and earthquakes)
☆15 · Dec 25, 2021 · Updated 4 years ago
rhoposit / multilingual_VQVAE
☆37 · May 8, 2021 · Updated 5 years ago
Berkeley-Speech-Group / sylber
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
☆80 · Mar 17, 2025 · Updated last year
voidful / asrp
ASR text preprocessing utility
☆21 · Aug 5, 2024 · Updated last year
BenjaminTMilnes / ManchuDictionary
A Manchu dictionary website
☆12 · Feb 26, 2026 · Updated 4 months ago
fastice / GrIMPTools
Information on GrIMP tools with links to other repositories
☆21 · Aug 20, 2025 · Updated 11 months ago
maxidl / wav2vec2
☆10 · Mar 29, 2021 · Updated 5 years ago
lucidrains / nim-tokenizer
Implementation of a simple BPE tokenizer, but in Nim
☆22 · Jul 2, 2023 · Updated 3 years ago
klapo / pyfocs
Processing functions for Fiber Optic Distributed Sensing (FODS) data.
☆23 · May 9, 2023 · Updated 3 years ago
patrickltobing / shallow-wavenet
☆18 · Feb 9, 2020 · Updated 6 years ago
lyingCS / UOEP
Reinforcing Long-Term Performance in Recommender Systems with User-Oriented Exploration Policy (SIGIR 2024)
☆14 · Oct 6, 2024 · Updated last year
Splend1d / T5lephone
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19 · Nov 29, 2022 · Updated 3 years ago
yhcc / BertForRD
This is the code for the EMNLP 2020 Findings paper "BERT for Monolingual and Cross-Lingual Reverse Dictionary"
☆19 · Sep 27, 2020 · Updated 5 years ago
seoneun / T5-Question-Generation
SQuAD Question Generation module based on T5-large
☆18 · Aug 26, 2022 · Updated 3 years ago
skit-ai / slu-prosody
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27 · May 17, 2023 · Updated 3 years ago
woongzip1 / UniverSR
Official implementation of UniverSR (ICASSP 2026)
☆59 · Apr 9, 2026 · Updated 3 months ago
CPJKU / BallroomAnnotations
This repo includes beat and bar annotations for the ballroom dataset.
☆25 · Sep 6, 2023 · Updated 2 years ago
voidful / llm-codec
LLM-Codec: Neural Audio Codec Meets Language Model Objectives
☆23 · May 3, 2026 · Updated 2 months ago
xrenaa / Retriever
[ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"
☆54 · Oct 19, 2022 · Updated 3 years ago
epfl-radio-astro / bipp
☆11 · Jul 5, 2025 · Updated last year