idiap / hypermixing
PyTorch implementation for HyperMixing, a linear-time token-mixing technique used in HyperMixer architecture
☆23Updated last year
Alternatives and similar repositories for hypermixing:
Users that are interested in hypermixing are comparing it to the libraries listed below
- ConMamba for Automatic Speech Recognition☆66Updated 8 months ago
- ☆43Updated last year
- The project for speech translation☆11Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated 7 months ago
- ☆24Updated 8 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆28Updated 9 months ago
- A neural speech codec based on discrete WavLM representations☆23Updated 7 months ago
- A low-bitrate single-codebook 16 kHz speech codec based on focal modulation☆84Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆52Updated 5 months ago
- Source code for DM-Codec.☆41Updated 6 months ago
- ☆75Updated 6 months ago
- The demo page for ALMTokenizer☆42Updated last week
- A lightweight audio codec based on a single quantizer☆50Updated last week
- End-to-End Speech Processing Toolkit☆13Updated 3 months ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆16Updated 9 months ago
- ARCH: Audio Representations benCHmark☆44Updated 7 months ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated last month
- ☆10Updated 4 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 8 months ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆49Updated last year
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆37Updated last year
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆54Updated 3 months ago
- ☆18Updated 11 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆72Updated 2 years ago
- An ODE-based generative neural vocoder using Rectified Flow☆60Updated last year
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆72Updated last year
- A spoken version of the textual story cloze benchmark☆16Updated last year
- ☆15Updated last year