FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)
☆19Nov 15, 2023Updated 2 years ago
Alternatives and similar repositories for FitHuBERT
Users that are interested in FitHuBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆117Jan 26, 2024Updated 2 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆16Apr 29, 2025Updated 11 months ago
- Neural Network Quantization With Fractional Bit-widths☆11Feb 19, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Qualifying Exam Preparing☆17May 7, 2025Updated 11 months ago
- ☆11Nov 12, 2020Updated 5 years ago
- Profiling and Improving the PyTorch Dataloader for high-latency Storage☆20Apr 18, 2023Updated 2 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆20Apr 24, 2025Updated 11 months ago
- ☆12Nov 30, 2021Updated 4 years ago
- A simple compiler for SysY (tiny C). Based on Bison, LLVM.☆10Dec 24, 2024Updated last year
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆20Jun 11, 2022Updated 3 years ago
- ☆14Apr 6, 2025Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Defending against Adversarial Audio via Diffusion Model (ICLR 2023)☆35Mar 2, 2023Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆41Aug 29, 2024Updated last year
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 10 months ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆57May 19, 2023Updated 2 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" [IEEE MLSP 2025] …☆39Jul 31, 2024Updated last year
- Minimal module for computing audio spectrograms☆15Feb 28, 2019Updated 7 years ago
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI☆185May 30, 2023Updated 2 years ago
- Script to perform statistical significance test between ASR hypotheses.☆23Aug 13, 2017Updated 8 years ago
- Implementation of a differentially private generative adversarial network.☆11Nov 20, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆42Aug 14, 2025Updated 8 months ago
- Code for the paper: Separate but togerher: Unsupervised Federated Learning for Speech Enhancement from non-iid data☆41Nov 1, 2021Updated 4 years ago
- ☆21Feb 11, 2022Updated 4 years ago
- A DP beam-search extension of Mitchell Stern's span-based neural constituency parser☆11Aug 24, 2022Updated 3 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- ☆76Oct 25, 2021Updated 4 years ago
- UniParse: A universal graph-based parsing toolkit☆10Oct 2, 2019Updated 6 years ago
- docker for HF wav2vec2-sprint☆13Mar 26, 2021Updated 5 years ago
- Code accompanying AES Semantic Audio Conference paper titled "A Dataset and Method for Guitar Solo Detection in Rock Music"☆12Jan 18, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- [CCS-LAMPS 2024] Mitigating Unauthorized Speech Synthesis for Voice Protection☆19Nov 1, 2024Updated last year
- ☆15Oct 19, 2020Updated 5 years ago
- FUSION is an open-source project aimed at revolutionizing networking through the simulation of advanced SD-EONs and AI-enhanced networks,…☆13Mar 19, 2026Updated 3 weeks ago
- This project contains the necessary files to reproduce the paper: "Explaining Character-Aware Neural Networks for Word-Level Prediction: …☆12Nov 15, 2018Updated 7 years ago
- ☆12Dec 29, 2023Updated 2 years ago
- Keras implementation of SwishNet (arXiv:1812.00149)☆15Feb 25, 2019Updated 7 years ago