TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.
☆26Jun 1, 2023Updated 2 years ago
Alternatives and similar repositories for TriNet
Users that are interested in TriNet are comparing it to the libraries listed below
Sorting:
- ☆15Apr 2, 2025Updated 11 months ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆21Nov 25, 2024Updated last year
- UIE(Universal Information Extraction) infer by ncnn☆15Sep 22, 2024Updated last year
- ☆33Nov 27, 2021Updated 4 years ago
- Implementation of Google's USM speech model in Pytorch☆35Feb 7, 2026Updated last month
- ☆37Jun 30, 2022Updated 3 years ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Jan 27, 2025Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Mar 25, 2024Updated last year
- ☆26Apr 21, 2021Updated 4 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆151Jun 5, 2025Updated 9 months ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆31Mar 6, 2025Updated last year
- ☆61Nov 4, 2023Updated 2 years ago
- [ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.☆118Oct 17, 2025Updated 5 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- ☆12Mar 11, 2025Updated last year
- ☆36Sep 6, 2025Updated 6 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 8 months ago
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆61Sep 24, 2025Updated 5 months ago
- Collect Voice Conversion researches☆96Updated this week
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 3 weeks ago
- ☆37Feb 23, 2022Updated 4 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated 2 months ago
- Pytorch Implementation of WaveNODE☆64Sep 4, 2020Updated 5 years ago
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆11Mar 22, 2023Updated 3 years ago
- ☆67Aug 16, 2023Updated 2 years ago