CorentinJ / TorchStreamLinks
A library for making PyTorch models streamable
☆37Updated this week
Alternatives and similar repositories for TorchStream
Users that are interested in TorchStream are comparing it to the libraries listed below
Sorting:
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆127Updated 2 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆138Updated last month
- An unofficial PyTorch implementation of VALL-E☆88Updated 4 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- Official repository of Wavehax vocoder☆62Updated this week
- ☆61Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆54Updated 2 years ago
- Audiogen Codec☆143Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆75Updated last week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 6 months ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆47Updated 3 months ago
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆43Updated 3 years ago
- Collection of scripts from mHuBERT-147.☆32Updated last year
- ☆70Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆90Updated 10 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆193Updated 4 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆129Updated 4 months ago
- GPT for FACodec☆13Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆44Updated 2 months ago
- small audio language model for reasoning☆81Updated last week
- Official implementation for FlowSep☆68Updated 11 months ago
- ☆43Updated last year
- ☆102Updated 2 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆104Updated last year
- A TTS model that makes a speaker speak new languages☆76Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆113Updated 11 months ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 8 months ago