CorentinJ / TorchStreamLinks
A library for making PyTorch models streamable
☆57Updated 2 weeks ago
Alternatives and similar repositories for TorchStream
Users that are interested in TorchStream are comparing it to the libraries listed below
Sorting:
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆149Updated last week
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆149Updated 3 months ago
- Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet"☆82Updated 2 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆106Updated 3 weeks ago
- Audiogen Codec☆144Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Updated this week
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆50Updated this week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 8 months ago
- Official repository of Wavehax vocoder☆66Updated last month
- Audio tokenization, in the fastest way possible!☆53Updated last year
- ☆86Updated last year
- GPT for FACodec☆13Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆51Updated 10 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆107Updated last year
- An unofficial PyTorch implementation of VALL-E☆88Updated 6 months ago
- Official implementation for FlowSep☆69Updated last year
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆197Updated last week
- OpenFLAM: Framewise Language Audio Model☆86Updated 3 weeks ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- small audio language model for reasoning☆86Updated 2 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆197Updated 6 months ago
- ☆61Updated 2 years ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆101Updated 3 months ago
- ☆44Updated last year
- ☆32Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆135Updated 5 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Updated last year
- ☆106Updated 4 months ago
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆43Updated 3 years ago