CorentinJ / TorchStream
A library for making PyTorch models streamable
☆57Updated 2 weeks ago
Alternatives and similar repositories for TorchStream
Users who are interested in TorchStream are comparing it to the libraries listed below.
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec.☆149Updated last week
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆149Updated 3 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 8 months ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- OpenFLAM: Framewise Language Audio Model☆86Updated 3 weeks ago
- GPT for FACodec☆13Updated last year
- Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet"☆82Updated 2 months ago
- ☆86Updated last year
- Official implementation for FlowSep☆69Updated last year
- Official repository of Wavehax vocoder☆66Updated last month
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆107Updated last year
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆47Updated last week
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆51Updated 10 months ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 6 months ago
- ☆124Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆113Updated last week
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆48Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated 3 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆106Updated 3 weeks ago
- A neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kbps or 4.5 kbps.☆197Updated 6 months ago
- Audiogen Codec☆144Updated last year
- ☆44Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Updated 2 months ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆48Updated 5 months ago
- Codebase and project page for EDMSound☆35Updated 2 years ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆101Updated 3 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆16Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated last week