kyegomez/SoundStream

kyegomez / SoundStream

Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"

☆13

Alternatives and similar repositories for SoundStream

Users that are interested in SoundStream are comparing it to the libraries listed below.

kyegomez / forest-of-thoughts
A forest of autonomous agents.
☆20 · Jan 27, 2025 · Updated last year
The-Swarm-Corporation / agentverse
Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!
☆17 · Dec 22, 2025 · Updated 7 months ago
haydenshively / SoundStream
Implementation of SoundStream, an end-to-end neural audio codec
☆33 · Jun 11, 2023 · Updated 3 years ago
kyegomez / OpenStrawberry
An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆30 · Updated this week
csiro-robotics / iSICE
[CVPR2023] The official repository for paper "Learning Partial Correlation based Deep Visual Representation for Image Classification" To …
☆10 · Nov 21, 2023 · Updated 2 years ago
kyegomez / dev-swarm
A swarm of LLM agents that will help you test, document, and productionize your code!
☆20 · Updated this week
BJTUSensor / CS_KSVD
This code aims to reconstruct the original BGS by using a compressed sensing method based on K-SVD algorithm.
☆10 · Oct 6, 2022 · Updated 3 years ago
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta
☆13 · Nov 11, 2024 · Updated last year
kyegomez / FastFF
Zeta implementation of a reusable, plug-and-play feedforward layer from the paper "Exponentially Faster Language Modeling"
☆16 · Nov 11, 2024 · Updated last year
CodingVillainKor / SimpleDeepLearning
Simple deep learning projects
☆18 · May 20, 2026 · Updated 2 months ago
The-Swarm-Corporation / HTX-Swarm
A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…
☆11 · Mar 18, 2025 · Updated last year
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15 · Mar 11, 2024 · Updated 2 years ago
The-Swarm-Corporation / AgentParse
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆18 · Oct 13, 2025 · Updated 9 months ago
konstantgr / smatched
Web app for makeup transfer using Stable Diffusion
☆10 · Sep 11, 2023 · Updated 2 years ago
The-Swarm-Corporation / Brainwave
Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…
☆14 · Oct 6, 2025 · Updated 9 months ago
kyegomez / Pairformer
Implementation of the Pairformer model used in AlphaFold 3
☆14 · Updated this week
kyegomez / USM
Implementation of Google's USM speech model in PyTorch
☆36 · Updated this week
kyegomez / OmniByteFormer
OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…
☆15 · Updated this week
kyegomez / TinyGPTV
Simple Implementation of TinyGPTV in super simple Zeta lego blocks
☆16 · Nov 11, 2024 · Updated last year
LAION-AI / worldsim
☆13 · Aug 29, 2023 · Updated 2 years ago
jiang-du / Perceptual-CS
Official code for papers "Perceptual Compressive Sensing" at PRCV 2018 and "Fully Convolutional Measurement Network for Compressive Sensi…
☆18 · Aug 6, 2019 · Updated 6 years ago
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆20 · Updated this week
LeeKeyu / abdominal_ultrasound_classification
Combining deep neural networks with PCA and k-NN classification for abdominal organ recognition in ultrasound images.
☆28 · Oct 12, 2021 · Updated 4 years ago
kyegomez / TeraGPT
Train a production-grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT
☆17 · Updated this week
ZackBradshaw / ikigAI
☆13 · Mar 28, 2024 · Updated 2 years ago
kyegomez / CogNetX
CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…
☆20 · Updated this week
kyegomez / AutoRT
Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"
☆44 · Nov 11, 2024 · Updated last year
kyegomez / Tiktokx
Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…
☆14 · Aug 18, 2023 · Updated 2 years ago
kyegomez / HSSS
Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…
☆16 · Nov 11, 2024 · Updated last year
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆19 · Mar 11, 2024 · Updated 2 years ago
The-Swarm-Corporation / OmniParse
Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …
☆20 · Oct 13, 2025 · Updated 9 months ago
kyegomez / MELLE
An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"
☆16 · Updated this week
The-Swarm-Corporation / awesome-automated-prompt-engineering
This repository serves as a central hub for discovering tools and services focused on automated prompt engineering. Whether you're lookin…
☆16 · Oct 11, 2024 · Updated last year
transformsai / AgenceTrainingEnvironment
☆13 · Mar 12, 2021 · Updated 5 years ago
ml-for-speech / speechtoolkit
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆22 · Jan 10, 2025 · Updated last year
The-Swarm-Corporation / radiology-swarm
A powerful, enterprise-grade multi-agent system for advanced radiological analysis, diagnosis, and treatment planning. This system levera…
☆18 · Oct 13, 2025 · Updated 9 months ago
lonelywing / POSTECH_thesis_template_latex
☆36 · Nov 21, 2022 · Updated 3 years ago
kyegomez / Prometheus
Welcome to Prometheus, the revolutionary AI model that allows you to generate DNA sequences for any creature you can imagine. Whether it’…
☆15 · Updated this week
walkoncross / voxceleb2-download-zyf
Tools for downloading the VoxCeleb2 dataset
☆35 · Mar 16, 2024 · Updated 2 years ago