kyegomez/Audio-xLSTMs

Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch

☆20

Alternatives and similar repositories for Audio-xLSTMs

Users who are interested in Audio-xLSTMs are comparing it to the libraries listed below.

kwatcharasupat / musdb25
MUSDB25 - A Fully Multitrack Dataset for Music Source Separation
☆13 · Mar 29, 2025 · Updated last year
vincenzomadaghiele / MINGUS
A transformer neural network that generates symbolic music improvising over chord changes.
☆19 · Jul 14, 2024 · Updated 2 years ago
deezer / musicFPaugment
Code for reproducing the paper Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
☆17 · Oct 31, 2023 · Updated 2 years ago
The-Swarm-Corporation / OmniParse
Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …
☆20 · Oct 13, 2025 · Updated 9 months ago
chymaera96 / NeuralSampleID
An automatic sample identification (ASID) system using a contrastively trained GNN encoder.
☆17 · Sep 21, 2025 · Updated 10 months ago
Sreyan88 / ReCLAP
☆33 · Dec 23, 2025 · Updated 7 months ago
Pliploop / SemiSupCon
Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning.
☆17 · Jul 24, 2024 · Updated 2 years ago
raraz15 / neural-music-fp
"Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025
☆38 · Sep 11, 2025 · Updated 10 months ago
nomonosound / fast-align-audio
A fast python library for aligning similar audio snippets passed in as NumPy arrays
☆50 · Oct 27, 2025 · Updated 8 months ago
SarthakYadav / axlstm-official
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21 · Sep 7, 2025 · Updated 10 months ago
aframires / TIVlib
TIVlib is an open-source library for the content-based tonal description of musical audio signals.
☆55 · Sep 17, 2024 · Updated last year
kyegomez / dev-swarm
A swarm of LLM agents that will help you test, document, and productionize your code!
☆20 · Updated this week
chymaera96 / GraFP
Official repository for GraFPrint: an audio identification framework based on graph neural networks.
☆41 · Sep 18, 2025 · Updated 10 months ago
The-Swarm-Corporation / AgentGym
A framework making it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's r1
☆24 · Oct 13, 2025 · Updated 9 months ago
nomonosound / log-wmse-audio-quality
logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…
☆39 · Jun 24, 2025 · Updated last year
Sreyan88 / CompA
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23 · Jul 10, 2024 · Updated 2 years ago
KdaiP / conformer-RoPE
Conformer block with Rotary Position Embedding, modified from lucidrains' implementation
☆19 · Sep 13, 2024 · Updated last year
eloimoliner / BABE2-music-restoration
☆61 · Apr 22, 2024 · Updated 2 years ago
JusperLee / Look2hear
A toolkit for researchers in multimodal sound separation.
☆16 · Oct 20, 2023 · Updated 2 years ago
The-Swarm-Corporation / NewsAgent
NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.
☆29 · Oct 13, 2025 · Updated 9 months ago
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta
☆13 · Nov 11, 2024 · Updated last year
juhayna-zh / BSRNN-speech-preprocess
A solution for denoising and separating two-speaker mixed noisy speech, using a BSRNN-inspired network.
☆15 · Aug 22, 2023 · Updated 2 years ago
kyegomez / FastFF
Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"
☆16 · Nov 11, 2024 · Updated last year
moiseshorta / MelGAN-VC
MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms
☆12 · Nov 25, 2021 · Updated 4 years ago
seungheondoh / musical-word-embedding
Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]
☆29 · Apr 23, 2024 · Updated 2 years ago
eloimoliner / unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
☆12 · May 31, 2022 · Updated 4 years ago
The-Swarm-Corporation / HTX-Swarm
A sophisticated multi-agent system designed for real-time market analysis of HTX (formerly Huobi) exchange data. This swarm combines spec…
☆11 · Mar 18, 2025 · Updated last year
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15 · Mar 11, 2024 · Updated 2 years ago
aaronabebe / micro-musicgen
a new family of super small music generation models focusing on experimental music and latent space exploration capabilities
☆36 · May 9, 2024 · Updated 2 years ago
The-Swarm-Corporation / AgentParse
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆18 · Oct 13, 2025 · Updated 9 months ago
infojunkie / scalextric
Like Huygens-Fokker Scala, but electric.
☆16 · Sep 30, 2025 · Updated 9 months ago
yuhangear / wenet-android
☆13 · Oct 27, 2021 · Updated 4 years ago
Pexeso / audio-fingerprinting-benchmark-toolkit
☆21 · Dec 19, 2023 · Updated 2 years ago
The-Swarm-Corporation / Brainwave
Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…
☆14 · Oct 6, 2025 · Updated 9 months ago
xavierfav / coala
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations
☆48 · Jul 25, 2024 · Updated 2 years ago
kyegomez / SoundStream
Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆13 · Jan 27, 2025 · Updated last year
kyegomez / USM
Implementation of Google's USM speech model in PyTorch
☆36 · Updated this week
yuhangsu82 / AMG-Embedding
A self-supervised method for feature extraction from audio.
☆21 · Apr 9, 2026 · Updated 3 months ago
kyegomez / Paper-Implementation-Template
A simple reproducible template to implement AI research papers
☆24 · Sep 9, 2024 · Updated last year