Simplistic Implementation of Zipformer:A faster and better encoder for automatic speech recognition in PyTorch
☆18Jun 3, 2024Updated last year
Alternatives and similar repositories for simplistic-zipformer
Users that are interested in simplistic-zipformer are comparing it to the libraries listed below
Sorting:
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆17Sep 13, 2024Updated last year
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Sep 7, 2025Updated 5 months ago
- Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing☆19Apr 10, 2024Updated last year
- A curated collection of prompts for Grok Imagine by xAI☆22Oct 19, 2025Updated 4 months ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 9 months ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆251Dec 12, 2025Updated 2 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- An automatic sample identification (ASID) system using a contrastively trained GNN encoder.☆13Sep 21, 2025Updated 5 months ago
- A machine learning algorithm that estimates the directions of arrival and relative levels of an arbitrary number of sound sources using r…☆12Dec 10, 2022Updated 3 years ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆46Sep 6, 2023Updated 2 years ago
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.☆49Jul 28, 2025Updated 7 months ago
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Official Repository of Six Dragons Fly Again (ISMIR 2024)☆13Nov 13, 2025Updated 3 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- Speech Separation☆10Jan 6, 2022Updated 4 years ago
- ☆13Jul 23, 2024Updated last year
- Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…☆14Nov 13, 2023Updated 2 years ago
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Jul 17, 2023Updated 2 years ago
- Material for the course of "Mathematics of Transformer"☆19Aug 3, 2025Updated 7 months ago
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation☆13Mar 29, 2025Updated 11 months ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- A fourier-based audio-synthesiser wrote in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- Speech enhancement| Beamforming| NN Mask Estimation| LSTM| DTLN☆15Mar 8, 2023Updated 2 years ago
- singing voice conversion based on glow-tts☆12Aug 20, 2023Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retriever-Augmented Generation) lan…☆11Apr 2, 2024Updated last year
- iSeparate library for the SDX2023 challenge☆14Dec 15, 2023Updated 2 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆59Apr 3, 2025Updated 11 months ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- ☆20Aug 25, 2025Updated 6 months ago
- The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance…☆38Oct 10, 2025Updated 4 months ago
- ☆16Jun 1, 2023Updated 2 years ago
- The Official Code Repo for EgoOrientBench [CVPR25]☆14Nov 24, 2025Updated 3 months ago
- ☆15Oct 31, 2022Updated 3 years ago
- ☆13Oct 11, 2024Updated last year