Simplistic Implementation of Zipformer:A faster and better encoder for automatic speech recognition in PyTorch
☆17Jun 3, 2024Updated last year
Alternatives and similar repositories for simplistic-zipformer
Users that are interested in simplistic-zipformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆18Sep 13, 2024Updated last year
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Sep 7, 2025Updated 6 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- One-shot Generative Prior in Hankel-k-space for Parallel Imaging Reconstruction☆12Dec 4, 2024Updated last year
- Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing☆19Apr 10, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A curated collection of prompts for Grok Imagine by xAI☆25Oct 19, 2025Updated 5 months ago
- NUEDC 2021 G by OpenMV4☆13Nov 19, 2021Updated 4 years ago
- Chinese speech recognition | 中文语音识别 (使用AISHELL-3数据集训练语音识别模型)☆11Oct 17, 2024Updated last year
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 10 months ago
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation☆13Mar 29, 2025Updated 11 months ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆256Dec 12, 2025Updated 3 months ago
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆46Sep 6, 2023Updated 2 years ago
- Unofficial implementation of SCP-GAN☆18Jul 4, 2023Updated 2 years ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- An automatic sample identification (ASID) system using a contrastively trained GNN encoder.☆13Sep 21, 2025Updated 6 months ago
- Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…☆14Nov 13, 2023Updated 2 years ago
- For audio visualization and playback in Jupyter notebooks.☆17Nov 25, 2025Updated 4 months ago
- An implementation of the Orthogonal Matching Pursuit (OMP) algorithm for recovery of signals in compressive sensing☆22Feb 16, 2018Updated 8 years ago
- iSeparate library for the SDX2023 challenge☆15Dec 15, 2023Updated 2 years ago
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- ☆12Dec 14, 2024Updated last year
- A machine learning algorithm that estimates the directions of arrival and relative levels of an arbitrary number of sound sources using r…☆12Dec 10, 2022Updated 3 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13Sep 12, 2024Updated last year
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 15, 2025Updated 11 months ago
- ☆17Mar 10, 2024Updated 2 years ago
- Material for the course of "Mathematics of Transformer"☆20Aug 3, 2025Updated 7 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retriever-Augmented Generation) lan…☆11Apr 2, 2024Updated last year
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official code for SongEcho☆53Mar 3, 2026Updated 3 weeks ago
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.☆51Jul 28, 2025Updated 7 months ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Implementation of papers in 101 lines of code.☆18Nov 12, 2023Updated 2 years ago
- ☆15Apr 4, 2025Updated 11 months ago