Simplistic Implementation of Zipformer: A faster and better encoder for automatic speech recognition in PyTorch
☆20Jun 3, 2024Updated last year
Alternatives and similar repositories for simplistic-zipformer
Users who are interested in simplistic-zipformer are comparing it to the libraries listed below.
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆18Sep 13, 2024Updated last year
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Sep 7, 2025Updated 7 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- One-shot Generative Prior in Hankel-k-space for Parallel Imaging Reconstruction☆12Dec 4, 2024Updated last year
- Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing☆19Apr 10, 2024Updated 2 years ago
- NUEDC 2021 G by OpenMV4☆13Nov 19, 2021Updated 4 years ago
- A curated collection of prompts for Grok Imagine by xAI☆28Oct 19, 2025Updated 6 months ago
- Chinese speech recognition (a speech recognition model trained on the AISHELL-3 dataset)☆11Oct 17, 2024Updated last year
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 11 months ago
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation☆13Mar 29, 2025Updated last year
- This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…☆46Sep 6, 2023Updated 2 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆260Dec 12, 2025Updated 4 months ago
- Unofficial implementation of SCP-GAN☆18Jul 4, 2023Updated 2 years ago
- [KDD 2026] Voxlect: A Speech Foundation Model Benchmark for Modeling Dialects and Regional Languages Around the Globe☆32Aug 10, 2025Updated 8 months ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- A real-time interactive voice project☆37Jan 29, 2026Updated 3 months ago
- ☆10Jun 11, 2024Updated last year
- Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…☆14Nov 13, 2023Updated 2 years ago
- An automatic sample identification (ASID) system using a contrastively trained GNN encoder.☆14Sep 21, 2025Updated 7 months ago
- An implementation of the Orthogonal Matching Pursuit (OMP) algorithm for recovery of signals in compressive sensing☆23Feb 16, 2018Updated 8 years ago
- For audio visualization and playback in Jupyter notebooks.☆17Nov 25, 2025Updated 5 months ago
- iSeparate library for the SDX2023 challenge☆15Dec 15, 2023Updated 2 years ago
- PyTorch Implementation of [AudioLCM]: an efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- ☆12Dec 14, 2024Updated last year
- A machine learning algorithm that estimates the directions of arrival and relative levels of an arbitrary number of sound sources using r…☆12Dec 10, 2022Updated 3 years ago
- ☆17Mar 10, 2024Updated 2 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 22, 2026Updated 2 weeks ago
- Material for the course of "Mathematics of Transformer"☆22Aug 3, 2025Updated 9 months ago
- A toolkit for researchers in multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retrieval-Augmented Generation) lan…☆11Apr 2, 2024Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆20Apr 20, 2026Updated 2 weeks ago
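- A lightweight speech recognition decoding and inference framework trimmed down from Kaldi; currently implements MFCC+GMM+Viterbi, with no dependency on libraries such as OpenFST or OpenBLAS☆22Jul 31, 2021Updated 4 years ago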
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- Official code for SongEcho☆59Mar 3, 2026Updated 2 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated 11 months ago
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.☆52Jul 28, 2025Updated 9 months ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 10 months ago