来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition
☆27Nov 20, 2024Updated last year
Alternatives and similar repositories for Paraformer-V2
Users that are interested in Paraformer-V2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- ☆12Nov 13, 2024Updated last year
- ☆20Nov 28, 2024Updated last year
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- ☆11Oct 14, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆25Aug 21, 2024Updated last year
- This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)☆15May 30, 2019Updated 6 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- ☆18Jun 12, 2025Updated 9 months ago
- ☆13Sep 25, 2024Updated last year
- Implementation of CTC alignment-based single step non-autoregressive transformer☆13Jun 2, 2023Updated 2 years ago
- A list of papers for child ASR☆52Oct 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- faster inference☆28Jan 20, 2025Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- ☆58Apr 24, 2024Updated last year
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆55Dec 6, 2023Updated 2 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- ☆19Sep 10, 2024Updated last year
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Nov 7, 2024Updated last year
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆14Jan 24, 2026Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆76Jul 29, 2024Updated last year
- ☆20Jul 22, 2022Updated 3 years ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆36Jan 28, 2026Updated 2 months ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated 10 months ago
- Source code for Consistent ensemble distillation for audio tagging☆60Mar 20, 2026Updated last week
- ☆16Apr 24, 2025Updated 11 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- ☆13Sep 12, 2024Updated last year