From the paper "Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition"
☆27Nov 20, 2024Updated last year
Alternatives and similar repositories for Paraformer-V2
Users that are interested in Paraformer-V2 are comparing it to the libraries listed below.
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- ☆12Nov 13, 2024Updated last year
- ☆20Nov 28, 2024Updated last year
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆25Aug 21, 2024Updated last year
- This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)☆15May 30, 2019Updated 6 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- Silero VAD PyTorch implementation☆36Nov 23, 2024Updated last year
- ☆18Jun 12, 2025Updated 10 months ago
- Implementation of CTC alignment-based single step non-autoregressive transformer☆13Jun 2, 2023Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- A list of papers for child ASR☆52Oct 8, 2024Updated last year
- (WIP) Long-form speech generation☆31Apr 2, 2025Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for TensorFlow.☆12Jul 5, 2021Updated 4 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆55Dec 6, 2023Updated 2 years ago
- ☆58Apr 24, 2024Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- ☆19Sep 10, 2024Updated last year
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- ☆12Nov 7, 2024Updated last year
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆14Jan 24, 2026Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆76Jul 29, 2024Updated last year
- Official code for the paper "Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆36Jan 28, 2026Updated 2 months ago
- ☆20Jul 22, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- An upgraded framework for training and validation, compared with icefall, using Lightning.☆15Mar 26, 2025Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- Models and code for the INTERSPEECH 2023 paper "DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model"☆13Mar 30, 2025Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated 11 months ago
- ☆16Apr 24, 2025Updated 11 months ago
- WarpRNNT loss ported to Numba CPU/CUDA for PyTorch☆17Mar 11, 2022Updated 4 years ago
- Source code for Consistent ensemble distillation for audio tagging☆63Mar 20, 2026Updated 3 weeks ago
- ☆13Sep 12, 2024Updated last year