From the paper "Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition"
☆29Nov 20, 2024Updated last year
Alternatives and similar repositories for Paraformer-V2
Users who are interested in Paraformer-V2 are comparing it to the libraries listed below.
- Hpyformer based on FunASR☆30Nov 5, 2024Updated last year
- ☆20Nov 28, 2024Updated last year
- ☆12Nov 13, 2024Updated last year
- ☆41May 12, 2026Updated 2 weeks ago
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆26Aug 21, 2024Updated last year
- This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)☆15May 30, 2019Updated 6 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- PyTorch implementation of silero-vad☆36Nov 23, 2024Updated last year
- ☆20Jun 12, 2025Updated 11 months ago
- ☆13Sep 25, 2024Updated last year
- Implementation of CTC alignment-based single step non-autoregressive transformer☆13Jun 2, 2023Updated 2 years ago
- A list of papers for child ASR☆53Oct 8, 2024Updated last year
- (WIP) long-form speech generation☆31Apr 2, 2025Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆57Dec 6, 2023Updated 2 years ago
- ☆59Apr 24, 2024Updated 2 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- ☆19Sep 10, 2024Updated last year
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- ☆12Nov 7, 2024Updated last year
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆14Jan 24, 2026Updated 4 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆77Jul 29, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆37Jan 28, 2026Updated 4 months ago
- ☆21Jul 22, 2022Updated 3 years ago
- An upgraded framework for training and validation, compared with icefall, using Lightning.☆16Mar 26, 2025Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆70Nov 1, 2024Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated last year
- ☆17Apr 24, 2025Updated last year
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- Source code for Consistent ensemble distillation for audio tagging☆68Mar 20, 2026Updated 2 months ago