R1ckShi / SeACo-Paraformer
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.
☆39Mar 15, 2024Updated last year
Alternatives and similar repositories for SeACo-Paraformer
Users that are interested in SeACo-Paraformer are comparing it to the libraries listed below
- ☆15Jul 4, 2024Updated last year
- ☆15Aug 25, 2022Updated 3 years ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Paraformer (Chinese ASR) online ONNX runtime for Python☆53Mar 27, 2024Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 2 months ago
- ☆36Sep 6, 2025Updated 5 months ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 8 months ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- Acoustic echo cancellation (AEC) is a main algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- Official Implementation of GLAP - General Language Audio Pretraining☆61Jan 5, 2026Updated last month
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model☆24Oct 28, 2023Updated 2 years ago
- Fixes the bug in funasr where seaco-paraformer has no timestamps after being exported to ONNX☆24Sep 12, 2024Updated last year
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- silero-vad PyTorch implementation☆34Nov 23, 2024Updated last year
- ☆29Feb 4, 2025Updated last year
- ☆24Sep 20, 2024Updated last year
- Llasa Speed Up☆57Jan 18, 2026Updated 3 weeks ago
- ☆14Nov 26, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- OpenAI-Compatible Frontend for Nvidia Triton Inference ASR/TTS Server☆22Jul 29, 2025Updated 6 months ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆36Dec 17, 2024Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 2 months ago
- ☆76Mar 18, 2022Updated 3 years ago
- A code library for training acoustic models☆27May 20, 2020Updated 5 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems☆75Jan 25, 2026Updated 3 weeks ago
- faster inference☆28Jan 20, 2025Updated last year
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- ☆13Mar 11, 2025Updated 11 months ago