HLTCHKUST / CI-AVSRView external linksLinks
Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.
☆41Jul 16, 2024Updated last year
Alternatives and similar repositories for CI-AVSR
Users that are interested in CI-AVSR are comparing it to the libraries listed below
Sorting:
- ASCEND Chinese-English code-switching dataset☆30Jul 12, 2022Updated 3 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- MagicData-RAMC Dataset and Baseline☆57Sep 13, 2022Updated 3 years ago
- The case study and multilingfual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- ☆99Feb 1, 2024Updated 2 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Jun 22, 2022Updated 3 years ago
- Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…☆68Jul 13, 2022Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020)☆52Jul 30, 2020Updated 5 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 2 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- ☆40Aug 15, 2021Updated 4 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- The code repository for the paper "Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization".☆24Nov 12, 2020Updated 5 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 5 months ago
- ☆13Oct 25, 2024Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- ☆38May 16, 2022Updated 3 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated 2 weeks ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago