facebookresearch / fbai-speechView external linksLinks
Repo for the FB AI Speech team.
☆25Aug 24, 2021Updated 4 years ago
Alternatives and similar repositories for fbai-speech
Users that are interested in fbai-speech are comparing it to the libraries listed below
Sorting:
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- ☆24Sep 20, 2024Updated last year
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 2 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- ☆86Jul 31, 2025Updated 6 months ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆68Dec 30, 2025Updated last month
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- ☆11Aug 10, 2022Updated 3 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Sep 1, 2025Updated 5 months ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 3 years ago
- RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcription…☆10Apr 9, 2017Updated 8 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated last year
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆11Jun 12, 2023Updated 2 years ago
- ☆15Sep 13, 2022Updated 3 years ago
- MultiLingualBot: a simple multi-lingual bot that can respond to questions on academic subjects☆11Dec 8, 2022Updated 3 years ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Jul 25, 2023Updated 2 years ago
- ☆31Dec 2, 2020Updated 5 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- ☆17May 5, 2024Updated last year
- Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation☆15Aug 28, 2020Updated 5 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.☆28Nov 8, 2025Updated 3 months ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…☆15Apr 16, 2020Updated 5 years ago
- ☆15Jul 4, 2024Updated last year
- Python wrapper for kaldi's arpa2fst☆37Aug 27, 2025Updated 5 months ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- ☆67Mar 25, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 4 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- An advance kaldi wrapper for Pyhton☆38Mar 1, 2021Updated 4 years ago