PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.
☆25Oct 3, 2022Updated 3 years ago
Alternatives and similar repositories for simulst
Users that are interested in simulst are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆36Feb 10, 2024Updated 2 years ago
- ☆27Aug 31, 2022Updated 3 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆79Jan 9, 2025Updated last year
- ☆13Sep 25, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Mar 31, 2022Updated 3 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆31Mar 6, 2025Updated last year
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- 基于规则匹配的问答系统中的解析器,the parser of based rule QA system☆12Mar 13, 2020Updated 6 years ago
- A neural language modeling toolkit built on PyTorch☆19Mar 17, 2023Updated 3 years ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c…☆11Jul 28, 2020Updated 5 years ago
- ☆15Mar 12, 2024Updated 2 years ago
- Code for the ICASSP-2021 paper: Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer☆12Sep 2, 2021Updated 4 years ago
- ☆12Mar 20, 2026Updated last week
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- Continual Learning Benchmark for Spoken Keyword Spotting☆17Jun 7, 2022Updated 3 years ago
- An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models☆12Nov 13, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆124Sep 13, 2024Updated last year
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆13Oct 15, 2022Updated 3 years ago
- 基于规则和相似匹配的闲聊机器人☆13Nov 8, 2017Updated 8 years ago
- ☆12Jan 9, 2024Updated 2 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- [Kaggle-2nd] Lightweight yet Effective Chinese LLM.☆53Jun 14, 2025Updated 9 months ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition syst…☆87Jul 31, 2025Updated 7 months ago
- A publishing website of a table collecting meta-learning-related papers in the area of human language processing.☆17Aug 2, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated 2 years ago
- General Kind C++☆10Dec 17, 2023Updated 2 years ago
- ☆41May 15, 2023Updated 2 years ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Jan 27, 2025Updated last year
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆46Jul 3, 2025Updated 8 months ago
- Implementaion of our ACL Findings 2023 paper "BIGVIDEO: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translati…☆15Mar 16, 2024Updated 2 years ago
- ☆15Sep 9, 2021Updated 4 years ago