George0828Zhang/simulst

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/George0828Zhang/simulst)

George0828Zhang / simulst

PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.

☆25

Alternatives and similar repositories for simulst

Users that are interested in simulst are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

George0828Zhang / torch_cif
View on GitHub
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…
☆37Feb 10, 2024Updated 2 years ago
dqqcasia / mosst
View on GitHub
☆27Aug 31, 2022Updated 3 years ago
MingLunHan / CIF-PyTorch
View on GitHub
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆78Jul 14, 2026Updated 2 weeks ago
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
voidful / MMLM
View on GitHub
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
☆16Dec 10, 2024Updated last year
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 5 years ago
ictnlp / Dual-Path
View on GitHub
Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"
☆12Mar 31, 2022Updated 4 years ago
ga642381 / Taiwanese-Speech-Synthesis
View on GitHub
Taiwanese Speech Synthesis with Tacotron2
☆26Oct 2, 2022Updated 3 years ago
voidful / asrp
View on GitHub
ASR text preprocessing utility
☆21Aug 5, 2024Updated last year
yazone / BasedRuleQA_Parser
View on GitHub
基于规则匹配的问答系统中的解析器，the parser of based rule QA system
☆12Mar 13, 2020Updated 6 years ago
HolgerBovbjerg / data2vec-KWS
View on GitHub
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆32Mar 6, 2025Updated last year
bartlomiej-pluta / android-tts-server
View on GitHub
The Android application providing user with REST-based interface for utilizing built-in Android's TTS engine. The web service is highly c…
☆11Jul 28, 2020Updated 6 years ago
ga642381 / FlappyBird
View on GitHub
Super Flappy Bird in p5.js
☆10Mar 8, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Sanyuan-Chen / CSS_with_EETransformer
View on GitHub
Code for the ICASSP-2021 paper: Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
☆12Sep 2, 2021Updated 4 years ago
BUTSpeechFIT / BrnoLM
View on GitHub
A neural language modeling toolkit built on PyTorch
☆19Mar 17, 2023Updated 3 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
Zildj1an / FIFO-Driver
View on GitHub
Character device driver working as FIFO pipe, created with a Linux Kernel module (SMP-Safe). Works on Android's kernel too.
☆13Jun 14, 2021Updated 5 years ago
mechanicalsea / lighthubert
View on GitHub
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆73Sep 26, 2022Updated 3 years ago
ZJUGKC / GKC
View on GitHub
General Kind C++
☆10Dec 17, 2023Updated 2 years ago
chenpk00 / IS2024_stream_decoder_only_asr
View on GitHub
☆16Mar 12, 2024Updated 2 years ago
boris-kuz / jaxloudnorm
View on GitHub
Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆13Jan 29, 2025Updated last year
MingLunHan / CIF-ColDec
View on GitHub
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
☆25Jul 14, 2026Updated 2 weeks ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
erasedwalt / CTC-ASR
View on GitHub
An implementation of Jasper, QuartzNet, Citrinet and pipeline for training CTC-based ASR models
☆12Nov 13, 2021Updated 4 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
ga642381 / Taiwanese-Translation
View on GitHub
Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus
☆13Oct 15, 2022Updated 3 years ago
huangyz0918 / kws-continual-learning
View on GitHub
[ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting
☆17Jun 7, 2022Updated 4 years ago
StephenLee2016 / simple-chatbot
View on GitHub
基于规则和相似匹配的闲聊机器人
☆13Nov 8, 2017Updated 8 years ago
facebookresearch / SimulEval
View on GitHub
SimulEval: A General Evaluation Toolkit for Simultaneous Translation
☆126Sep 13, 2024Updated last year
zake7749 / Kyara
View on GitHub
[Kaggle-2nd] Lightweight yet Effective Chinese LLM.
☆55Jun 14, 2025Updated last year
xiaomin418 / CFSum
View on GitHub
☆13Jan 9, 2024Updated 2 years ago
mtkresearch / generative-fusion-decoding
View on GitHub
Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition syst…
☆87Jul 31, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jeffeuxMartin / meta-learning-hlp
View on GitHub
A publishing website of a table collecting meta-learning-related papers in the area of human language processing.
☆17Aug 2, 2021Updated 4 years ago
igormq / ctcdecode-pytorch
View on GitHub
Python implementation of CTC beam search decoder + agnostic LM scorer
☆20Dec 16, 2020Updated 5 years ago
voidful / vall-e-encodec
View on GitHub
☆41May 15, 2023Updated 3 years ago
kyegomez / AudioFlamingo
View on GitHub
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…
☆39Jan 27, 2025Updated last year
voidful / SpeechMix
View on GitHub
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
☆46Jul 3, 2025Updated last year
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 5 years ago
grtzsohalf / buy_vs_rent_and_invest
View on GitHub
☆15Sep 9, 2021Updated 4 years ago