byteresearchcla/RealSI


RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios

☆85

Alternatives and similar repositories for RealSI

Users who are interested in RealSI are comparing it to the libraries listed below.

fyvo / WMT-Biomed-Test
☆13 · Aug 23, 2024 · Updated last year
LeiLiLab / InfiniSST
☆25 · May 27, 2026 · Updated last month
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
biaofuxmu / wav2vec-S
Code for the ACL 2024 Findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"
☆12 · Apr 21, 2026 · Updated 3 months ago
ByteDance-Seed / Seed-X-7B
☆170 · Aug 18, 2025 · Updated 11 months ago
XiaomiMiMo / MiMo-Audio-Tokenizer
A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.
☆145 · Sep 19, 2025 · Updated 10 months ago
TianduoWang / MsAT
[ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707
☆24 · Jun 7, 2023 · Updated 3 years ago
smthemex / ComfyUI_FollowYourEmoji
You can use Follow_Your_Emoji in ComfyUI
☆17 · Apr 11, 2025 · Updated last year
ffaltings / InteractiveTextGeneration
☆34 · Mar 25, 2023 · Updated 3 years ago
NKU-HLT / KNN-CTC
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆42 · Mar 20, 2024 · Updated 2 years ago
the-bird-F / GLM-Voice-RAG
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31 · Jul 11, 2025 · Updated last year
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
aixplain / NoRefER
☆18 · Jun 5, 2026 · Updated last month
FactrueSolin / cf-page-publish-mcp
A page-publishing MCP tool that publishes HTML pages directly to a Cloudflare Worker and returns a preview link.
☆15 · Jul 26, 2025 · Updated 11 months ago
MrSupW / ICMC-ASR_Baseline
The baseline system for the ICASSP2024 ICMC-ASR Challenge.
☆57 · Dec 6, 2023 · Updated 2 years ago
ictnlp / DiSeg
Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"
☆37 · Dec 6, 2023 · Updated 2 years ago
AmphionTeam / SD-Eval
[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
☆57 · Jun 25, 2024 · Updated 2 years ago
xingchensong / FlashCosyVoice
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
☆250 · Feb 25, 2026 · Updated 4 months ago
OFA-Sys / AIR-Bench
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
☆133 · Dec 9, 2024 · Updated last year
Soul-AILab / SAC
[ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.
☆108 · Nov 1, 2025 · Updated 8 months ago
MatthewCYM / VoiceBench
[TACL'26] VoiceBench: Benchmarking LLM-Based Voice Assistants
☆378 · Jun 11, 2026 · Updated last month
hlt-mt / simulstream
simulstream is a Python library for simultaneous/streaming speech recognition and translation. It enables both the simulation with existi…
☆29 · Jul 9, 2026 · Updated last week
MCG-NJU / Video-DC
☆12 · Jul 30, 2025 · Updated 11 months ago
sorahjy / chinese_fuzzy_matching
Chinese fuzzy entity recognition in 100 lines, using a prefix tree (trie) and edit distance
☆11 · Sep 25, 2023 · Updated 2 years ago
nethermanpro / transvip
☆164 · Nov 29, 2024 · Updated last year
zruiii / Chinese-Mimi
Chinese-Mimi adapts the vocoder of the Moshi model to a Chinese corpus.
☆36 · Mar 13, 2025 · Updated last year
NKU-HLT / PB-DSR
[Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation
☆14 · Nov 28, 2024 · Updated last year
wenet-e2e / west
We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
☆206 · Updated this week
XiaomiMiMo / MiMo-Audio-Eval
☆88 · Jun 17, 2026 · Updated last month
EIT-NLP / LLaSO
☆116 · Oct 21, 2025 · Updated 9 months ago
rosinality / melgan-pytorch
MelGAN and Tacotron 2 in PyTorch
☆11 · Oct 22, 2019 · Updated 6 years ago
DanielLin94144 / Full-Duplex-Bench
A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models
☆236 · May 20, 2026 · Updated 2 months ago
NKU-HLT / EmotionTalk
Dataset [ACL 2026]
☆35 · Jul 31, 2025 · Updated 11 months ago
ASLP-lab / HumDial-FDBench
The Full-Duplex Interaction Track of the ICASSP 2026 Human-like Spoken Dialogue Systems Challenge aims to advance the evaluation of full-…
☆36 · Apr 27, 2026 · Updated 2 months ago
ga642381 / SpeechGen
"SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts"
☆77 · Jun 9, 2023 · Updated 3 years ago
xszheng2020 / memorization
An Empirical Study of Memorization in NLP (ACL 2022)
☆13 · Jun 22, 2022 · Updated 4 years ago
flashlight / sequence
Sequence algorithms for use in Flashlight.
☆14 · Jan 12, 2026 · Updated 6 months ago
thuhcsi / Contextual-Biasing-Dataset
Open-source Mandarin biased-word dataset
☆14 · Sep 21, 2023 · Updated 2 years ago
ku-nlp / VISA
An ambiguous subtitles dataset for visual scene-aware machine translation
☆14 · Oct 17, 2022 · Updated 3 years ago