RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆79Jul 4, 2025Updated 8 months ago
Alternatives and similar repositories for RealSI
Users that are interested in RealSI are comparing it to the libraries listed below
Sorting:
- ☆19Feb 16, 2026Updated 3 weeks ago
- 页面发布mcp工具,可以将html页面直接发布到cloudflare的worker中,并获得预览链接。☆15Jul 26, 2025Updated 7 months ago
- ☆114Oct 21, 2025Updated 4 months ago
- Build your own AI friend☆17Oct 23, 2025Updated 4 months ago
- ☆162Aug 18, 2025Updated 6 months ago
- ☆17Mar 1, 2024Updated 2 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- create CMakeLists.txt for kaldi☆20Apr 30, 2020Updated 5 years ago
- Dataset☆31Jul 31, 2025Updated 7 months ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆225Aug 6, 2025Updated 7 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆886Feb 27, 2026Updated last week
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Sep 20, 2024Updated last year
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension☆126Dec 9, 2024Updated last year
- MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss☆82Feb 6, 2026Updated last month
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆31Apr 26, 2024Updated last year
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,361Feb 13, 2026Updated 3 weeks ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Jun 9, 2023Updated 2 years ago
- ☆34Mar 25, 2023Updated 2 years ago
- A feature-rich concurrency kit, yet another DAG framework☆10Jan 18, 2026Updated last month
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆134Sep 19, 2025Updated 5 months ago
- A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models☆136Feb 23, 2026Updated 2 weeks ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆482Nov 23, 2025Updated 3 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆156Oct 20, 2025Updated 4 months ago
- Run Qwen3.5-35B-A3B with llama.cpp and openclaw on NVIDIA DGX Spark (GB10)☆35Mar 1, 2026Updated last week
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- An n8n workflow designed to automate key aspects of a Sales Development Representative's (SDR) tasks,☆15Sep 15, 2025Updated 5 months ago
- Paster core module using KiteX☆10Aug 30, 2023Updated 2 years ago
- 一个基于 Next.js 开发的动态二维码生成工具🎉☆74Jan 1, 2026Updated 2 months ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- 短链接服务器,基于proactor的多线程服务器,maysql作为发号器,redis缓存☆10Jun 2, 2021Updated 4 years ago
- Official PyTorch+CUDA Full-functional Web Demo for MiniCPM-o 4.5☆45Updated this week
- 16S pipeline using qiime2 created with snakemake☆12Jan 2, 2026Updated 2 months ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- ☆78Sep 25, 2025Updated 5 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆185Jun 20, 2025Updated 8 months ago
- OcuAssist是一款基于Tauri框架开发的AI辅助眼底多模态诊断软件,集成了AI辅助探测、AI辅助诊断和AI对话等功能,为 眼科医生提供智能化的诊断支持。☆16Sep 29, 2025Updated 5 months ago
- ☆11Sep 17, 2024Updated last year
- Simple LLM Assistant Chatbot using Ollama API For ESP32☆13May 14, 2024Updated last year
- 程序员延寿指南 | A programmer's guide to live longer☆18Jan 30, 2024Updated 2 years ago