byteresearchcla / RealSIView external linksLinks
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆79Jul 4, 2025Updated 7 months ago
Alternatives and similar repositories for RealSI
Users that are interested in RealSI are comparing it to the libraries listed below
Sorting:
- ☆13Aug 23, 2024Updated last year
- ☆18Feb 4, 2026Updated last week
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- 页面发布mcp工具,可以将html页面直接发布到cloudflare的worker中,并获得预览链接。☆15Jul 26, 2025Updated 6 months ago
- ☆114Oct 21, 2025Updated 3 months ago
- Build your own AI friend☆17Oct 23, 2025Updated 3 months ago
- ☆162Aug 18, 2025Updated 5 months ago
- ☆17Mar 1, 2024Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- You can using Follow_Your_Emoji in ComfyUI☆17Apr 11, 2025Updated 10 months ago
- ☆16Jun 13, 2022Updated 3 years ago
- Dataset☆28Jul 31, 2025Updated 6 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆835Jan 29, 2026Updated 2 weeks ago
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆56Jun 25, 2024Updated last year
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆224Aug 6, 2025Updated 6 months ago
- 眼科问诊大模型☆100Jul 16, 2024Updated last year
- ☆23Jun 20, 2023Updated 2 years ago
- [ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707☆24Jun 7, 2023Updated 2 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆54Dec 6, 2023Updated 2 years ago
- 💻🤖 302 AI Document Editor! 🚀✨☆37Aug 26, 2025Updated 5 months ago
- Chinese-Mimi 是对 Moshi 模型的声码器进行了中文语料上的适配。☆34Mar 13, 2025Updated 11 months ago
- MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss☆80Feb 6, 2026Updated last week
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆32Aug 29, 2024Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Sep 20, 2024Updated last year
- ☆28Sep 23, 2023Updated 2 years ago
- AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension☆127Dec 9, 2024Updated last year
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,336Sep 22, 2025Updated 4 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Jun 9, 2023Updated 2 years ago
- A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models☆124Sep 21, 2025Updated 4 months ago
- A feature-rich concurrency kit, yet another DAG framework☆10Jan 18, 2026Updated 3 weeks ago
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆132Sep 19, 2025Updated 4 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆151Oct 20, 2025Updated 3 months ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆475Nov 23, 2025Updated 2 months ago
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.☆1,238Jun 29, 2025Updated 7 months ago
- 拼好AI(PinAI)是一个简单的AI路由平台,简单就好。☆68Updated this week
- ⚡ An optimized,Next for AiNiee powered by UV. Features intelligent format conversion, multi-profile config system, and stabilized TUI. ⚡ …☆29Updated this week
- Paster core module using KiteX☆10Aug 30, 2023Updated 2 years ago
- 一个基于 Next.js 开发的动态二维码生成工具🎉☆74Jan 1, 2026Updated last month
- A node rpc client for wcferry☆32Sep 24, 2024Updated last year