byteresearchcla / RealSI
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆53Updated 4 months ago
Alternatives and similar repositories for RealSI:
Users that are interested in RealSI are comparing it to the libraries listed below
- We Speech Transcript based on LLM, in 300 lines of code.☆157Updated last month
- GPT-4o-level, real-time spoken dialogue system.☆311Updated 2 months ago
- ☆193Updated 6 months ago
- flow mirror models from JZX AI Labs☆44Updated 6 months ago
- ☆159Updated 4 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆212Updated 3 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 2 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆438Updated last week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆88Updated 6 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆148Updated 2 months ago
- Real time faster whisper gradio☆26Updated 6 months ago
- Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech and video generation APIs.☆146Updated this week
- A toolkit for speaker diarization.☆183Updated 3 weeks ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 2 weeks ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆176Updated last month
- Its an open source LLM based on MOE Structure.☆58Updated 9 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆78Updated 2 weeks ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆43Updated 3 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆77Updated 3 weeks ago
- ☆31Updated last month
- A lightweight end-to-end text-to-speech model☆112Updated last month
- ☆109Updated 8 months ago
- ☆64Updated 7 months ago
- PodAgent: A Comprehensive Framework for Podcast Generation☆68Updated last week
- 🤗 R1-AQA Model: mispeech/r1-aqa☆230Updated 3 weeks ago
- GLM Series Edge Models☆134Updated last month
- Receipts for creating AI Applications with APIs from DashScope (and friends)!☆50Updated 6 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆43Updated 7 months ago
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆206Updated 4 months ago