byteresearchcla / RealSI
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆53Updated 5 months ago
Alternatives and similar repositories for RealSI:
Users that are interested in RealSI are comparing it to the libraries listed below
- ☆142Updated this week
- We Speech Transcript based on LLM, in 300 lines of code.☆160Updated 2 weeks ago
- ☆158Updated 5 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆219Updated last week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆90Updated 7 months ago
- GPT-4o-level, real-time spoken dialogue system.☆321Updated 3 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- ☆195Updated 7 months ago
- flow mirror models from JZX AI Labs☆45Updated 7 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆28Updated last week
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆46Updated 3 weeks ago
- Its an open source LLM based on MOE Structure.☆58Updated 10 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆475Updated this week
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- PodAgent: A Comprehensive Framework for Podcast Generation☆79Updated 3 weeks ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 8 months ago
- A toolkit for speaker diarization.☆185Updated this week
- Real time faster whisper gradio☆26Updated 7 months ago
- openai realtime webrtc python client☆42Updated 4 months ago
- A gradio webui for Andrewyng translation-agent☆29Updated 5 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆27Updated 7 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 4 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 3 months ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆34Updated 8 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆153Updated 3 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated last month
- [NAACL'25] TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 10 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆185Updated 2 months ago
- A lightweight end-to-end text-to-speech model☆113Updated 2 months ago