byteresearchcla / RealSILinks
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆54Updated 6 months ago
Alternatives and similar repositories for RealSI
Users that are interested in RealSI are comparing it to the libraries listed below
Sorting:
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated last month
- flow mirror models from JZX AI Labs☆45Updated 8 months ago
- GPT-4o-level, real-time spoken dialogue system.☆327Updated 4 months ago
- ☆198Updated 8 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 2 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆35Updated 8 months ago
- ☆160Updated 6 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆93Updated 8 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆222Updated last week
- A gradio webui for Andrewyng translation-agent☆29Updated 5 months ago
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆53Updated 3 weeks ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 9 months ago
- A toolkit for speaker diarization.☆195Updated 2 weeks ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆82Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated 11 months ago
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 4 months ago
- Real time faster whisper gradio☆26Updated 7 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆40Updated 3 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆61Updated 3 weeks ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 8 months ago
- 我们是第一个完全可商用的角色大模型。☆40Updated 9 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆499Updated 2 weeks ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 5 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆195Updated 3 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆24Updated 2 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆158Updated 3 months ago
- 扣子API对话界面☆14Updated 11 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆102Updated last week