byteresearchcla / RealSI
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆53Updated 4 months ago
Alternatives and similar repositories for RealSI:
Users that are interested in RealSI are comparing it to the libraries listed below
- We Speech Transcript based on LLM, in 300 lines of code.☆149Updated 3 weeks ago
- GPT-4o-level, real-time spoken dialogue system.☆300Updated last month
- ☆191Updated 6 months ago
- An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System☆277Updated last month
- ☆157Updated 3 months ago
- A toolkit for speaker diarization.☆172Updated 4 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated last month
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆210Updated 2 months ago
- flow mirror models from JZX AI Labs☆43Updated 5 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆86Updated 6 months ago
- Real time faster whisper gradio☆26Updated 5 months ago
- ☆130Updated last month
- A lightweight end-to-end text-to-speech model☆110Updated last month
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆137Updated last month
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆135Updated this week
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 2 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated last month
- OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.☆342Updated last week
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆43Updated 2 months ago
- Scholar Copilot is an intelligent academic writing assistant that enhances the research writing process through AI-powered text completio…☆82Updated last week
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆163Updated 3 weeks ago
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆63Updated last week
- MaskGCT demo page☆14Updated last month
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆81Updated last year
- ☆108Updated 7 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆77Updated this week
- Speech Diarization for scrum automation☆102Updated last year
- ☆188Updated 7 months ago
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors☆68Updated 3 weeks ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆26Updated 6 months ago