FireRedTeam / FireRedChat
A Fully Self-Hosted Solution for Full-Duplex Voice Interaction
☆254Updated 2 weeks ago
Alternatives and similar repositories for FireRedChat
Users who are interested in FireRedChat are comparing it to the libraries listed below.
- High-quality and streaming Speech-to-Speech interactive agent in a single file. A streaming, full-duplex voice interaction prototype agent implemented in just one file!☆462Updated 3 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆218Updated 9 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆211Updated 7 months ago
- ☆203Updated last year
- ☆11Updated 2 weeks ago
- Annual / semi-annual / quarterly new-chart rankings of open-source projects☆200Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 3 months ago
- INTERSPEECH2023: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music☆30Updated last year
- Efficient audio understanding with general audio captions☆362Updated 2 weeks ago
- Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations…☆348Updated 3 months ago
- ☆99Updated last week
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆182Updated 11 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆661Updated 3 weeks ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆175Updated last month
- 🤗 R1-AQA Model: mispeech/r1-aqa☆300Updated 6 months ago
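- An efficient Chinese TTS dataset built on a linguistic ontology, comprehensively covering Chinese polyphonic characters, sound changes, and related phenomena. A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyph…☆41Updated last year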
- A curated column of GitHub open-source projects, updated irregularly☆238Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆113Updated 2 years ago
- IndexTTS Fine-tuning notebooks☆105Updated 3 months ago
- Simple rule engine based on goyacc☆29Updated 2 years ago
- Fixes the bug in funasr where seaco-paraformer has no timestamps after being exported to ONNX☆20Updated last year
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆436Updated 2 weeks ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆149Updated 2 weeks ago
- Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation…☆580Updated last year
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆35Updated last year
- ☆39Updated 2 months ago
- A programmable remote signal generator (includes an lcd1602 driver)☆60Updated 2 years ago
- Fintech Key-Phrase: a New Chinese Financial & High-tech Dataset Accelerating Expression-Level Information Retrieval☆53Updated last year
- SpringCloud Alibaba Micro Service System.☆65Updated 2 years ago
- Github repository for ACL 2025 paper: Recent Advances in Speech Language Models: A Survey.☆140Updated 3 months ago