☆19Mar 16, 2026Updated this week
Alternatives and similar repositories for InfiniSST
Users that are interested in InfiniSST are comparing it to the libraries listed below
Sorting:
- ☆13Aug 23, 2024Updated last year
- ☆11Sep 5, 2025Updated 6 months ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 9 months ago
- ☆34Mar 25, 2023Updated 2 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated 2 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆13Feb 5, 2025Updated last year
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated last month
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆79Jul 4, 2025Updated 8 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆32Oct 23, 2025Updated 4 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- ☆19Jan 8, 2025Updated last year
- 这是一个大学四年的cs基础课部分专业课的复习笔记的扫描版备份仓库☆12Jun 29, 2019Updated 6 years ago
- ☆11Jul 30, 2025Updated 7 months ago
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆17May 9, 2025Updated 10 months ago
- A disitributed implementation of alphafold3 base on xfold and tpp-pytorch-extension☆12May 25, 2025Updated 9 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated 11 months ago
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- Legacy Code of ZJU Campus App for iOS☆11Jan 31, 2024Updated 2 years ago
- ☆13May 23, 2021Updated 4 years ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- ☆16Jun 13, 2022Updated 3 years ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- Text-to-Speech Latency Benchmark☆22Mar 13, 2026Updated last week
- ☆113Oct 21, 2025Updated 5 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- ☆15Apr 4, 2025Updated 11 months ago
- A benchmark for evaluating audio encoders on various audio tasks.☆44Dec 11, 2025Updated 3 months ago
- Reference implementation of the paper "Efficient and Scalable Graph Generation through Iterative Local Expansion"☆16Aug 27, 2025Updated 6 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated last year
- ☆17Mar 1, 2024Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- ☆29Feb 4, 2025Updated last year
- Collect papers related to personalized text generation☆18Sep 6, 2021Updated 4 years ago
- ASR client for Triton ASR Service☆38Jan 12, 2026Updated 2 months ago
- ☆18Sep 19, 2023Updated 2 years ago
- Source code and dataset for paper "End-to-End Transition-Based Online Dialogue Disentanglement"☆17May 17, 2021Updated 4 years ago