☆18Feb 16, 2026Updated last week
Alternatives and similar repositories for InfiniSST
Users that are interested in InfiniSST are comparing it to the libraries listed below
Sorting:
- ☆13Aug 23, 2024Updated last year
- ☆11Sep 5, 2025Updated 5 months ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 8 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated last month
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆13Feb 5, 2025Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- ☆32Oct 23, 2025Updated 4 months ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- ☆19Jan 8, 2025Updated last year
- Sisyphus recipies for ASR☆19Updated this week
- ☆17Mar 1, 2024Updated last year
- Text-to-Speech Latency Benchmark☆22Jan 16, 2026Updated last month
- ☆21Mar 4, 2024Updated last year
- ☆16Jun 13, 2022Updated 3 years ago
- ☆32Aug 22, 2024Updated last year
- ☆18Sep 19, 2023Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- Official source for Catalan Language Models and resources made within Aina project.☆26Jul 28, 2023Updated 2 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- ☆29Feb 4, 2025Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆35Dec 17, 2024Updated last year
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆79Jul 4, 2025Updated 7 months ago
- A benchmark for evaluating audio encoders on various audio tasks.☆43Dec 11, 2025Updated 2 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- ☆34Mar 25, 2023Updated 2 years ago
- ASR client for Triton ASR Service☆37Jan 12, 2026Updated last month
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- ☆35Sep 1, 2022Updated 3 years ago
- 2022WHU计算机系统综合设计 基于RISCV的五级流水线CPU Five stage CPU implement based on RISC-V☆11Oct 31, 2023Updated 2 years ago
- 这是一个大学四年的cs基础课部分专业课的复习笔记的扫描版备份仓库☆12Jun 29, 2019Updated 6 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using th…☆34Sep 2, 2022Updated 3 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆166Dec 12, 2025Updated 2 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Oct 11, 2024Updated last year
- A TensorFlow-based spoken language identification☆98Mar 22, 2023Updated 2 years ago
- Python3 package for UST(UTAU), INI(setParam), LAB☆37Dec 13, 2025Updated 2 months ago