☆186Feb 27, 2026Updated last week
Alternatives and similar repositories for chirp-stt
Users that are interested in chirp-stt are comparing it to the libraries listed below
Sorting:
- ☆30Jan 22, 2026Updated last month
- All-in-one Speech Transcription☆10Jan 25, 2026Updated last month
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆13Oct 2, 2025Updated 5 months ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- ☆10Jan 1, 2018Updated 8 years ago
- Raspberry Pi Pico Home Automation Firmware☆15Jan 23, 2023Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- Sisyphus recipies for ASR☆19Feb 26, 2026Updated last week
- ☆17Apr 14, 2023Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆30Mar 3, 2026Updated last week
- Name speaks for itself☆44Jun 9, 2025Updated 9 months ago
- Wake-on-LAN in rust☆18Jul 11, 2025Updated 7 months ago
- Faster Whisper ASR transcription with CTranslate2☆24Oct 25, 2024Updated last year
- Ultra fast and portable Parakeet implementation for on-device inference in C++ using Axiom with MPS+Unified Memory☆240Updated this week
- General tools for voice analysis.☆25Jul 30, 2025Updated 7 months ago
- Mirror of hf.co/pyannote/speaker-diarization-3.1☆29Jan 7, 2024Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- Speaker diarization service☆28Feb 24, 2026Updated last week
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- ☆25Mar 6, 2024Updated 2 years ago
- Very fast, accurate speaker diarization☆241Feb 7, 2026Updated last month
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆36Feb 5, 2026Updated last month
- Balthazar's case and the design. Updates regularly as we progress.☆12Oct 22, 2024Updated last year
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis☆52Apr 9, 2025Updated 11 months ago
- Speaker Diarization with Transformers☆70Jun 8, 2025Updated 9 months ago
- ☆27Jan 19, 2021Updated 5 years ago
- Custom implementation of a T-Code protocol for connection to Intiface software for Flipper Zero devices☆33Feb 20, 2024Updated 2 years ago
- Eye exploration☆31Nov 29, 2025Updated 3 months ago
- Joint speech-language model - respond directly to audio!☆30May 13, 2024Updated last year
- Highly ergonomic and portable helpers for terminal navigation.☆20Nov 3, 2025Updated 4 months ago
- ☆12Oct 22, 2019Updated 6 years ago
- Retrieves EXIF data properties from digital image files and saves the info to a CSV-file in a defined directory (a Windows PowerShell scr…☆11Jul 5, 2018Updated 7 years ago
- A demo app showing you how to integrate with Google Cloud Translate API☆10Mar 20, 2019Updated 6 years ago
- a kws demo on android☆40May 28, 2024Updated last year
- ChatGPT-Executor is a server application that empowers ChatGPT to execute Windows commands, unlocking a wide range of applications and ca…☆15Jun 30, 2023Updated 2 years ago
- List of repositories relevant to VITS.☆36Feb 26, 2023Updated 3 years ago