☆177Jun 26, 2024Updated last year
Alternatives and similar repositories for whisper-acft
Users that are interested in whisper-acft are comparing it to the libraries listed below
Sorting:
- ☆44Jul 11, 2024Updated last year
- ☆18Nov 28, 2024Updated last year
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated 11 months ago
- ☆18Jul 12, 2025Updated 7 months ago
- Simple audio AE☆13Nov 10, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆147May 18, 2025Updated 9 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45May 16, 2024Updated last year
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 11 months ago
- ☆13Sep 12, 2024Updated last year
- ☆29Feb 13, 2026Updated 3 weeks ago
- Android spec sheet generator☆19May 17, 2015Updated 10 years ago
- T5-based (russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Jun 1, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Speech-to-text transcription VST3/ARA plugin☆54Feb 2, 2026Updated last month
- Next generation linbo☆12Jan 31, 2026Updated last month
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- discord-bot A powerful discord bot with a ton of commands. It can also act as a Music bot & supports Slash Commands☆11May 21, 2023Updated 2 years ago
- Gootool for Android☆13Jul 21, 2023Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- Simple Android app for starting and stopping the system vibrator☆23Dec 23, 2024Updated last year
- ☆22Jun 24, 2024Updated last year
- Whisper with Medusa heads☆865Aug 6, 2025Updated 7 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆55Apr 15, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- 通过Shizuku授权,实现修改部分系统设置项。☆17Apr 1, 2024Updated last year
- All-in-one Speech Transcription☆10Jan 25, 2026Updated last month
- A calculator that can deal with distributions ergonomically, to allow users to easily arrive at fermi estimates that incorporate uncertai…☆12Jul 20, 2025Updated 7 months ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Adds keyboard shortcut support for deleting single emails in Gmail☆18Oct 18, 2025Updated 4 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- plugin manager for OpenVoiceOS , STT/TTS/Wakewords that can be used anywhere☆13Jan 30, 2026Updated last month
- A chat implementation for FastHTML☆11Sep 14, 2025Updated 5 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Cuda extensions for PyTorch☆12Dec 2, 2025Updated 3 months ago
- Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes☆29Mar 1, 2024Updated 2 years ago