☆180Jun 26, 2024Updated last year
Alternatives and similar repositories for whisper-acft
Users that are interested in whisper-acft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆44Jul 11, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A Kotlin Multiplatform Project utilizing ggwave, a data-over-sound library.☆20Nov 23, 2024Updated last year
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆149May 18, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆19Jul 12, 2025Updated 8 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45May 16, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- Simple audio AE☆13Nov 10, 2024Updated last year
- Whisper with Medusa heads☆863Aug 6, 2025Updated 7 months ago
- ☆19Nov 28, 2024Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Semantic emoji finder. Python/dash UI. Uses sentence transformer embeddings and duckdb☆19Sep 15, 2025Updated 6 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- the indexer and search engine for irchiver, see https://irchiver.com for license and other information☆14Dec 2, 2021Updated 4 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 11 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆55Apr 15, 2024Updated last year
- Go language bindings for the ggwave C++ library☆14Apr 9, 2025Updated 11 months ago
- ☆23Jun 24, 2024Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 5 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆857Nov 16, 2024Updated last year
- discord-bot A powerful discord bot with a ton of commands. It can also act as a Music bot & supports Slash Commands☆11May 21, 2023Updated 2 years ago
- WhisperX Service love docker!☆18Aug 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes☆29Mar 1, 2024Updated 2 years ago
- Offline voice input panel & keyboard with punctuation for Android.☆111Jun 1, 2024Updated last year
- Port of Meta's Encodec in C/C++☆228Dec 4, 2024Updated last year
- ggml implementation of BERT☆497Feb 23, 2024Updated 2 years ago
- A mobile Implementation of llama.cpp☆26Oct 11, 2023Updated 2 years ago
- A chat implementation for FastHTML☆12Sep 14, 2025Updated 6 months ago
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Nov 26, 2024Updated last year
- Mirror of FUTO's Voice Input, an Android Voice Keyboard for Speech-To-Text transcribing using Whisper, supporting large multilanguage mod…☆20Nov 21, 2024Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆108Mar 15, 2026Updated last week
- I publish my weekly research here☆20Jun 26, 2025Updated 9 months ago
- ☆19Mar 22, 2024Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Dec 12, 2022Updated 3 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆927Jun 3, 2025Updated 9 months ago