☆201Jun 26, 2024Updated 2 years ago
Alternatives and similar repositories for whisper-acft
Users that are interested in whisper-acft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A Kotlin Multiplatform Project utilizing ggwave, a data-over-sound library.☆20Nov 23, 2024Updated last year
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆153May 18, 2025Updated last year
- ☆18Jul 12, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Sep 12, 2024Updated last year
- Interact privately with your documents using the power of GPT, 100% privately, no data leaks☆10May 22, 2023Updated 3 years ago
- Whisper with Medusa heads☆861Jun 9, 2026Updated 3 weeks ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆36Jul 31, 2024Updated last year
- ☆20Nov 28, 2024Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 3 years ago
- Semantic emoji finder. Python/dash UI. Uses sentence transformer embeddings and duckdb☆20Sep 15, 2025Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ggml implementation of BERT☆500Feb 23, 2024Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated last year
- Secure social networking for Android☆40Feb 3, 2025Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆51Oct 30, 2023Updated 2 years ago
- Go language bindings for the ggwave C++ library☆14Apr 9, 2025Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Feb 21, 2024Updated 2 years ago
- ☆23Jun 24, 2024Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆34Apr 22, 2026Updated 2 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆865Nov 16, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆25Mar 6, 2024Updated 2 years ago
- discord-bot A powerful discord bot with a ton of commands. It can also act as a Music bot & supports Slash Commands☆11May 21, 2023Updated 3 years ago
- WhisperX Service love docker!☆18Aug 17, 2024Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes☆29Mar 1, 2024Updated 2 years ago
- Offline voice input panel & keyboard with punctuation for Android.☆113Jun 1, 2024Updated 2 years ago
- Fast neural codec compression and generation for audio waveforms☆230Dec 4, 2024Updated last year
- 通过Shizuku授权,实现修改部分系统设置项。☆17Apr 1, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A chat implementation for FastHTML☆12Sep 14, 2025Updated 9 months ago
- cpp inference for EmotiVoice☆16Jan 1, 2024Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆963Jun 3, 2025Updated last year
- ☆15Nov 26, 2024Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆110Mar 15, 2026Updated 3 months ago
- I publish my weekly research here☆20Jun 26, 2025Updated last year
- ☆19Mar 22, 2024Updated 2 years ago