thewh1teagle / loud.cppLinks
Whisper.cpp with diarization
☆19Updated last year
Alternatives and similar repositories for loud.cpp
Users that are interested in loud.cpp are comparing it to the libraries listed below
Sorting:
- A full-text search for YouTube subtitles and video metadata with a GUI and command line interface.☆42Updated 2 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆53Updated 10 months ago
- ez audio transcription tool with flexible processing and post-processing options☆162Updated 2 years ago
- Resources on AI applications in the music domain☆20Updated 4 months ago
- Tensor library for machine learning☆17Updated 2 years ago
- Neural Text to speech model that is a perfect voice for a home assistant, audiobooks or for screen readers on Linux, Mac and Windows. A f…☆40Updated 2 years ago
- Turn a doc into plaintext which you can listen to using TTS☆20Updated 2 years ago
- A Rust OneNote file parser☆71Updated 2 weeks ago
- C++ version of openWakeWord☆40Updated last year
- Simple agent framework using Ollama tool calling☆10Updated last year
- An interface for llama.cpp, ChatGPT, Gemini, and Claude☆27Updated last week
- An even smaller speech recognizer / force aligner☆37Updated last year
- A chat UI for Llama.cpp☆15Updated 2 months ago
- Speech-to-text transcription VST3/ARA plugin☆53Updated this week
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Updated last year
- On-device noise suppression powered by deep learning☆82Updated 2 weeks ago
- A QT GUI for large language models☆39Updated 2 years ago
- LLM based file organizer☆30Updated 2 years ago
- On-device streaming text-to-speech engine powered by deep learning☆128Updated 2 weeks ago
- TTS support with GGML☆218Updated 4 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆49Updated last year
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated 7 months ago
- TTS Client for Coqui TTS server☆13Updated 3 years ago
- ☆29Updated last year
- Interactive duplicate file finder and remover☆164Updated 2 years ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Updated last year
- web based editor for subtitles and transcripts☆143Updated last year
- Application of OpenAI tools such as Whisper, DALL-E, and ChatGPT to generate album covers from audio☆12Updated 2 years ago
- Pluralising Synthetic Intelligence☆20Updated 7 months ago
- The application performs real-time inference on audio from an ALSA capture device☆36Updated 7 months ago