High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback.
☆10 · Sep 17, 2025 · Updated 9 months ago
Alternatives and similar repositories for SpeedScribe
Users who are interested in SpeedScribe are comparing it to the libraries listed below.
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 · Dec 24, 2022 · Updated 3 years ago
- ☆34 · Jan 25, 2026 · Updated 5 months ago
- ☆14 · Jun 23, 2024 · Updated 2 years ago
- An AI-powered image dataset captioning tool ☆31 · Feb 21, 2026 · Updated 4 months ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆39 · Apr 6, 2026 · Updated 2 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆23 · Jan 12, 2025 · Updated last year
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs APIs ☆14 · Jun 24, 2023 · Updated 3 years ago
- Next-generation, fully open-source refacer. Images. GIFs. TIFFs. Full-length videos. Bulk refacing ☆42 · May 16, 2025 · Updated last year
- An AR+AR TTS attempt. ☆18 · Jan 13, 2025 · Updated last year
- The implementation of the paper *SinGS: Animatable Single-Image Human Gaussian Splats with Kinematic Priors* [CVPR 2025] ☆22 · Nov 11, 2025 · Updated 7 months ago
- Personal GPEN scripts within the GPEN-Windows stand-alone package. ☆20 · Jun 5, 2022 · Updated 4 years ago
- Vid Driven Portrait Animation 🤢 😷 ☆18 · Jul 7, 2024 · Updated last year
- ☆25 · Sep 27, 2022 · Updated 3 years ago
- Generate images from an initial frame and text ☆37 · Jul 28, 2023 · Updated 2 years ago
- Musculoskeletal Analysis extension for 3D Slicer. Currently has cortical, cancellous, and bone density analysis. ☆13 · May 2, 2024 · Updated 2 years ago
- StyleTTS 2 Optimized Training Fork ☆32 · Feb 2, 2025 · Updated last year
- ☆46 · Dec 3, 2024 · Updated last year
- ☆19 · Jan 15, 2024 · Updated 2 years ago
- A fork of Rope with webcam support ☆13 · Mar 13, 2024 · Updated 2 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆27 · Mar 21, 2025 · Updated last year
- NeMo: a toolkit for conversational AI ☆12 · Dec 23, 2022 · Updated 3 years ago
- Future version of the AnyBody Managed Model Repository with a full thoracic spine model. ☆21 · Updated this week
- StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows ☆51 · Apr 29, 2021 · Updated 5 years ago
- A front-end GUI for interacting with AI Horde's distributed cluster of Stable Diffusion workers ☆26 · Jul 4, 2025 · Updated 11 months ago
- Post-processing OCR errors with seq2seq models ☆28 · Jul 30, 2020 · Updated 5 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ☆45 · Jul 24, 2023 · Updated 2 years ago
- Centralized multi-channel notification management component for streamlined communication across email, SMS, WhatsApp, and push notificat… ☆13 · Updated this week
- Supervoice diffusion enhance ☆28 · Jul 15, 2024 · Updated last year
- ☆20 · May 3, 2024 · Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 · Jan 4, 2023 · Updated 3 years ago
- ComfyUI workflows ☆11 · Sep 19, 2024 · Updated last year
- Full-stack AI video generation app with image/text input and premium NSFW toggle ☆55 · Jun 22, 2025 · Updated last year
- A Streamlit-based Chatbot Arena for Ollama LLMs ☆14 · May 19, 2024 · Updated 2 years ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place. ☆78 · Jun 19, 2025 · Updated last year
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS ☆40 · Aug 4, 2023 · Updated 2 years ago
- Google Colab (without Gradio) notebook for generating AI song covers. YouTube download audio, best voice separation, RVC inference, autom… ☆17 · Mar 20, 2024 · Updated 2 years ago
- ☆21 · Dec 8, 2023 · Updated 2 years ago
- Official repo for the NCR Crypto Meetup ☆17 · Jun 1, 2022 · Updated 4 years ago
- Streamlit app to visualize and edit TTS datasets ☆16 · Dec 15, 2021 · Updated 4 years ago