janhq / WhisperSpeechView external linksLinks
Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on multilingual with minimal impact on its original English capabilities.
☆17Jan 20, 2025Updated last year
Alternatives and similar repositories for WhisperSpeech
Users that are interested in WhisperSpeech are comparing it to the libraries listed below
Sorting:
- ☆14Jun 25, 2025Updated 7 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- Official repo for the Vietnam-Celeb dataset☆25Aug 27, 2023Updated 2 years ago
- Telegram bot to help you with your findings 🚀☆23Jul 11, 2024Updated last year
- HiFi-SR is a Python-based pipeline for the detection of plant mitochondrial structural rearrangements based on the mapping of PacBio high…☆10Apr 15, 2025Updated 10 months ago
- InSales e-commerce platform API bindings☆14Jul 13, 2024Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆52May 22, 2025Updated 8 months ago
- AI DJ Mix Generator - a fully automated system that creates a mix from input of songs closely resembling real life djs work. Includes adv…☆16Jul 2, 2025Updated 7 months ago
- Not everyone can code, but everyone can learn. This Project is an AI powered DSA/Competitive Programming Helper with an inbuilt editor to…☆13Jun 3, 2025Updated 8 months ago
- ☆56Feb 8, 2026Updated last week
- This contains a practical guide for non-technical users on how to use OpenAI's Whisper for transcription and translation☆12May 8, 2024Updated last year
- Local text-to-speech in your browser with Piper TTS☆16Aug 13, 2025Updated 6 months ago
- 📱 Record iOS devices from command line☆15Jul 14, 2020Updated 5 years ago
- A local, voice-controlled AI assistant with the personality of HAL 9000 from 2001: A Space Odyssey.☆20Aug 16, 2025Updated 6 months ago
- ☆29Dec 20, 2025Updated last month
- Nanos klib for NVIDIA GPUs☆14Mar 25, 2025Updated 10 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 4 months ago
- Summary of all repositories for my public contents, mostly Python, in Jupyter Notebooks, PDFs, Markdowns, and more!☆11Aug 24, 2021Updated 4 years ago
- ☆16Jun 12, 2025Updated 8 months ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 2 months ago
- The demo page for ALMTokenizer☆58Apr 14, 2025Updated 10 months ago
- This repository provides some useful snippets that you may need in some situations.☆12Jan 16, 2024Updated 2 years ago
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- A transparent glass style for qt applications.☆32Jan 26, 2026Updated 3 weeks ago
- An intuitive context menu based diff plugin for Sublime Text☆13Jul 17, 2023Updated 2 years ago
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆15Updated this week
- Unofficial implementation of wavenext vocoder☆57Aug 28, 2024Updated last year
- Tired of long text inputs automatically converting to attachments in Claude AI? ClaudePaster lets you paste lengthy content while maintai…☆10Nov 25, 2024Updated last year
- BookWorm: A Dataset for Character Description and Analysis [EMNLP Findings 2024]☆14Feb 28, 2025Updated 11 months ago
- Package for word stress detection☆11Jan 27, 2023Updated 3 years ago
- A vscode extension for designing AIGC applications.☆11Jan 15, 2024Updated 2 years ago
- zero shot NER fine tuning☆14Mar 17, 2025Updated 10 months ago
- A Pytorch Lightning WGAN-gp to generate faces☆11Jan 26, 2021Updated 5 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- Noted is an all-in-one workspace application, that helps you for note-making 📝, project management 📅, collaboration 👥, and more! 🛠️☆17Nov 11, 2024Updated last year
- Most versatile Telegram torrent and youtube-dl bot.☆10Jan 18, 2025Updated last year
- Awesome Multimodal Fusion in Speech Emotion Recognition☆13Nov 11, 2025Updated 3 months ago
- Piper based VoiceDock TTS implementation☆11Aug 12, 2023Updated 2 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year