Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on multilingual with minimal impact on its original English capabilities.
☆17Jan 20, 2025Updated last year
Alternatives and similar repositories for WhisperSpeech
Users that are interested in WhisperSpeech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Mar 25, 2025Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆22Jun 7, 2025Updated 11 months ago
- ☆14Apr 8, 2026Updated last month
- PDFwhisper allows you to have a conversation with your PDF docs. Finding info on PDF files is now easier than ever. 🚀🔥 Most secure auth…☆15Oct 30, 2024Updated last year
- A Multi-branch CI-CD Pipeline Using Jenkins, Docker, AWS, Maven To Deploy an Odoo ERP custom module & a simple Java Maven web app.☆13Dec 23, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 7 months ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆23Jun 5, 2025Updated 11 months ago
- This app is designed to assist applicants in searching for potential jobs and to help recruiters find talented candidates.☆15Feb 16, 2025Updated last year
- ☆19Jun 12, 2025Updated 10 months ago
- Official repo for the Vietnam-Celeb dataset☆26Aug 27, 2023Updated 2 years ago
- App to search images with Unsplash's API and react-query 🔋☆10Oct 7, 2022Updated 3 years ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆51May 22, 2025Updated 11 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Jul 1, 2024Updated last year
- zero shot NER fine tuning☆14Mar 17, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Deep-learning project utilizing 3D human pose estimation to compare different poses☆13Feb 25, 2024Updated 2 years ago
- HiFi-SR is a Python-based pipeline for the detection of plant mitochondrial structural rearrangements based on the mapping of PacBio high…☆10Apr 15, 2025Updated last year
- Large Language Models (LLMs) Learning Resources☆20Jun 16, 2024Updated last year
- Code and datasets for the salesforce AI research paper on prompt leakage and multi-turn threats against LLMs☆22Nov 10, 2025Updated 5 months ago
- Summary of all repositories for my public contents, mostly Python, in Jupyter Notebooks, PDFs, Markdowns, and more!☆11Aug 24, 2021Updated 4 years ago
- ☆12Nov 1, 2023Updated 2 years ago
- Implementation for the different ML tasks on Kaggle platform with GPUs.☆25Jan 27, 2026Updated 3 months ago
- Anomalous sound detection with machine learning and deep learning☆14Jun 24, 2024Updated last year
- Trends, Tools, News timeline ...☆20Oct 13, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Telegram bot to help you with your findings 🚀☆23Jul 11, 2024Updated last year
- Repository dedicated to enhancing data retrieval and processing efficiencies in Google Cloud's Vertex AI by implementing a semantic cachi…☆18Jun 12, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated last year
- iBVP Dataset: RGB-Thermal rPPG Dataset with High Resolution Signal Quality Labels [Electronics 2024]☆26Apr 7, 2026Updated last month
- End-to-end Multi-task Solutions for Aspect Category Sentiment Analysis (ACSA) on Vietnamese reviews, using PhoBERT as pretrained model☆32Jul 9, 2024Updated last year
- openvino version of openai/whisper☆15Oct 8, 2024Updated last year
- dMel: Speech Tokenization Made Simple☆20May 13, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Aug 19, 2024Updated last year
- MLSys competition for the best MOE NKI kernels☆40Apr 30, 2026Updated last week
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆184Jun 20, 2025Updated 10 months ago
- Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.☆18Apr 19, 2025Updated last year
- Repository for the paper "ViHateT5: Enhancing Hate Speech Detection in Vietnamese with A Unified Text-to-Text Transformer Model" (ACL'202…☆10Aug 13, 2024Updated last year
- Text Analysis: Implementation of ULMFiT by Howard & Ruder on Twitter dataset☆10Feb 7, 2019Updated 7 years ago
- BookWorm: A Dataset for Character Description and Analysis [EMNLP Findings 2024]☆14Feb 28, 2025Updated last year