Fine-tune WhisperAI model to your language
☆21Sep 14, 2023Updated 2 years ago
Alternatives and similar repositories for whisper_ai_finetune
Users that are interested in whisper_ai_finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Train Tesseract LSTM with make on Windows☆10Dec 24, 2023Updated 2 years ago
- A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.☆18Sep 25, 2022Updated 3 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆42Jun 29, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆363May 23, 2023Updated 3 years ago
- A python library / model for creating co-references between AMR graph nodes.☆11Dec 11, 2022Updated 3 years ago
- A CNN-based audio denoiser☆10May 2, 2021Updated 5 years ago
- ☆11Oct 25, 2021Updated 4 years ago
- end-to-end automated video generation pipeline designed to create engaging, TikTok-style viral short videos using AI.☆21Jun 7, 2025Updated last year
- SPIE Medical Imaging 2019 Notes By Hao☆16Feb 26, 2019Updated 7 years ago
- This repository helps you extract useful information from Openpose node publisher via comparing position of body nodes and estimate the g…☆16Jun 8, 2018Updated 8 years ago
- Elevate your language models with insightful diversity metrics.☆11Feb 4, 2024Updated 2 years ago
- Automate the KYC Process using OCR (Implemented from scratch)☆12May 23, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Sep 5, 2025Updated 9 months ago
- ☆14Sep 21, 2023Updated 2 years ago
- Data generation, model training and inference for Visual Font Recognition using PyTorch☆19Dec 5, 2023Updated 2 years ago
- CLI tool for Markdown files, offering formatting, AI-powered reviews, linting, spell checking, and link checking to streamline your Markd…☆21Jan 27, 2025Updated last year
- Altostratus Sample for MSDN Magazine articles☆15Jun 28, 2016Updated 9 years ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Dec 20, 2022Updated 3 years ago
- Cochlear implant signal processing☆10Jun 24, 2021Updated 4 years ago
- web app for designing and milling simple circuit boards☆14May 7, 2018Updated 8 years ago
- FunAudioLLM homepage☆17Dec 11, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Unified-Multimodal Transformer Pipeline for Political Content Creation: TikTok Reel Generator (Highlight detection + visually tracked ver…☆16May 15, 2023Updated 3 years ago
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Analysis of XLS-R for Speech Quality Assessment☆15Feb 10, 2025Updated last year
- Lessons Learned from GPU Experiments with Aparapi☆13Apr 17, 2016Updated 10 years ago
- [TPAMI2025] Code for my paper "Semi-Supervised Unconstrained Head Pose Estimation in the Wild"☆20Sep 25, 2025Updated 8 months ago
- Digital Audio Effects in JavaScript☆11May 28, 2026Updated last week
- A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆24Nov 25, 2024Updated last year
- TRITONCACHE implementation of a Redis cache☆17May 8, 2026Updated last month
- Perform OSINT on external targets using Shodan☆24Feb 7, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- Opensource Light Weight Hotel Enterprise Resource Planning System☆14Feb 5, 2021Updated 5 years ago
- FFT-based windowed spectrum analyzer☆13Mar 10, 2017Updated 9 years ago
- Neuralizer.ai - Visual Neural Network Designer☆14Nov 8, 2022Updated 3 years ago
- Continuous speech recognition for Android demo☆14Feb 20, 2024Updated 2 years ago
- sveltekit + adapter-node + socket.io☆11May 18, 2024Updated 2 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 3 years ago