Fine-tune WhisperAI model to your language
☆21Sep 14, 2023Updated 2 years ago
Alternatives and similar repositories for whisper_ai_finetune
Users that are interested in whisper_ai_finetune are comparing it to the libraries listed below.
- Final training script from the HuggingFace Whisper fine-tuning event, for getting the best results from a fine-tuned model.☆12Dec 24, 2022Updated 3 years ago
- Train Tesseract LSTM with make on Windows☆10Dec 24, 2023Updated 2 years ago
- A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.☆17Sep 25, 2022Updated 3 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆42Jun 29, 2023Updated 2 years ago
- Use openai whisper to transcribe your voice into written text completely locally in one command☆11Dec 7, 2023Updated 2 years ago
- Botticelli is an open-source .NET Core framework for building universal chatbots. It enables seamless integration with databases, queue b…☆14Mar 14, 2026Updated 2 weeks ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆361May 23, 2023Updated 2 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- A CNN-based audio denoiser☆10May 2, 2021Updated 4 years ago
- ☆11Oct 25, 2021Updated 4 years ago
- SPIE Medical Imaging 2019 Notes By Hao☆16Feb 26, 2019Updated 7 years ago
- This repository helps you extract useful information from Openpose node publisher via comparing position of body nodes and estimate the g…☆16Jun 8, 2018Updated 7 years ago
- All you need to make an intercom with raspberry pi (not finished yet)☆14Feb 11, 2026Updated last month
- Data generation, model training and inference for Visual Font Recognition using PyTorch☆19Dec 5, 2023Updated 2 years ago
- CLI tool for Markdown files, offering formatting, AI-powered reviews, linting, spell checking, and link checking to streamline your Markd…☆19Jan 27, 2025Updated last year
- Whisper fine-tuning event script to use multiple hf datasets☆32Dec 20, 2022Updated 3 years ago
- FunAudioLLM homepage☆17Dec 11, 2024Updated last year
- English Georgian Dictionary for iPhone☆21Apr 19, 2018Updated 7 years ago
- Unified-Multimodal Transformer Pipeline for Political Content Creation: TikTok Reel Generator (Highlight detection + visually tracked ver…☆16May 15, 2023Updated 2 years ago
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Train and fine-tune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated 11 months ago
- Lessons Learned from GPU Experiments with Aparapi☆13Apr 17, 2016Updated 9 years ago
- [TPAMI2025] Code for my paper "Semi-Supervised Unconstrained Head Pose Estimation in the Wild"☆18Sep 25, 2025Updated 6 months ago
- Digital Audio Effects in JavaScript☆11Updated this week
- A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆21Nov 25, 2024Updated last year
- Perform OSINT on external targets using Shodan☆23Feb 7, 2024Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- FFT-based windowed spectrum analyzer☆13Mar 10, 2017Updated 9 years ago
- Coinlen is a cryptocurrency exchange tracking system. ♡ This project was built in Dumaguete City, Negros Oriental, Philippines. ♡☆14Jan 30, 2021Updated 5 years ago
- Continuous speech recognition for Android demo☆14Feb 20, 2024Updated 2 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- ☆20Mar 23, 2024Updated 2 years ago
- ☆62Jul 25, 2024Updated last year
- GTK3.0/gtkmm 3.0 CMake OpenCV 3.3 webcam integration☆11May 17, 2021Updated 4 years ago
- Movie streaming website with Java Spring☆10Oct 3, 2024Updated last year
- A generalized neural network model in JavaScript☆10Feb 29, 2016Updated 10 years ago
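- FaceSystem is a conference check-in system that supports face recognition in meeting scenarios. It implements basic conference management functions: attendee information can be registered in advance with face data, and once registered, attendees can check in by face recognition.☆11Mar 4, 2023Updated 3 years ago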
- A Simple Authentication plugin for any Web Frameworks in Julia☆19Oct 25, 2023Updated 2 years ago