i4Ds / whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
☆20Jan 26, 2026Updated 3 weeks ago
Alternatives and similar repositories for whisper-finetune
Users who are interested in whisper-finetune are comparing it to the libraries listed below.
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- Skin cancer classification project using deep learning techniques for automated diagnosis of skin lesions.☆11Jun 2, 2024Updated last year
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆23Jul 21, 2024Updated last year
- Reranking for Multi-objective Optimized Recommender Systems☆11Aug 3, 2023Updated 2 years ago
- ☆23Updated this week
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated 10 months ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆11Apr 29, 2024Updated last year
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆13Oct 20, 2024Updated last year
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- [arXiv 2025] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models☆35Aug 26, 2025Updated 5 months ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆20Oct 8, 2025Updated 4 months ago
- ☆12Feb 19, 2025Updated 11 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 5 months ago
- SAMatch: SAM-Guided and Match-based Semi-Supervised Segmentation for Medical Image☆13Jul 18, 2025Updated 6 months ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Tacotron 2 training notebook supporting Japanese, French, and Mandarin☆11Nov 19, 2022Updated 3 years ago
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Oct 16, 2018Updated 7 years ago
- ☆11Aug 29, 2025Updated 5 months ago
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me…☆12Dec 25, 2025Updated last month
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- ☆16Jan 6, 2025Updated last year
- (WWW'25 + Netflix) The first CRS that retrieves collaborative filtering knowledge with two-step context-aware reflection.☆18Sep 10, 2025Updated 5 months ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implemented in PyTorch.☆12Oct 4, 2020Updated 5 years ago
- A simple demo showing how to use the Ideogram inpainting model on Replicate using Node.js.☆14Oct 24, 2024Updated last year
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 7 months ago
- Logo detection in images using SSD☆10Jul 13, 2018Updated 7 years ago
- General information about DEEP BERLIN's AI for Good Hackathon 2020☆11Apr 14, 2020Updated 5 years ago
- An enhanced version of the goose3 general-purpose web page extractor (add the author on WeChat (VX): 862187570, for Python discussion and learning)☆16Nov 18, 2021Updated 4 years ago
- Kaggle notebook for Fooocus☆11Jun 16, 2025Updated 8 months ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- The implemented code of RAMEM, Real-time Automatic M-mode Echocardiography Measurement with Panel Attention from Local-to-Global Pixels.☆14Aug 16, 2023Updated 2 years ago
- An immediate mode GUI that works on top of Canvas2D (it can also work in WebGL)☆13Apr 21, 2021Updated 4 years ago
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- The official website of drawjs☆10Jan 17, 2025Updated last year
- ☆18Dec 27, 2025Updated last month
- Open-source API for Touch Sensors☆13Aug 19, 2025Updated 5 months ago
- EAST-inspired Tensorflow-based Text Detector☆11Feb 18, 2021Updated 4 years ago
- Codes for Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks☆12May 8, 2024Updated last year