ducanhdt/openai_whisper_finetuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ducanhdt/openai_whisper_finetuning)

ducanhdt / openai_whisper_finetuning

☆49

Alternatives and similar repositories for openai_whisper_finetuning

Users that are interested in openai_whisper_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nguyenvulebinh / lyric-alignment
View on GitHub
Vietnamese song lyric alignment framework
☆68Dec 11, 2022Updated 3 years ago
langmaninternet / VietnameseTextNormalizer
View on GitHub
Thư viện chuẩn hóa văn bản Tiếng Việt
☆180May 26, 2025Updated last year
khuyentran1401 / prefect-dvc
View on GitHub
☆23Nov 1, 2022Updated 3 years ago
ducnh279 / LLMs-Pretraining-with-PyTorch
View on GitHub
Code example for pretraining an LLM with vanilla PyTorch training loop
☆10Jun 6, 2024Updated 2 years ago
lampts / chatgpt-mle-interview
View on GitHub
ChatGPT solutions for the MLE interview
☆14Dec 9, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
protonx-tf-06-projects / lora-experiment-1
View on GitHub
Use LoRA technique to improve training Large Language Model
☆13Jul 25, 2023Updated 2 years ago
ryuryukke / japanese_summarizer
View on GitHub
A summarizer for Japanese articles (but ChatGPT is better)
☆10Aug 1, 2022Updated 3 years ago
tonhathuy / tensorrt-triton-magface
View on GitHub
Magface Triton Inferece Server Using Tensorrt
☆19Feb 12, 2022Updated 4 years ago
mzarvandi / SER-wav2vec
View on GitHub
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17Aug 8, 2021Updated 4 years ago
Reasoning-Lab / Elementary-Math-Solving-Zalo-AI-2023
View on GitHub
Baseline for ZaloAI Challenge 2023 Elementary Math Solving
☆67Jan 22, 2024Updated 2 years ago
vnk8071 / ZAIC2022-Lyric-Alignment
View on GitHub
Top 9 private leaderboard & Top 17 public leaderboard
☆10Dec 1, 2022Updated 3 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
igormq / speech2text
View on GitHub
☆12Feb 9, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
anhvth / WKaraokeMaker
View on GitHub
☆12Dec 15, 2022Updated 3 years ago
ngoanpv / llama2_vietnamese
View on GitHub
A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.
☆18Sep 12, 2023Updated 2 years ago
EraX-JS-Company / erax-vl-7b-v1
View on GitHub
EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.
☆13Dec 31, 2024Updated last year
bangoc123 / transformer
View on GitHub
Build English-Vietnamese machine translation with ProtonX Transformer. :D
☆75Sep 13, 2021Updated 4 years ago
yanghaha0908 / FastHuBERT
View on GitHub
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆100Nov 20, 2024Updated last year
Telegram-Zalo / zac2022-lyric-alignment
View on GitHub
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
☆68Dec 5, 2022Updated 3 years ago
siddsriv / Image-captioning
View on GitHub
Using a CNN-LSTM hybrid network to generate captions for images
☆18Nov 19, 2019Updated 6 years ago
unibuc-cs / game-testing
View on GitHub
Prototype for a game testing framework using AI methods
☆10Feb 25, 2023Updated 3 years ago
NoSavedDATA / Neve
View on GitHub
NSK Coding Language: Fast and Simple
☆15Jul 6, 2026Updated 2 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ANonEntity / WhisperWithVAD
View on GitHub
Whisper combined with Silero VAD, for improved long-form transcriptions
☆55Dec 11, 2022Updated 3 years ago
habla-liaa / ser-with-w2v2
View on GitHub
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
☆140Jan 6, 2025Updated last year
chester-w-xie / FCAC_datasets
View on GitHub
Details of the datasets for Few-shot class-incremental audio classification
☆10Dec 6, 2023Updated 2 years ago
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 8 months ago
bangoc123 / tensorflow-js
View on GitHub
TensorFlow JS Experiments at Google I/O Extended Hanoi 2018
☆20Jan 4, 2019Updated 7 years ago
PalabraAI / redimnet2
View on GitHub
This repository contains the official implementation and pretrained weights for the paper "ReDimNet2: Scaling Speaker Verification via Ti…
☆65Jul 9, 2026Updated last week
cuongducle / codex-linux
View on GitHub
Install OpenAI Codex Desktop on Linux — unofficial .deb and AppImage packages with Wayland support, auto-updates, and APT repository for …
☆21Updated this week
albertnguyen97 / coursera-free
View on GitHub
☆47Nov 7, 2023Updated 2 years ago
TorbenHellriegel / Speaker-Recognition-x-vectors
View on GitHub
This is my speaker recognition implementation based on the x-vector system described in "X-Vectors: Robust DNN Embeddings for Speaker Rec…
☆11Nov 3, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
thangnch / MiAI_Airflow
View on GitHub
Demo of using Airflow
☆11Jun 24, 2022Updated 4 years ago
msalhab96 / Conformer
View on GitHub
An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper
☆20Aug 16, 2022Updated 3 years ago
JinhuaLiang / LaD-ProtoNet
View on GitHub
☆16Sep 14, 2023Updated 2 years ago
ChedySmaoui / MyPythonStockPicker
View on GitHub
Full code for my Medium article on how I code a simple Python Stock Screen.
☆12Apr 17, 2024Updated 2 years ago
ds4v / absa-vlsp-2018
View on GitHub
End-to-end Multi-task Solutions for Aspect Category Sentiment Analysis (ACSA) on Vietnamese reviews, using PhoBERT as pretrained model
☆33Jul 9, 2024Updated 2 years ago
jefflai108 / Unsupervised-TTS
View on GitHub
☆42Mar 25, 2022Updated 4 years ago
deep-privacy / SA-toolkit
View on GitHub
SA-toolkit: Speaker speech anonymization toolkit in python
☆33Sep 18, 2025Updated 10 months ago