This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribing it, on the fly and manage the generated dataset 🤗. Fine tune Whisper or enhanced and custom datasets
☆32Nov 26, 2024Updated last year
Alternatives and similar repositories for Whisper-Synthetic-ASR-Dataset-Generator
Users that are interested in Whisper-Synthetic-ASR-Dataset-Generator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Speech-to-text transcription VST3/ARA plugin☆60Apr 13, 2026Updated last month
- Numeric input control with step buttons for Semantic UI React☆11Jan 10, 2024Updated 2 years ago
- Xell Bootloader, rewritten in Rust because ¯\_(ツ)_/¯☆17Oct 30, 2021Updated 4 years ago
- Can Neural Networks reconstruct missing audio data? What about GANs?☆18Nov 6, 2019Updated 6 years ago
- StpToStl an utility to Convert ISO 10303 STEP file (AP203, AP214) (.stp) to StereoLithography file (.stl)☆13Jul 3, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- boilerplate for developing Svelte app with Rust backend☆10Feb 4, 2023Updated 3 years ago
- Calculate the volume of a STEP file☆12Jun 17, 2021Updated 4 years ago
- A lightweight header-only c++ library for real time audio applications, oriented to the embedded world.☆18Jul 23, 2021Updated 4 years ago
- ComfyUI port of SDWebUI Vectorscope CC and Diffusion CG extensions☆21Feb 24, 2025Updated last year
- ☆19May 9, 2019Updated 7 years ago
- A zsh plugin that makes git worktrees much more functional☆17Feb 5, 2023Updated 3 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- ☆10Aug 3, 2019Updated 6 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Playing Commodore 64 SID Audio on Arduino☆14Oct 4, 2024Updated last year
- A utility to fetch and display dns names from the SSL/TLS cert data☆16Aug 11, 2023Updated 2 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- ☆14Aug 25, 2021Updated 4 years ago
- Hardware and support board schematics☆17Nov 10, 2016Updated 9 years ago
- Multi-lingual AudioCaps☆14Nov 20, 2023Updated 2 years ago
- ESP32 based C64 keyboard to bluetooth adapter☆11May 22, 2020Updated 6 years ago
- QuickSearch bar component made with svelte and fuzzy search☆20Aug 30, 2024Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A collection of helper scripts for Clojure, Java, Ledger and Taskwarrior. Written in Clojure.☆13Jun 2, 2023Updated 2 years ago
- ☆25Oct 11, 2024Updated last year
- Arduino/AVR C code for controlling the MOS6581 SID sound chip over MIDI☆10Oct 14, 2024Updated last year
- Rainbowgram with Python☆13Jan 28, 2019Updated 7 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- ☆27Jun 28, 2024Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆16Mar 6, 2023Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- RPi program to use Bluetooth and/or USB gamepads and mice on retro 8/16-bit computers (C64, Amiga, etc)☆15Dec 11, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Dimensionality reduction (UMAP, t-SNE, PCA) for ImageJ/Fiji☆12May 6, 2025Updated last year
- Generates spectrogram from images☆13Apr 26, 2021Updated 5 years ago
- ☆12May 1, 2019Updated 7 years ago
- Convert images to audio for display in a spectrogram☆12Apr 17, 2018Updated 8 years ago
- Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.☆10Jun 22, 2020Updated 5 years ago
- neo6502 ehbasic emulator☆17May 18, 2024Updated 2 years ago
- Using Deep Learning for singing voice separation - Project for the course DT2119 Speech and Speaker Recognition offered by KTH in 2018☆15Jun 16, 2018Updated 7 years ago