This UI is a synthetic ASR dataset generator powered by, and built for, OpenAI Whisper. It lets users capture audio, transcribe it on the fly, and manage the generated dataset 🤗. Fine-tune Whisper on enhanced and custom datasets.
☆32Nov 26, 2024Updated last year
Alternatives and similar repositories for Whisper-Synthetic-ASR-Dataset-Generator
Users that are interested in Whisper-Synthetic-ASR-Dataset-Generator are comparing it to the libraries listed below.
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.☆11Updated this week
- Speech-to-text transcription VST3/ARA plugin☆61Jun 8, 2026Updated last week
- A Python package for converting numbers expressed in natural language to numerical values.☆13Nov 25, 2023Updated 2 years ago
- A real-time offline transcriber with a GUI, based on OpenAI Whisper☆17Dec 25, 2025Updated 5 months ago
- ☆34May 15, 2023Updated 3 years ago
- Can Neural Networks reconstruct missing audio data? What about GANs?☆18Nov 6, 2019Updated 6 years ago
- A lightweight header-only C++ library for real-time audio applications, oriented to the embedded world.☆18Jul 23, 2021Updated 4 years ago
- ComfyUI port of SDWebUI Vectorscope CC and Diffusion CG extensions☆21Feb 24, 2025Updated last year
- Notion Database Operator☆14Sep 30, 2025Updated 8 months ago
- Custom node for ComfyUI. Add a node for drawing text to the area of SEGS.☆14Mar 30, 2025Updated last year
- A helper to generate the README file automatically from YAML-based metadata files.☆19May 23, 2024Updated 2 years ago
- Human body part segmentation model, trained with 22 class labels.☆17Sep 28, 2023Updated 2 years ago
- ☆19May 9, 2019Updated 7 years ago
- ☆22Apr 10, 2026Updated 2 months ago
- Automated content cross posting from Notion Database to Dev.to, Hashnode, Medium, Twitter, and LinkedIn using GitHub Actions.☆13Oct 21, 2024Updated last year
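- A minimalist, efficient voice input assistant for Windows, built in Rust. Supports one-press recording via a global hotkey with the text typed directly on screen, a Dynamic Island-style floating status window, automatic punctuation, and mixed Chinese-English recognition. Lightweight, unobtrusive, and privacy-safe. Minimalist, high-performance voice-to-text assistant f…☆40Jun 5, 2026Updated last week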
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- ☆10Aug 3, 2019Updated 6 years ago
- Transform audio files into mel spectrograms for text-to-speech model training☆12Aug 25, 2021Updated 4 years ago
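- A modern desktop application written in Go for managing environment-variable configuration across multiple AI CLI tools (Claude Code, Codex, Gemini CLI). The tool uses a modern Bento Grid design style and is built with the Wails framework, providing a clean and elegant user interface.☆22Mar 8, 2026Updated 3 months ago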
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Playing Commodore 64 SID Audio on Arduino☆14Oct 4, 2024Updated last year
- Scripts to convert audio files to spectrograms and back☆12Nov 23, 2017Updated 8 years ago
- Audio generator based on WaveGAN☆13Feb 9, 2024Updated 2 years ago
- SGen is a generator capable of producing efficient hardware designs operating on streaming datasets. “Streaming” means that the dataset i…☆26Nov 11, 2025Updated 7 months ago
- ☆10Dec 10, 2021Updated 4 years ago
- ☆14Aug 25, 2021Updated 4 years ago
- Hardware and support board schematics☆17Nov 10, 2016Updated 9 years ago
- Multi-lingual AudioCaps☆14Nov 20, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- A collection of helper scripts for Clojure, Java, Ledger and Taskwarrior. Written in Clojure.☆13Jun 2, 2023Updated 3 years ago
- Arduino/AVR C code for controlling the MOS6581 SID sound chip over MIDI☆11Oct 14, 2024Updated last year
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- ☆27Jun 28, 2024Updated last year
- Prepare spectrograms from audio for training a Riffusion model☆16Mar 6, 2023Updated 3 years ago
- RPi program to use Bluetooth and/or USB gamepads and mice on retro 8/16-bit computers (C64, Amiga, etc)☆15Dec 11, 2020Updated 5 years ago
- CNN-to-FPGA framework for small CNNs, written in VHDL and Python☆24Jun 8, 2021Updated 5 years ago
- Dimensionality reduction (UMAP, t-SNE, PCA) for ImageJ/Fiji☆12May 6, 2025Updated last year