mxkrn / onnxruntime-web-tutorial
Browser-native machine learning app using ONNX Runtime Web
☆26Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for onnxruntime-web-tutorial
- ☆32Updated 2 years ago
- ☆23Updated last year
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last month
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 3 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated this week
- Generate embedding vectors from audio files☆56Updated last year
- A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU…☆11Updated 6 months ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆17Updated last year
- Hifi-like Vocoder implemented in PyTorch☆12Updated 2 years ago
- SDX23 startkit for the Demucs baselines.☆24Updated last year
- ☆22Updated 3 years ago
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆21Updated 6 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆11Updated 3 months ago
- Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition☆23Updated 5 years ago
- Audio tokenization, in the fastest way possible!☆45Updated 2 months ago
- ☆17Updated 3 years ago
- ☆11Updated 3 years ago
- A collection of utilities for handling IPA phones.☆24Updated last year
- ☆59Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆50Updated 3 years ago
- Landing Page for Divide and Remaster v3☆13Updated 3 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 8 months ago
- ☆17Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- ☆12Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year