All-in-one Speech Transcription
☆10Jan 25, 2026Updated last month
Alternatives and similar repositories for PromptingNemo
Users that are interested in PromptingNemo are comparing it to the libraries listed below
Sorting:
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 11 months ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- Tensorflow-based wake word detection☆17Jan 29, 2026Updated last month
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated last month
- A simple, but performant framework for mapping speech directly to categories and intents.☆25Aug 8, 2024Updated last year
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 4 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Jul 5, 2019Updated 6 years ago
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis☆52Apr 9, 2025Updated 10 months ago
- Small Language Model Inference, Fine-Tuning and Observability. No GPU, no labeled data needed.☆88Feb 3, 2026Updated 3 weeks ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆55Feb 11, 2026Updated 2 weeks ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆63Dec 23, 2025Updated 2 months ago
- Text Normalization utilities for normalizing text for TTS☆21Updated this week
- Node-RED Flow (and web page example) for the LLaMA AI model☆11Jul 27, 2023Updated 2 years ago
- zero-shot realtime TTS system, fully offline, free and open source☆51Apr 18, 2025Updated 10 months ago
- A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE…☆11May 5, 2024Updated last year
- Hanya dokumentasi bagaimana menggunakan opencv pada python.☆12Updated this week
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- CRUD with Authentication and Authorization using Get x cli pattern and Supabase☆12Nov 5, 2023Updated 2 years ago
- Depenency free (so far) Vanilla JS Dashboard UI for the mediamtx streaming server. Dockerized.☆32Feb 2, 2026Updated 3 weeks ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- Whisper finetuning☆16Apr 9, 2025Updated 10 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Big Data Inventory Management on AWS (Demand Forecasting, Machine Learning, Dashboarding) : Presented at Carlson School of Management dur…☆11Apr 15, 2020Updated 5 years ago
- Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20…☆15Sep 1, 2024Updated last year
- Docker powered container for using Nginx as reverse-proxy in combination with an OpenVPN Client.☆11Jan 1, 2020Updated 6 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- A simple python script to follow stock market papers in your portfolio☆12Jun 29, 2020Updated 5 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- Easy to use Dynamic Theme for Flutter with automatic persistence support.☆10Jul 30, 2025Updated 7 months ago
- 青岛船舶检测☆13Apr 16, 2025Updated 10 months ago
- My first ever training of a piper tts voice☆16May 23, 2025Updated 9 months ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago