IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
☆76 · Updated 7 months ago
Related projects:
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework ☆47 · Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ☆101 · Updated 4 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Updated 5 years ago
- 🐸TTS recipes for different datasets ☆84 · Updated 2 years ago
- Speech Commands Recognition in PyTorch ☆34 · Updated 5 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆23 · Updated 3 years ago
- Identifying people from small audio fragments ☆169 · Updated 4 years ago
- Mozilla DeepSpeech server implemented in Django. ☆49 · Updated 3 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆152 · Updated 2 months ago
- This repository contains all the code used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ☆25 · Updated 5 years ago
- Project to explore speaker and voice identification. Further speech recognition tasks will follow. ☆49 · Updated 5 years ago
- Speaker diarization system using an LSTM ☆49 · Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.x ☆70 · Updated 2 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions). ☆35 · Updated last year
- Tool for creation, manipulation and maintenance of voice corpora ☆80 · Updated 4 months ago
- Speech noise reduction built from existing post-production techniques, implemented in Python ☆176 · Updated 2 years ago
- Web app for keyword spotting using TensorFlow.js ☆69 · Updated last year
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow) ☆222 · Updated 4 years ago
- An HTML interface for fine-tuning the sync map output from aeneas ☆53 · Updated 2 years ago
- Tool to analyze audio corpora in terms of intonation, intensity, duration and voice quality ☆19 · Updated 5 years ago
- Automatic Speech Recognition Dataset Generation ☆36 · Updated 6 years ago
- Automatic Speaker Recognition algorithms in Python ☆92 · Updated 2 years ago
- CMPT726 Machine Learning Final Project ☆11 · Updated 5 years ago
- Speaker diarization via transfer learning ☆27 · Updated 5 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ☆128 · Updated 3 years ago
- Jupyter Notebooks for creating Speech datasets ☆46 · Updated 5 years ago
- A deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline. ☆48 · Updated 5 years ago
- DeepSpeech based forced alignment tool ☆232 · Updated 3 years ago
- End-to-end spoken language identification out of the box. ☆48 · Updated 3 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature ☆24 · Updated 4 years ago