IBM / MAX-Speech-to-Text-Converter
Converts spoken words into text form.
★76 · Updated 3 weeks ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users who are interested in MAX-Speech-to-Text-Converter are comparing it to the repositories listed below.
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments ★103 · Updated 5 years ago
- 🐸TTS recipes for different datasets ★86 · Updated 3 years ago
- Web app for keyword spotting using TensorflowJS ★74 · Updated 2 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework ★48 · Updated 2 years ago
- A web app to play, visualize, and annotate your audio files for machine learning ★119 · Updated 5 years ago
- A crash course for training speech recognition models using DeepSpeech. ★25 · Updated 4 years ago
- Nala is an agile open-source voice assistant framework (20+ actions). ★35 · Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets ★46 · Updated 6 years ago
- Identify sounds in short audio clips ★155 · Updated 3 weeks ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ★130 · Updated 4 years ago
- 24-hour Automatic Speech Recognition ★27 · Updated 4 years ago
- A testing server for a speech to text service based on coqui.ai ★217 · Updated 3 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python ★180 · Updated 3 years ago
- Automatic Speech Recognition Dataset Generation ★37 · Updated 7 years ago
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts). ★384 · Updated 2 years ago
- A very basic demonstration connecting speech recognition and text-to-speech ★20 · Updated 5 years ago
- Conversational AI Benchmark. ★68 · Updated 2 years ago
- Generate embedding vectors from audio files ★59 · Updated 3 weeks ago
- Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow) ★225 · Updated 5 years ago
- Create a custom Watson Speech to Text model using specialized domain data ★59 · Updated 4 years ago
- An HTML interface for finetuning the sync map output from aeneas ★53 · Updated 3 years ago
- Mozilla DeepSpeech server implemented in Django. ★49 · Updated 4 years ago
- Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks. ★52 · Updated 6 years ago
- ★84 · Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora ★82 · Updated last year
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ★26 · Updated 6 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks ★170 · Updated last year
- Experiment with "one-shot learning" techniques to recognize a voice signature ★24 · Updated 5 years ago
- Identifying people from small audio fragments ★170 · Updated 5 years ago
- Speaker diarization via transfer learning ★27 · Updated 6 years ago