IBM / MAX-Speech-to-Text-ConverterLinks
Converts spoken words into text form.
โ76Updated 3 months ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users that are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below
Sorting:
- ๐ธTTS recipes for different datasetsโ86Updated 3 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Frameworkโ47Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.โ25Updated 4 years ago
- Automatic Speech Recognition Dataset Generationโ37Updated 6 years ago
- ๐ฆ Nala is an agile open-source voice assistant framework (20+ actions).โ35Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsโ102Updated 5 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.โ129Updated 4 years ago
- An HTML interface for finetuning the sync map output from aeneasโ53Updated 3 years ago
- 24-hour Automatic Speech Recognitionโ27Updated 4 years ago
- ๐ง Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)โ225Updated 5 years ago
- Build your own Real-time Speech Emotion Recognizerโ116Updated 6 years ago
- Mozilla deepspeech server implemented in django.โ49Updated 4 years ago
- โ65Updated 2 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signatureโ24Updated 5 years ago
- Speaker diarization via transfer learningโ27Updated 6 years ago
- Identify sounds in short audio clipsโ155Updated 3 months ago
- ๐ฎ Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.โ172Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corporaโ82Updated last year
- A testing server for a speech to text service based on coqui.aiโ216Updated 3 years ago
- Create a custom Watson Speech to Text model using specialized domain dataโ60Updated 3 years ago
- ๐ A web app to play, visualize, and annotate your audio files for machine learningโ120Updated 5 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banksโ170Updated last year
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningโ40Updated 3 years ago
- ๐ฃ๏ธ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).โ384Updated 2 years ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciatโฆโ26Updated 6 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.โ49Updated 8 years ago
- Generate embedding vectors from audio filesโ59Updated 3 months ago
- Python SDK for the Microsoft Speaker Recognition API, part of Cognitive Servicesโ109Updated 2 years ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The oโฆโ22Updated 7 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?โ33Updated 7 years ago