Converts spoken words into text form.
☆77Sep 17, 2025Updated 8 months ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users that are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate English-language text similar to the text in the Yelp® review data set.☆17Sep 17, 2025Updated 8 months ago
- Generate English-language text similar to the news articles in the One Billion Words data set.☆26Sep 17, 2025Updated 8 months ago
- ☆17May 21, 2026Updated last week
- Generate a summarized description of a body of text☆27Sep 17, 2025Updated 8 months ago
- Train a neural network component that can add spatial transformations such as translation and rotation to larger models.☆10Apr 18, 2019Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Python package that can be installed to make it easier to create MAX models☆27May 10, 2021Updated 5 years ago
- Identify objects in images using a first-generation deep residual network.☆15Sep 17, 2025Updated 8 months ago
- Identify objects in an image, additionally assigning each pixel of the image to a particular object☆31Sep 17, 2025Updated 8 months ago
- Protect communications with adversarial neural cryptography.☆11Oct 31, 2018Updated 7 years ago
- Create a custom Watson Speech to Text model using specialized domain data☆60Aug 31, 2021Updated 4 years ago
- Generate personalized recommendations☆14Sep 17, 2025Updated 8 months ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆222Jun 15, 2020Updated 5 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature☆24Mar 29, 2020Updated 6 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Mar 18, 2019Updated 7 years ago
- Answer questions on a given corpus of text.☆33Sep 17, 2025Updated 8 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆16Mar 26, 2022Updated 4 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- IBM Code Model Asset Exchange: Show and Tell Image Caption Generator☆82Sep 17, 2025Updated 8 months ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- A simple version of the MAX Object Detector Web App rewritten in python for use in the MAX tutorial☆10Mar 31, 2021Updated 5 years ago
- Identify sounds in short audio clips☆158Sep 17, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Android Push notifications SDK for IBM Cloud Mobile Services☆10Apr 30, 2021Updated 5 years ago
- Experiments with Hugging Face 🔬 🤗☆46Apr 18, 2026Updated last month
- Real-time speech enhancement based on spectral subtraction☆16Feb 18, 2018Updated 8 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Locate and tag named entities in text☆25Sep 17, 2025Updated 8 months ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- MAX Optical Character Recognition☆51Sep 17, 2025Updated 8 months ago
- Detect emotion from audio☆14Nov 20, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 4 years ago
- A cookie-cutter / skeleton for MAX repos☆19Sep 17, 2025Updated 8 months ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Adds color to black and white images.☆26Sep 17, 2025Updated 8 months ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- ☆11Aug 29, 2022Updated 3 years ago
- Korean ASR Corpus generated from TEDx talks☆27Jan 11, 2019Updated 7 years ago