Converts spoken words into text form.
☆76Sep 17, 2025Updated 6 months ago
Alternatives and similar repositories for MAX-Speech-to-Text-Converter
Users that are interested in MAX-Speech-to-Text-Converter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate English-language text similar to the text in the Yelp® review data set.☆17Sep 17, 2025Updated 6 months ago
- Generate English-language text similar to the news articles in the One Billion Words data set.☆26Sep 17, 2025Updated 6 months ago
- ☆17Mar 5, 2026Updated 3 weeks ago
- Generate a summarized description of a body of text☆27Sep 17, 2025Updated 6 months ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Python package that can be installed to make it easier to create MAX models☆27May 10, 2021Updated 4 years ago
- Identify objects in an image, additionally assigning each pixel of the image to a particular object☆31Sep 17, 2025Updated 6 months ago
- Protect communications with adversarial neural cryptography.☆11Oct 31, 2018Updated 7 years ago
- Create a custom Watson Speech to Text model using specialized domain data☆59Aug 31, 2021Updated 4 years ago
- Generate a new image that mixes the content of a source image with the style of another image.☆52Sep 17, 2025Updated 6 months ago
- Image classifier for physical places/locations, based on the Places365-CNN Model☆42Sep 17, 2025Updated 6 months ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆223Jun 15, 2020Updated 5 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature☆24Mar 29, 2020Updated 6 years ago
- ☆13Jun 22, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Mar 18, 2019Updated 7 years ago
- Categorize sports videos according to which sport the video depicts.☆24Sep 17, 2025Updated 6 months ago
- Answer questions on a given corpus of text.☆33Sep 17, 2025Updated 6 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- Identify objects in images using a third-generation deep residual network.☆26Sep 17, 2025Updated 6 months ago
- IBM Code Model Asset Exchange: Show and Tell Image Caption Generator☆84Sep 17, 2025Updated 6 months ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Identify sounds in short audio clips☆157Sep 17, 2025Updated 6 months ago
- A CSS-only, resolution-independent "Fuck me on GitHub" ribbon.☆11Nov 2, 2025Updated 4 months ago
- Experiments with Hugging Face 🔬 🤗☆46Mar 18, 2026Updated last week
- Real-time speech enhancement based on spectral subtraction☆16Feb 18, 2018Updated 8 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17May 15, 2015Updated 10 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Generate embedding vectors from audio files☆60Sep 17, 2025Updated 6 months ago
- Detect emotion from audio☆14Nov 20, 2018Updated 7 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆22Dec 31, 2025Updated 2 months ago
- Adds color to black and white images.☆26Sep 17, 2025Updated 6 months ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- Korean ASR Corpus generated from TEDx talks☆27Jan 11, 2019Updated 7 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Sep 17, 2025Updated 6 months ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago