Speech-to-text models: Bidirectional RNN, Attention Model and WaveNet models
☆11 · Nov 6, 2018 · Updated 7 years ago
Alternatives and similar repositories for speech-to-text-deep-learning-models
Users that are interested in speech-to-text-deep-learning-models are comparing it to the libraries listed below.
- ☆14 · Sep 11, 2021 · Updated 4 years ago
- Listing my favorite research papers from different fields as I read them. ☆10 · Oct 17, 2019 · Updated 6 years ago
- A fine-tuned IndoBERT model for University Sentiment On Social Media ☆14 · Jun 3, 2025 · Updated 9 months ago
- Starter app that fetches orders from a Shopify store (via private app access token) using the GraphQL API. ☆13 · Dec 8, 2022 · Updated 3 years ago
- A CLI built in Node.js to automate the process of creating the basics of a REST API / sockets backend ☆11 · Mar 24, 2022 · Updated 4 years ago
- Room type classification ☆14 · Jan 26, 2024 · Updated 2 years ago
- Brief introduction of Social Network Analysis (SNA) and its implementation on Twitter Network ☆14 · Oct 2, 2020 · Updated 5 years ago
- Speech transcription and synthesis via Keras and TensorFlow. ☆13 · Apr 1, 2018 · Updated 7 years ago
- ☆13 · Jan 11, 2025 · Updated last year
- A short walkthrough showing how to use Watson TTS with different language models. ☆18 · Aug 16, 2020 · Updated 5 years ago
- Odoo, configured for cloud-native production deployment (Docker, Redis, PostgreSQL) ☆12 · Mar 31, 2023 · Updated 2 years ago
- Inception v3-based convolutional neural network model for multi-label image classification of photographs of apartment rooms. ☆10 · Sep 23, 2022 · Updated 3 years ago
- Pedagogy is a feedback-driven performance management app for education professionals built with Flask, Altair (Altair-viz) and pandas ☆23 · Apr 7, 2023 · Updated 2 years ago
- Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet - IEEE (ICICT - 2022) ☆24 · Nov 30, 2024 · Updated last year
- Tiny, easy, comfy and tough DIY SlimeVR's - PCB Gerbers, 3D Print Files, Shopping list and guide for assembly ☆20 · Updated this week
- Implementation of Transformer-based Text-to-Speech models from scratch to enhance speech synthesis, focusing on delivering more natural … ☆11 · Feb 17, 2025 · Updated last year
- ☆16 · May 24, 2025 · Updated 10 months ago
- ☆20 · Nov 6, 2018 · Updated 7 years ago
- Alfresco tests ☆13 · Jan 27, 2026 · Updated 2 months ago
- ☆13 · Apr 12, 2022 · Updated 3 years ago
- Sequelize's docs sucked, so I wrote new ones. https://ajbraus.github.io/sequelize-it/#/ ☆12 · Nov 12, 2020 · Updated 5 years ago
- Generating Video Caption Using LSTM ☆12 · May 29, 2023 · Updated 2 years ago
- Cross-platform Bluetooth LE library for MAUI, Xamarin, Windows, and Linux applications ☆14 · Oct 28, 2022 · Updated 3 years ago
- Explaining audio differences using language ☆16 · Feb 11, 2025 · Updated last year
- Neural Machine Translator for translating from English to Hindi text. Used PyTorch framework with seq2seq architecture having Attention f… ☆13 · Jan 21, 2019 · Updated 7 years ago
- A warehouse 3D model based on a specific data structure. Model is drawn by ThreeJS ☆24 · Jun 13, 2021 · Updated 4 years ago
- CS 2.2: Advanced Recursion and Graphs - Course Syllabus and Lessons ☆10 · Jun 7, 2021 · Updated 4 years ago
- ☆10 · Oct 16, 2025 · Updated 5 months ago
- Repository for "Training Audio Captioning Models without Audio" ☆10 · Sep 26, 2023 · Updated 2 years ago
- I worked on this project with Guanghan Pan in July 2019 as a mini-research project for Professor Scharstein. The idea is to set up SteamV… ☆15 · Aug 3, 2019 · Updated 6 years ago
- ☆24 · Apr 22, 2025 · Updated 11 months ago
- ☆10 · Sep 25, 2024 · Updated last year
- Valve's Lighthouse v2 positioning system decoder implemented for the RP2040 ☆22 · Oct 1, 2025 · Updated 5 months ago
- End-to-End Speech Recognition ☆12 · Mar 2, 2021 · Updated 5 years ago
- ☆11 · Dec 28, 2023 · Updated 2 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆15 · Jun 23, 2024 · Updated last year
- ☆18 · Aug 8, 2021 · Updated 4 years ago
- Audio Entailment: Deductive Reasoning for Audio Understanding ☆17 · Dec 10, 2024 · Updated last year
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most s… ☆18 · Jul 14, 2019 · Updated 6 years ago