maneesh-chouksey / speech-to-text-deep-learning-modelsView external linksLinks
Speech to text models; Bidirectional RNN, Attention Model and WaveNet models
☆11Nov 6, 2018Updated 7 years ago
Alternatives and similar repositories for speech-to-text-deep-learning-models
Users that are interested in speech-to-text-deep-learning-models are comparing it to the libraries listed below
Sorting:
- Tiny, easy, comfy and tough DIY SlimeVR's - PCB Gerbers, 3D Print Files, Shopping list and guide for assembly☆19Nov 10, 2025Updated 3 months ago
- Explaining audio differences using language☆16Feb 11, 2025Updated last year
- ☆10Oct 16, 2025Updated 4 months ago
- ☆15May 24, 2025Updated 8 months ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- Alfresco tests☆13Jan 27, 2026Updated 2 weeks ago
- Listing my favorite research papers 📝 from different fields as I read them.☆10Oct 17, 2019Updated 6 years ago
- ☆11Sep 25, 2024Updated last year
- ☆11Dec 28, 2023Updated 2 years ago
- Implementation of Transformer-based Text-to-Speech models from scratch to enhance speech synthesis, focusing on delivering more natural …☆11Feb 17, 2025Updated 11 months ago
- Room type classification☆14Jan 26, 2024Updated 2 years ago
- ☆13Jan 11, 2025Updated last year
- Inception v3-based convolutional neural network model for mutli-label image classification of photographs of apartment rooms.☆10Sep 23, 2022Updated 3 years ago
- Audio Entailment: Deductive Reasoning for Audio Understanding☆17Dec 10, 2024Updated last year
- Starter app that fetches orders from a Shopify store (via private app access token) using the GraphQL API.☆13Dec 8, 2022Updated 3 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year
- Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation☆15Sep 24, 2025Updated 4 months ago
- A fine tuned IndoBERT model for University Sentiment On Social Media☆13Jun 3, 2025Updated 8 months ago
- Aim to implement a classifier which classifies an audio sample into speech or music.☆10Sep 17, 2019Updated 6 years ago
- 🎙 Speech transcription and synthesis via Keras and Tensorflow.☆13Apr 1, 2018Updated 7 years ago
- Generating Video Caption Using LSTM☆12May 29, 2023Updated 2 years ago
- Cross-platform Bluetooth LE library for MAUI, Xamarin, Windows, and Linux applications☆14Oct 28, 2022Updated 3 years ago
- CS 2.2: Advanced Recursion and Graphs – Course Syllabus and Lessons☆10Jun 7, 2021Updated 4 years ago
- A CLI built in Node.js, to automate the process of creating a rest api / sockets backend basics☆11Mar 24, 2022Updated 3 years ago
- Sequelize's docs sucked, so I wrote new ones. https://ajbraus.github.io/sequelize-it/#/☆12Nov 12, 2020Updated 5 years ago
- ☆13Jul 14, 2024Updated last year
- Official code for "Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis"☆56Feb 3, 2026Updated last week
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- Neural Machine Translator for translating from english to hindi text. Used Pytorch framework with seq2seq architecture having Attention f…☆13Jan 21, 2019Updated 7 years ago
- I worked on this project with Guanghan Pan on July 2019 as a mini-research project for Professor Scharstein. The idea is to set up SteamV…☆15Aug 3, 2019Updated 6 years ago
- End-to-End Speech Recognition☆12Mar 2, 2021Updated 4 years ago
- Brief introduction of Social Network Analysis (SNA) and its implementation on Twitter Network☆14Oct 2, 2020Updated 5 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆17Nov 19, 2024Updated last year
- ☆14Sep 11, 2021Updated 4 years ago
- Odoo, configured for cloud-native production deployment (Docker, Redis, PostgreSQL)☆12Mar 31, 2023Updated 2 years ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16May 19, 2023Updated 2 years ago
- ☆18Nov 6, 2018Updated 7 years ago
- ☆13Apr 12, 2022Updated 3 years ago
- [NeurIPS24] VisMin: Visual Minimal-Change Understanding☆19Mar 3, 2025Updated 11 months ago