raj-sutariya/gujarati_speech_recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/raj-sutariya/gujarati_speech_recognition)

raj-sutariya / gujarati_speech_recognition

Offline speech recognition for Gujarati Language.

☆22

Alternatives and similar repositories for gujarati_speech_recognition

Users that are interested in gujarati_speech_recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

raj-sutariya / indic-num2words
View on GitHub
Python library for converting numbers to words for all Indian Languages.
☆38May 23, 2025Updated last year
lifelongeek / AAS_enhancement
View on GitHub
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…
☆28Oct 10, 2019Updated 6 years ago
Hiroshiba / openjtalk-label-getter
View on GitHub
☆10Dec 10, 2021Updated 4 years ago
xavierfav / feature-comparison-clustering
View on GitHub
Comparing Audio Features for Unsupervised Sound Classification
☆10Jun 22, 2022Updated 4 years ago
manymuch / Natural-Noise-Generator
View on GitHub
☆10Aug 3, 2019Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
altruist7 / AIChallenge
View on GitHub
☆10Jun 24, 2020Updated 6 years ago
rhasspy / wav2mel
View on GitHub
Transform audio files into mel spectrograms for text-to-speech model training
☆12Aug 25, 2021Updated 4 years ago
GPUPhobia / vocal-mask
View on GitHub
☆12May 1, 2019Updated 7 years ago
david-gimeno / tailored-avsr
View on GitHub
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15Feb 24, 2025Updated last year
muhdhuz / audio2spec
View on GitHub
Scripts to convert audio files to spectrograms and back
☆12Nov 23, 2017Updated 8 years ago
zassou65535 / WaveGAN
View on GitHub
WaveGANによる音声生成器
☆13Feb 9, 2024Updated 2 years ago
ronggong / phoneticSimilarity
View on GitHub
phonetic similarity algorithms
☆13Jun 19, 2018Updated 8 years ago
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago
unreal79 / pic2wav
View on GitHub
Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.
☆11Jan 25, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
d3n7 / riffusionPrepper
View on GitHub
Prepare spectrograms from audio for training a Riffusion model
☆16Mar 6, 2023Updated 3 years ago
tarepan / rainbowgram
View on GitHub
Rainbowgram with Python
☆13Jan 28, 2019Updated 7 years ago
sagiebenaim / Singing
View on GitHub
☆19May 9, 2019Updated 7 years ago
antinos / Dimensionality_Reduction-ImageJ
View on GitHub
Dimensionality reduction (UMAP, t-SNE, PCA) for ImageJ/Fiji
☆12May 6, 2025Updated last year
adrianbarahona / conditional_wavegan_knocking_sounds
View on GitHub
Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.
☆10Jun 22, 2020Updated 6 years ago
ryoasu / grad-cam
View on GitHub
Grad-CAM (Gradient-weighted Class Activation Mapping)
☆13Dec 20, 2019Updated 6 years ago
yoyolicoris / wavenet-like-vocoder
View on GitHub
Basic wavenet and fftnet vocoder model.
☆19Feb 7, 2022Updated 4 years ago
penzant / nlu_datasets_2018
View on GitHub
☆12Nov 9, 2018Updated 7 years ago
SequenceJS / modern-slide-in
View on GitHub
Sequence.js Theme - A minimalist theme for showcasing products
☆10Aug 21, 2015Updated 10 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mpuels / docker-py-kaldi-asr-and-model
View on GitHub
STT Service based on Kaldi ASR
☆15Aug 17, 2018Updated 7 years ago
evigog / VocalSeparation
View on GitHub
Using Deep Learning for singing voice separation - Project for the course DT2119 Speech and Speaker Recognition offered by KTH in 2018
☆15Jun 16, 2018Updated 8 years ago
phorward / xpl
View on GitHub
An eXample Programming Language
☆11Dec 20, 2018Updated 7 years ago
kjanjua26 / Sound_Classification_Spectrograms
View on GitHub
This repository contains code for classification of sound using spectrograms. We train a CNN to classify the sounds after converting to s…
☆10Dec 14, 2018Updated 7 years ago
hongfeixue / StutteringSpeechChallenge
View on GitHub
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12Jun 11, 2024Updated 2 years ago
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
fractalego / zero-shot-relation-extractor
View on GitHub
A zero-shot relation extractor, easily downloadable from the HuggingFace repo.
☆12Aug 13, 2021Updated 4 years ago
sungnyun / avsr-temporal-dynamics
View on GitHub
(SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition
☆13Oct 22, 2024Updated last year
JinScientist / voice-gender-recognition
View on GitHub
Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender
☆13Mar 26, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Hypotheses-Paradise / UADF
View on GitHub
☆17May 5, 2024Updated 2 years ago
yingtaoluo / Complex-Wavelet-Inception-GAN-Audio-Synthesis
View on GitHub
☆16Jan 20, 2021Updated 5 years ago
arnimarj / py-judy
View on GitHub
☆14Apr 13, 2026Updated 3 months ago
ixobert / birds-generation
View on GitHub
☆13Apr 22, 2024Updated 2 years ago
sacOO7 / socketcluster-client-C
View on GitHub
C/ C++ client for socketcluster framework in node.js
☆13Jun 2, 2021Updated 5 years ago
cobanov / audio-embedding
View on GitHub
Extract audio embeddings from an audio file using Python
☆13Jul 25, 2023Updated 3 years ago
han51 / nafld-1d-cnn
View on GitHub
1D-CNN models for NAFLD diagnosis and liver fat fraction quantification using radiofrequency ultrasound signals
☆13Jun 10, 2020Updated 6 years ago