thevasudevgupta/gsoc-wav2vec2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thevasudevgupta/gsoc-wav2vec2)

thevasudevgupta / gsoc-wav2vec2

GSoC'2021 | TensorFlow implementation of Wav2Vec2

☆91

Alternatives and similar repositories for gsoc-wav2vec2

Users that are interested in gsoc-wav2vec2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sayakpaul / BiT-jax2tf
View on GitHub
This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.
☆14Dec 21, 2021Updated 4 years ago
farisalasmary / wav2vec2-kenlm
View on GitHub
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74Oct 11, 2021Updated 4 years ago
Open-Speech-EkStep / vakyansh-wav2vec2-experimentation
View on GitHub
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
☆89Sep 22, 2022Updated 3 years ago
sayakpaul / CI-CD-for-Model-Training
View on GitHub
This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.
☆20Oct 26, 2021Updated 4 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
JabuMlDev / Speaker-VGG-CCT
View on GitHub
Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…
☆25Feb 17, 2023Updated 3 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
soumik12345 / tf2_gans
View on GitHub
Implementations of GANs in Tensorflow 2.x
☆16Feb 12, 2022Updated 4 years ago
thevasudevgupta / speech-jax
View on GitHub
Speech in Flax/JAX
☆14Jul 11, 2022Updated 4 years ago
facebookresearch / grounding-inductive-biases
View on GitHub
reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"
☆17Sep 25, 2024Updated last year
ina-foss / InaGVAD
View on GitHub
Voice activity detection and speaker gender segmentation audiovisual corpus
☆16Jan 20, 2025Updated last year
lwang114 / UnsupTTS
View on GitHub
☆37Mar 26, 2024Updated 2 years ago
fanlu / wenet
View on GitHub
Transformer based ASR Engine.
☆13Aug 23, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tts-tutorial / interspeech2022
View on GitHub
☆162Sep 19, 2022Updated 3 years ago
farmaker47 / OCR_with_Keras
View on GitHub
☆16Aug 22, 2021Updated 4 years ago
daanzu / wav2vec2_stt_python
View on GitHub
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23Aug 16, 2021Updated 4 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
yistLin / universal-vocoder
View on GitHub
A PyTorch implementation of the universal neural vocoder
☆68Nov 6, 2020Updated 5 years ago
sayakpaul / Handwriting-Recognizer-in-Keras
View on GitHub
This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.
☆13Aug 15, 2021Updated 4 years ago
freds0 / kabooks
View on GitHub
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆13Mar 24, 2023Updated 3 years ago
deepakbaby / isegan
View on GitHub
Improved Speech Enhancement GANs
☆13Jun 24, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
habla-liaa / ser-with-w2v2
View on GitHub
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
☆140Jan 6, 2025Updated last year
deep-diver / complete-mlops-system-workflow
View on GitHub
☆17Sep 9, 2022Updated 3 years ago
b04901014 / FG-transformer-TTS
View on GitHub
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
☆90Mar 5, 2022Updated 4 years ago
TeaPoly / Conformer-Athena
View on GitHub
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44Nov 2, 2022Updated 3 years ago
taskswithcode / sota_researchers_with_published_code
View on GitHub
Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper
☆12Oct 19, 2023Updated 2 years ago
TensorSpeech / TensorFlowASR
View on GitHub
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…
☆1,009Updated this week
babua / TTSDatasetRecorder
View on GitHub
A simple app for recording speech datasets.
☆26Jun 27, 2022Updated 4 years ago
asappresearch / sew
View on GitHub
☆77Oct 25, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
qcri / ArabicASRChallenge2016
View on GitHub
This repository
☆32Nov 13, 2022Updated 3 years ago
trecpodcasts / podcast-audio-feature-extraction
View on GitHub
Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.
☆12Sep 30, 2021Updated 4 years ago
iamjanvijay / rnnt
View on GitHub
An implementation of RNN-Transducer loss in TF-2.0.
☆46Jan 7, 2026Updated 6 months ago
AdityaKane2001 / regnety
View on GitHub
Implementation of RegNetY in TensorFlow 2
☆21Jan 15, 2023Updated 3 years ago
monatis / german-tts
View on GitHub
German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support
☆26Jun 7, 2021Updated 5 years ago
techiaith / docker-huggingface-stt-cy
View on GitHub
Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace
☆13Nov 29, 2022Updated 3 years ago
sidleal / porsimplessent
View on GitHub
PorSimplesSent - A Portuguese corpus of aligned sentences pairs to investigate sentence readability assessment
☆13Jan 15, 2020Updated 6 years ago