m3hrdadfi/soxan

Wav2Vec for speech recognition, classification, and audio classification

☆276

Alternatives and similar repositories for soxan

Users who are interested in soxan are comparing it to the libraries listed below.

habla-liaa / ser-with-w2v2
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
☆140 · Jan 6, 2025 · Updated last year
b04901014 / FT-w2v2-ser
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
☆153 · Oct 26, 2021 · Updated 4 years ago
Edresson / Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆80 · May 20, 2023 · Updated 3 years ago
mailong25 / self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380 · Nov 22, 2021 · Updated 4 years ago
Data-Science-kosta / Speech-Emotion-Classification-with-PyTorch
This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.
☆213 · Nov 10, 2022 · Updated 3 years ago
shangeth / wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆92 · Jun 6, 2021 · Updated 5 years ago
alefiury / SE-R-2022-SER-Track
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16 · Mar 28, 2023 · Updated 3 years ago
SuperKogito / SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
☆420 · Sep 30, 2024 · Updated last year
cristinalunaj / MMEmotionRecognition
Repository with the code of the paper: A proposal for Multimodal Emotion Recognition using aural transformers and Action Units on RAVDESS …
☆111 · Mar 29, 2024 · Updated 2 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74 · Oct 11, 2021 · Updated 4 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12 · Jun 10, 2021 · Updated 5 years ago
ASR-project / Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…
☆266 · May 9, 2022 · Updated 4 years ago
oliverguhr / wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
☆379 · Feb 4, 2024 · Updated 2 years ago
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,556 · Mar 12, 2026 · Updated 4 months ago
HLasse / multidiagnosis-speech
☆10 · Jun 23, 2023 · Updated 3 years ago
khanld / ASR-Wav2vec-Finetune
Finetune Wav2vec 2.0 For Speech Recognition
☆150 · Feb 6, 2025 · Updated last year
anton-l / wav2vec-toolkit
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆30 · Apr 21, 2021 · Updated 5 years ago
jonatasgrosman / wav2vec2-sprint
☆206 · Feb 22, 2022 · Updated 4 years ago
LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆52 · Oct 8, 2021 · Updated 4 years ago
shamanez / BERT-like-is-All-You-Need
The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion R…
☆121 · Feb 26, 2021 · Updated 5 years ago
audeering / w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
☆555 · May 22, 2023 · Updated 3 years ago
kensho-technologies / pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469 · Jul 13, 2023 · Updated 3 years ago
m3hrdadfi / sentence-transformers
Sentence Embeddings with ParsBERT
☆56 · Jun 28, 2021 · Updated 5 years ago
TideDancer / interspeech21_emotion
☆111 · Aug 10, 2022 · Updated 3 years ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 · Aug 18, 2023 · Updated 2 years ago
mzarvandi / SER-wav2vec
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17 · Aug 8, 2021 · Updated 4 years ago
flaviorainhoavila / IEMOCAPspeechEmotionRecognition
Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET
☆27 · Mar 11, 2022 · Updated 4 years ago
eastonYi / wav2vec
a simplified version of wav2vec(1.0, vq, 2.0) in fairseq
☆170 · Sep 21, 2020 · Updated 5 years ago
EIHW / EmoNet
☆29 · Mar 8, 2022 · Updated 4 years ago
Kyoto-University-Speech-and-Audio / feng-asr-ser
☆10 · Sep 6, 2020 · Updated 5 years ago
huggingface / speechbox
☆358 · Mar 17, 2024 · Updated 2 years ago
georgian-io / Knowledge-Distillation-Toolkit
[DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
☆138 · Feb 20, 2024 · Updated 2 years ago
nikvaessen / w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
☆144 · May 10, 2022 · Updated 4 years ago
jumon / whisper-finetuning
[WIP] Scripts for fine-tuning Whisper
☆221 · Jul 2, 2026 · Updated 3 weeks ago
yanghaha0908 / FastHuBERT
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆100 · Nov 20, 2024 · Updated last year
thevasudevgupta / gsoc-wav2vec2
GSoC'2021 | TensorFlow implementation of Wav2Vec2
☆91 · Jan 11, 2022 · Updated 4 years ago
Demfier / multimodal-speech-emotion-recognition
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
☆450 · Dec 21, 2023 · Updated 2 years ago
usc-sail / peft-ser
[ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…
☆60 · Jul 1, 2024 · Updated 2 years ago
p0p4k / pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
☆228 · Dec 24, 2024 · Updated last year