kehanlu/Mandarin-Wav2Vec2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kehanlu/Mandarin-Wav2Vec2)

kehanlu / Mandarin-Wav2Vec2

Pre-trained Wav2vec2.0 for Mandarin

☆43

Alternatives and similar repositories for Mandarin-Wav2Vec2

Users that are interested in Mandarin-Wav2Vec2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Alfred0622 / HypR
View on GitHub
A benchmark corpus for ASR hypothesis revising task
☆21Sep 26, 2023Updated 2 years ago
kehanlu / server-monitor
View on GitHub
A light webserver for monitoring RAM and GPU usage on multiple servers.
☆21Mar 31, 2021Updated 5 years ago
kehanlu / python
View on GitHub
臺科大程式設計社 2019 spring
☆25May 28, 2019Updated 7 years ago
kehanlu / University
View on GitHub
臺科併校小幫手 🍡
☆13Apr 21, 2023Updated 3 years ago
kehanlu / Speech-IFEval
View on GitHub
Leaderboard and code for "Speech-IFEval", Interspeech 2025
☆24May 27, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
kehanlu / DeSTA2
View on GitHub
Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"
☆127Jul 15, 2025Updated last year
YosukeHiguchi / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆16Jan 20, 2025Updated last year
cristinae / ASRdys
View on GitHub
ASR for dysarthric speakers with Kaldi
☆13Jan 14, 2017Updated 9 years ago
eastonYi / wav2vec
View on GitHub
a simplified version of wav2vec(1.0, vq, 2.0) in fairseq
☆170Sep 21, 2020Updated 5 years ago
s920128 / NAR-BERT-ASR
View on GitHub
NAR-BERT-ASR
☆10Sep 27, 2021Updated 4 years ago
pyf98 / speech-model-compression
View on GitHub
A collection of papers related to speech model compression
☆27Jul 31, 2023Updated 2 years ago
voidful / SpeechMix
View on GitHub
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
☆46Jul 3, 2025Updated last year
narihira2000 / GAS-NTUST-bulletin
View on GitHub
一個透過Google App Script發送台科公佈欄資訊的機器人
☆23Sep 22, 2022Updated 3 years ago
yangjingyuan / ConstDecoder
View on GitHub
☆11Oct 24, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mzarvandi / SER-wav2vec
View on GitHub
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17Aug 8, 2021Updated 4 years ago
mailong25 / self-supervised-speech-recognition
View on GitHub
speech to text with self-supervised learning based on wav2vec 2.0 framework
☆380Nov 22, 2021Updated 4 years ago
splitline / NTUSTapp
View on GitHub
一個屬於台科人的 App。
☆13Jul 30, 2018Updated 7 years ago
OmarMohammed88 / AR-Emotion-Recognition
View on GitHub
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…
☆16Feb 17, 2022Updated 4 years ago
Y5neKO / Tacotron2_Chinese
View on GitHub
基于Tacotron2进行语音模型训练
☆14Oct 23, 2022Updated 3 years ago
UniversalDependencies / UD_Chinese-HK
View on GitHub
Spoken mandarin Chinese from Hong Kong.
☆13May 6, 2026Updated 2 months ago
qinyuenlp / wav2vec_finetune
View on GitHub
ASR: fine-tune wav2vec 2.0 with transformers
☆21Sep 13, 2021Updated 4 years ago
khanld / ASR-Wav2vec-Finetune
View on GitHub
Finetune Wa2vec 2.0 For Speech Recognition
☆150Feb 6, 2025Updated last year
ictnlp / BT4ST
View on GitHub
Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".
☆11Oct 25, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gauthamsuresh09 / wav2vec2-large-xlsr-53-malayalam
View on GitHub
Wav2vec2 Large XLSR 53 fine-tuned for Malayalam
☆11Sep 7, 2021Updated 4 years ago
TencentGameMate / chinese_speech_pretrain
View on GitHub
chinese speech pretrained models
☆1,211Aug 23, 2024Updated last year
MingLunHan / CIF-ColDec
View on GitHub
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
☆25Jul 14, 2026Updated 2 weeks ago
resemble-ai / normalise
View on GitHub
A module for normalising text.
☆10Nov 6, 2019Updated 6 years ago
voidful / llm-codec
View on GitHub
LLM-Codec: Neural Audio Codec Meets Language Model Objectives
☆23May 3, 2026Updated 2 months ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
SkyOL5 / VQA-CoAttention
View on GitHub
☆12Aug 29, 2019Updated 6 years ago
Xianchao-Wu / wenet-deep-sparse-conformer
View on GitHub
☆15Aug 25, 2022Updated 3 years ago
HarunoriKawano / Wav2vec2.0
View on GitHub
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.
☆60May 19, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
krylm / whisper-event-tuning
View on GitHub
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Dec 24, 2022Updated 3 years ago
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
vivraj17 / Detection-Of-Parkinson-s-Disesase-Using-Voice-Impairments-With-ML-and-LSTM
View on GitHub
Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…
☆12Apr 1, 2019Updated 7 years ago
fushengwuyu / R-Drop
View on GitHub
RDrop 的 torch版
☆16Jul 15, 2021Updated 5 years ago
pardnchiu / HakoRun
View on GitHub
A Go FaaS platform with Bubblewrap sandboxing, Redis script versioning, and SSE streaming
☆19Jul 13, 2026Updated 2 weeks ago
LingweiMeng / Whisper-Sidecar
View on GitHub
The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".
☆34Aug 2, 2025Updated 11 months ago
cuhealthybrains / MT-LLM
View on GitHub
The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"
☆51Apr 7, 2025Updated last year