Pre-trained Wav2vec2.0 for Mandarin
☆43Oct 30, 2022Updated 3 years ago
Alternatives and similar repositories for Mandarin-Wav2Vec2
Users that are interested in Mandarin-Wav2Vec2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A benchmark corpus for ASR hypothesis revising task☆21Sep 26, 2023Updated 2 years ago
- A light webserver for monitoring RAM and GPU usage on multiple servers.☆21Mar 31, 2021Updated 5 years ago
- 臺科大程式設計社 2019 spring☆25May 28, 2019Updated 6 years ago
- 臺科併校小幫手 🍡☆13Apr 21, 2023Updated 3 years ago
- Leaderboard and code for "Speech-IFEval", Interspeech 2025☆24May 27, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆126Jul 15, 2025Updated 9 months ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- Code for DeSTA2.5-Audio, general-purpose LALM☆139Feb 4, 2026Updated 3 months ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq☆171Sep 21, 2020Updated 5 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- 一個透過Google App Script發送台科公佈欄資訊的機器人☆23Sep 22, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- 基于Tacotron2进行语音模型训练☆14Oct 23, 2022Updated 3 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Feb 27, 2022Updated 4 years ago
- 一個屬於台科人的 App。☆13Jul 30, 2018Updated 7 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆47Feb 21, 2022Updated 4 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆16Feb 17, 2022Updated 4 years ago
- ASR: fine-tune wav2vec 2.0 with transformers☆21Sep 13, 2021Updated 4 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆11Oct 25, 2023Updated 2 years ago
- chinese speech pretrained models☆1,203Aug 23, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Finetune Wa2vec 2.0 For Speech Recognition☆151Feb 6, 2025Updated last year
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- 中文逆文本正则化 (Chinese ITN, Chinese Inverse Text Normalization) ,即将文本中的中文数字转为阿拉伯数字。☆27Jan 8, 2026Updated 4 months ago
- A module for normalising text.☆10Nov 6, 2019Updated 6 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- ☆12Aug 29, 2019Updated 6 years ago
- ☆19Aug 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 7 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆14Oct 8, 2020Updated 5 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆60May 19, 2023Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- RDrop 的 torch版☆16Jul 15, 2021Updated 4 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated last year
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago