Pre-trained Wav2vec2.0 for Mandarin
☆43Oct 30, 2022Updated 3 years ago
Alternatives and similar repositories for Mandarin-Wav2Vec2
Users who are interested in Mandarin-Wav2Vec2 are comparing it to the repositories listed below.
- A benchmark corpus for ASR hypothesis revising task☆21Sep 26, 2023Updated 2 years ago
- A light webserver for monitoring RAM and GPU usage on multiple servers.☆21Mar 31, 2021Updated 4 years ago
- NTUST Programming Club, spring 2019☆25May 28, 2019Updated 6 years ago
- NTUST school-merger helper 🍡☆13Apr 21, 2023Updated 2 years ago
- Leaderboard and code for "Speech-IFEval", Interspeech 2025☆24May 27, 2025Updated 9 months ago
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆121Jul 15, 2025Updated 7 months ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- An app for NTUST students.☆13Jul 30, 2018Updated 7 years ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- Code for DeSTA2.5-Audio, general-purpose LALM☆128Feb 4, 2026Updated last month
- Chinese inverse text normalization (Chinese ITN), which converts Chinese numerals in text to Arabic numerals.☆24Jan 8, 2026Updated 2 months ago
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq☆169Sep 21, 2020Updated 5 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆42Sep 11, 2023Updated 2 years ago
- Explore different ways to mix speech models (wav2vec2, hubert) and NLP models (BART, T5, GPT) together☆46Jul 3, 2025Updated 8 months ago
- Speech model training based on Tacotron2☆15Oct 23, 2022Updated 3 years ago
- A bot that sends NTUST bulletin board information via Google Apps Script☆23Sep 22, 2022Updated 3 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆379Nov 22, 2021Updated 4 years ago
- ASR: fine-tune wav2vec 2.0 with transformers☆21Sep 13, 2021Updated 4 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- chinese speech pretrained models☆1,191Aug 23, 2024Updated last year
- ☆25Feb 12, 2023Updated 3 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 6 years ago
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) 🍓☆36Apr 3, 2025Updated 11 months ago
- Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.☆211Jan 18, 2024Updated 2 years ago
- Wind Turbine Blade Image Dataset☆13May 23, 2019Updated 6 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆46Dec 1, 2025Updated 3 months ago
- Finetune Wav2vec 2.0 For Speech Recognition☆146Feb 6, 2025Updated last year
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37May 3, 2024Updated last year
- ☆37Jun 28, 2021Updated 4 years ago
- ☆11Jun 7, 2023Updated 2 years ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Machine learning for control systems engineers☆11Jul 19, 2023Updated 2 years ago