Pre-trained Wav2vec2.0 for Mandarin
☆43Oct 30, 2022Updated 3 years ago
Alternatives and similar repositories for Mandarin-Wav2Vec2
Users that are interested in Mandarin-Wav2Vec2 are comparing it to the libraries listed below.
- A benchmark corpus for ASR hypothesis revising task☆21Sep 26, 2023Updated 2 years ago
- A light webserver for monitoring RAM and GPU usage on multiple servers.☆21Mar 31, 2021Updated 5 years ago
- Taiwan Tech (NTUST) school-merger helper 🍡☆13Apr 21, 2023Updated 2 years ago
- Leaderboard and code for "Speech-IFEval", Interspeech 2025☆24May 27, 2025Updated 10 months ago
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆125Jul 15, 2025Updated 9 months ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- Code for DeSTA2.5-Audio, general-purpose LALM☆136Feb 4, 2026Updated 2 months ago
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- a simplified version of wav2vec(1.0, vq, 2.0) in fairseq☆171Sep 21, 2020Updated 5 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Explore different ways to combine speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT)☆46Jul 3, 2025Updated 9 months ago
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- A bot that sends Taiwan Tech (NTUST) bulletin board information via Google Apps Script☆23Sep 22, 2022Updated 3 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- Speech model training based on Tacotron2☆14Oct 23, 2022Updated 3 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Feb 27, 2022Updated 4 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 8 months ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆379Nov 22, 2021Updated 4 years ago
- An app for the Taiwan Tech (NTUST) community.☆13Jul 30, 2018Updated 7 years ago
- A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021☆47Feb 21, 2022Updated 4 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆15Feb 17, 2022Updated 4 years ago
- ASR: fine-tune wav2vec 2.0 with transformers☆21Sep 13, 2021Updated 4 years ago
- ☆12Dec 6, 2024Updated last year
- Wav2vec2 Large XLSR 53 fine-tuned for Malayalam☆11Sep 7, 2021Updated 4 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- Chinese speech pretrained models☆1,197Aug 23, 2024Updated last year
- Fine-tune Wav2vec 2.0 for speech recognition☆151Feb 6, 2025Updated last year
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Chinese inverse text normalization (Chinese ITN), i.e. converting Chinese numerals in text to Arabic numerals.☆27Jan 8, 2026Updated 3 months ago
- A module for normalising text.☆10Nov 6, 2019Updated 6 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- ☆12Aug 29, 2019Updated 6 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 7 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆14Oct 8, 2020Updated 5 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆58May 19, 2023Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago