ASR: fine-tune wav2vec 2.0 with transformers
☆21 · Sep 13, 2021 · Updated 4 years ago
Alternatives and similar repositories for wav2vec_finetune
Users who are interested in wav2vec_finetune are comparing it to the libraries listed below.
- Finetune Wav2vec 2.0 For Speech Recognition ☆146 · Feb 6, 2025 · Updated last year
- Ecr-helper is a tool for call recording ☆26 · Apr 18, 2025 · Updated 10 months ago
- ☆15 · Aug 30, 2022 · Updated 3 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK) ☆16 · May 8, 2022 · Updated 3 years ago
- Homework completed during the first session of the Shenlan Academy (深蓝学院) course "Speech Recognition: From Beginner to Mastery", shared for reference. ☆21 · Sep 13, 2020 · Updated 5 years ago
- Went online decode demo ☆31 · Apr 28, 2021 · Updated 4 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS ☆33 · Mar 16, 2023 · Updated 2 years ago
- ☆32 · Dec 4, 2022 · Updated 3 years ago
- Tianjin University of Finance and Economics, School of Statistics, 2019 machine learning seminar ☆13 · Dec 9, 2019 · Updated 6 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment" ☆10 · May 5, 2024 · Updated last year
- Companion materials for a Bilibili video course ☆40 · Jun 12, 2023 · Updated 2 years ago
- Pre-trained Wav2vec2.0 for Mandarin ☆43 · Oct 30, 2022 · Updated 3 years ago
- Speech to text with self-supervised learning based on the wav2vec 2.0 framework ☆379 · Nov 22, 2021 · Updated 4 years ago
- ☆14 · Sep 17, 2024 · Updated last year
- ☆14 · Jan 9, 2025 · Updated last year
- Scrape historical weather data with Python and visualize it with pyecharts ☆10 · Dec 23, 2021 · Updated 4 years ago
- Speech Security and Privacy Compendium - Mini ☆10 · Jun 18, 2024 · Updated last year
- ☆37 · Dec 23, 2020 · Updated 5 years ago
- ☆16 · Nov 11, 2025 · Updated 3 months ago
- GMM-based isolated-word speech recognition system for the digits 0-9 ☆10 · Sep 29, 2020 · Updated 5 years ago
- ☆11 · Sep 26, 2022 · Updated 3 years ago
- Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance" ☆15 · Nov 26, 2025 · Updated 3 months ago
- ☆10 · Nov 28, 2020 · Updated 5 years ago
- 🕵️‍♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours) ☆17 · Feb 13, 2026 · Updated 3 weeks ago
- awesome-audio-visual-robustness ☆11 · Jan 27, 2024 · Updated 2 years ago
- Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023 ☆12 · Aug 24, 2025 · Updated 6 months ago
- ☆12 · Aug 5, 2022 · Updated 3 years ago
- [ECCV24] MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos ☆11 · Oct 7, 2024 · Updated last year
- Fine-tuning Llama2-7b and other LLMs for categorising emails for Deutsche Bahn (German National Railways) ☆13 · Oct 9, 2023 · Updated 2 years ago
- Official PyTorch code for "Vector Quantization Prompting for Continual Learning (NeurIPS2024)". ☆10 · Oct 16, 2024 · Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆42 · Sep 11, 2023 · Updated 2 years ago
- An upgraded framework for training and validation, compared with icefall, using Lightning. ☆15 · Mar 26, 2025 · Updated 11 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 · Dec 24, 2022 · Updated 3 years ago
- ☆12 · Jun 21, 2023 · Updated 2 years ago
- asr2k ☆52 · Jun 2, 2024 · Updated last year
- Shopping-cart quantity stepper (plus/minus input box), built end to end with HTML + CSS + JS ☆13 · Mar 21, 2018 · Updated 7 years ago
- Dataset for the paper 'Interview Choice Reveals Your Preference on the Market: To Improve Job-Resume Matching through Profiling Memories' ☆10 · Aug 7, 2019 · Updated 6 years ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark. ☆16 · Feb 15, 2025 · Updated last year