pashanitw / W2V2-BERT-ASR-TrainingView external linksLinks
☆15Mar 25, 2024Updated last year
Alternatives and similar repositories for W2V2-BERT-ASR-Training
Users that are interested in W2V2-BERT-ASR-Training are comparing it to the libraries listed below
Sorting:
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆12Nov 14, 2024Updated last year
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Jun 6, 2023Updated 2 years ago
- ☆17May 5, 2024Updated last year
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆22Jan 22, 2024Updated 2 years ago
- A TTS model that makes a speaker speak new languages☆76Jun 18, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- ☆22Jun 24, 2024Updated last year
- Wav2vec 2.0 Self-Supervised Pretraining☆58Feb 6, 2025Updated last year
- ☆25Mar 6, 2024Updated last year
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- 🩺🎧 Fix all your podcast, video or live stream audio! 🎧🩺☆27Jun 3, 2024Updated last year
- Tacotron2 for Korean (taKotron2)☆34Apr 8, 2022Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- Implementation of Google's USM speech model in Pytorch☆34Feb 7, 2026Updated last week
- Detecting and correction dysfluencies/stuttering/stammering in audio files☆10Apr 23, 2023Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆39Feb 20, 2025Updated 11 months ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆38Nov 30, 2023Updated 2 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- A beginner-friendly inference to finetune & run inference on open TTS models 🗣️☆26Feb 4, 2026Updated last week
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- 🥑 Intellij plugin to optimization Vector Drawable 🥑☆11Apr 7, 2019Updated 6 years ago
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆94Dec 3, 2024Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Aug 11, 2023Updated 2 years ago
- Automatically extract grammatical edits from parallel original and corrected sentences.☆11May 21, 2017Updated 8 years ago
- [2022.05.16 ~ 2022.06.10] 🌤️미세먼지 없는 맑은 사진📷 - 부스트캠프 AI Tech 3기 최종 프로젝트☆14Jun 11, 2022Updated 3 years ago
- ☆10Sep 2, 2024Updated last year
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago
- ☆10Mar 7, 2024Updated last year
- ☆10Mar 22, 2024Updated last year
- "Artificial General Intelligence For All (AGIFA)" Project☆12Feb 25, 2024Updated last year
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 6 months ago
- Simple Drag and Drop component of multiple UITableView written in Swift☆13Jan 6, 2023Updated 3 years ago