☆15Mar 25, 2024Updated 2 years ago
Alternatives and similar repositories for W2V2-BERT-ASR-Training
Users that are interested in W2V2-BERT-ASR-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆14Jun 6, 2023Updated 2 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆17Nov 14, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- ☆18Jul 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17May 5, 2024Updated 2 years ago
- ☆23Jun 24, 2024Updated last year
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Companion toolkit of the 'Serial Speakers' dataset.☆11Feb 17, 2020Updated 6 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated 2 years ago
- Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"☆10Mar 11, 2020Updated 6 years ago
- ☆13Nov 11, 2023Updated 2 years ago
- Dashboard showcasing Conjoint Analysis for the Electric Vehicle Lease Market (as at January 2020) in San Francisco☆14Feb 19, 2020Updated 6 years ago
- A TTS model that makes a speaker speak new languages☆76Jun 18, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- PyTorch code for Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation (AQM+)☆10Feb 12, 2019Updated 7 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 4 years ago
- Tacotron2 for Korean (taKotron2)☆34Apr 8, 2022Updated 4 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- Wav2vec 2.0 Self-Supervised Pretraining☆60Feb 6, 2025Updated last year
- Slides and talks from presentations, workshops, etc'☆19Feb 1, 2026Updated 3 months ago
- ☆10Mar 10, 2021Updated 5 years ago
- A subset of the popular LibriTTS dataset with subsets for English, Scottish, Welsh, and Irish accents.☆16Mar 17, 2023Updated 3 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Raw waveform adaptation with SincNet☆12Mar 19, 2024Updated 2 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- [2022.05.16 ~ 2022.06.10] 🌤️미세먼지 없는 맑은 사진📷 - 부스트캠프 AI Tech 3기 최종 프로젝트☆14Jun 11, 2022Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆41Jan 6, 2024Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆35Nov 19, 2024Updated last year
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Apr 15, 2026Updated last month
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- ☆11Oct 20, 2022Updated 3 years ago
- ☆55Jul 16, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Implementation of Google's USM speech model in Pytorch☆36May 11, 2026Updated 2 weeks ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea…☆14May 11, 2021Updated 5 years ago
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆22Jan 22, 2024Updated 2 years ago
- ☆10Jun 23, 2023Updated 2 years ago
- NLP stuff with quantum computing☆17Nov 9, 2020Updated 5 years ago
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago