zqs01/data2vecnoisy

☆11

Alternatives and similar repositories for data2vecnoisy

Users that are interested in data2vecnoisy are comparing it to the libraries listed below.

prairie-schooner / wav2vec-vc
☆10 · Mar 22, 2023 · Updated 3 years ago
eastonYi / Unsupervised-ASR
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12 · Oct 22, 2020 · Updated 5 years ago
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆44 · May 23, 2023 · Updated 3 years ago
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
YUCHEN005 / RATS-Channel-A-Speech-Data
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16 · Oct 22, 2022 · Updated 3 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆49 · Dec 25, 2024 · Updated last year
the-bird-F / GLM-Voice-RAG
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31 · Jul 11, 2025 · Updated last year
YUCHEN005 / Gradient-Remedy
Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"
☆21 · May 24, 2023 · Updated 3 years ago
sony / bigvsan_eval
Evaluation tool used in the BigVSAN paper
☆14 · Mar 22, 2024 · Updated 2 years ago
mct10 / CoBERT
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆48 · Nov 8, 2023 · Updated 2 years ago
hhhaaahhhaa / ASR-TTA
☆16 · Nov 4, 2025 · Updated 8 months ago
YoshikiMas / madeon-asr
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆19 · Dec 1, 2024 · Updated last year
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆42 · Aug 11, 2023 · Updated 2 years ago
jjery2243542 / semi-supervised-ASR
☆10 · Dec 16, 2018 · Updated 7 years ago
jishengpeng / WavReward
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators
☆56 · May 15, 2025 · Updated last year
double22a / asr_nlp_paper_code
Papers of ASR, Tools of ASR
☆41 · Feb 14, 2025 · Updated last year
thuhcsi / NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
☆78 · Jan 17, 2025 · Updated last year
JaeDukSeo / Daily-Neural-Network-Practice-3
Daily Neural Network Practice Season 3! ( Finishing up Masters)
☆10 · Sep 9, 2019 · Updated 6 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21 · Sep 18, 2023 · Updated 2 years ago
metaspartan / inframon
Inframon - Local Network Server Monitor for macOS and Linux AMD Machines!
☆17 · Nov 13, 2024 · Updated last year
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
YUCHEN005 / UNA-GAN
Code for paper "Unsupervised Noise adaptation using Data Simulation"
☆14 · May 16, 2024 · Updated 2 years ago
danpovey / conditional-flow-matching
☆29 · Aug 8, 2024 · Updated last year
quanpn90 / NMTGMinor
A Neural Machine Translation toolkit for research purpose
☆82 · Feb 5, 2025 · Updated last year
arcadio / data-logic
Expert system with deductive querying and verification of constraints expressed in natural language
☆23 · Feb 15, 2026 · Updated 5 months ago
SpeechColab / GigaSpeechBench
☆27 · Jul 9, 2026 · Updated last week
NZII / ShipDataset
Ship remote sensing dataset
☆12 · Jun 28, 2022 · Updated 4 years ago
NKU-HLT / KNN-CTC
[ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels
☆42 · Mar 20, 2024 · Updated 2 years ago
voidful / SpeechMix
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
☆46 · Jul 3, 2025 · Updated last year
s920128 / NAR-BERT-ASR
NAR-BERT-ASR
☆10 · Sep 27, 2021 · Updated 4 years ago
nafiuny / ICRCycleGAN-VC
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆15 · Apr 15, 2026 · Updated 3 months ago
xiquan-li / FineLAP
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36 · Apr 20, 2026 · Updated 3 months ago
azmat21 / UyghurTextResource
uyghur text resource crawled from website
☆12 · Dec 25, 2015 · Updated 10 years ago
b04901014 / FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
☆90 · Mar 5, 2022 · Updated 4 years ago
Srijith-rkr / KAUST-Whisper-Adapter
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
☆41 · Sep 11, 2023 · Updated 2 years ago
playezio / WTBID
Wind Turbine Blade Image Dateset
☆14 · May 23, 2019 · Updated 7 years ago
xiaoxue1117 / speech-mamba-public
☆15 · Nov 26, 2024 · Updated last year
backspacetg / distilXLSR
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13 · Mar 30, 2025 · Updated last year
tzuhsien / Voice-conversion-evaluation
An evaluation toolkit for voice conversion models.
☆42 · Jul 11, 2021 · Updated 5 years ago