Speech-Lab-IITM/data2vec-aqc

Repository containing the code and models from the paper "data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup"

☆13

Alternatives and similar repositories for data2vec-aqc

Users that are interested in data2vec-aqc are comparing it to the libraries listed below.

KrishnaDN / E2E_ASR_Confidence_Estimation
Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16 · May 9, 2021 · Updated 5 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
Code for the method proposed in the paper: ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23 · Mar 18, 2024 · Updated 2 years ago
X-LANCE / public_talks
Materials from public talks given by SJTU X-LANCE members
☆14 · Dec 3, 2022 · Updated 3 years ago
wutong8023 / SpeechRE
☆11 · Nov 11, 2022 · Updated 3 years ago
huangruizhe / ConEC
☆14 · Jun 17, 2024 · Updated 2 years ago
Splend1d / T5lephone
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19 · Nov 29, 2022 · Updated 3 years ago
skit-ai / N-Best-ASR-Transformer
Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."
☆17 · Nov 30, 2021 · Updated 4 years ago
skit-ai / slu-prosody
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27 · May 17, 2023 · Updated 3 years ago
the-bird-F / GLM-Voice-RAG
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31 · Jul 11, 2025 · Updated last year
jindongwang / EasyEspnet
Making Espnet easier to use
☆54 · Apr 9, 2021 · Updated 5 years ago
zerospeech / zerospeech2021
Zerospeech Challenge 2021: validation and evaluation software
☆12 · Jun 13, 2022 · Updated 4 years ago
NickyFot / ACMMM22_LearningLabelRelationships
☆11 · Jun 20, 2023 · Updated 3 years ago
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 · Sep 4, 2023 · Updated 2 years ago
B06901052 / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆13 · Oct 11, 2022 · Updated 3 years ago
HuangZiliAndy / SSL_for_multitalker
Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings
☆33 · Mar 16, 2023 · Updated 3 years ago
coqui-ai / data-checker
🫠 check your data, before you wreck your model
☆16 · Aug 11, 2022 · Updated 3 years ago
averkij / Word-to-Number-Russian
A project for converting numbers written out as text in Russian.
☆11 · Apr 5, 2022 · Updated 4 years ago
notAI-tech / IndicASR
Speech recognition for Indic languages.
☆13 · Apr 3, 2021 · Updated 5 years ago
MRYangY / AAudioDemo
☆12 · Feb 5, 2023 · Updated 3 years ago
rasenganai / Illegal_Parking
Using AI based approach to detect illegal parking of vehicles (Cars) from an image. The model will receive an image of parked car through…
☆11 · Jun 2, 2020 · Updated 6 years ago
verma-anushka / Gaming-Zone
The Gaming Zone is a web application that provides you with a collection of classic retro games, including puzzle games, trivia games, bo…
☆10 · Feb 11, 2020 · Updated 6 years ago
mohan696matlab / whisper-finetuning-youtube-serise
☆16 · May 14, 2025 · Updated last year
mutiann / speech_rankings
A CSRankings-like index for speech researchers
☆35 · Oct 16, 2024 · Updated last year
facebookresearch / fbai-speech
Repo for the FB AI Speech team.
☆27 · Aug 24, 2021 · Updated 4 years ago
gleb-skobinsky / RuCoref-inference
Russian coreference resolution made as simple and accessible as could be
☆11 · Sep 3, 2022 · Updated 3 years ago
daemyung / practice-triton
Triangles in practice! Triton
☆16 · Feb 15, 2024 · Updated 2 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 4 months ago
NeurAI-Lab / DoGo
This is the official repo for the CVPR 2021 L2ID paper "Distill on the Go: Online knowledge distillation in self-supervised learning"
☆12 · Nov 15, 2021 · Updated 4 years ago
double22a / asr_nlp_paper_code
Papers of ASR, Tools of ASR
☆41 · Feb 14, 2025 · Updated last year
metame-ai / faster-distil-whisper
Faster distil-whisper transcription with CTranslate2
☆14 · Jan 23, 2024 · Updated 2 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
sharpmonk / Neptune4Pro
Things to help
☆14 · Dec 11, 2024 · Updated last year
Ereboas / TacoLM
☆19 · May 2, 2024 · Updated 2 years ago
GitHubOfHyl97 / SkeAttnCLR
The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023
☆13 · Nov 9, 2023 · Updated 2 years ago
ewwink / wikipedia-wordlists-extractor
Extract Unique Word Lists From Wikipedia Database
☆13 · May 27, 2020 · Updated 6 years ago
nicoboss / KickassCopy
☆18 · Aug 26, 2019 · Updated 6 years ago
XL2248 / SOV-MAS
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11 · May 16, 2023 · Updated 3 years ago
dayanavivolab / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10 · Feb 29, 2024 · Updated 2 years ago
ZihanZhaoSJTU / LibriSQA
☆39 · Aug 30, 2023 · Updated 2 years ago