Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup
☆13Mar 18, 2024Updated 2 years ago
Alternatives and similar repositories for data2vec-aqc
Users that are interested in data2vec-aqc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 4 years ago
- Materials of public talks given By SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated 2 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- ☆14Jun 17, 2024Updated last year
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Nov 29, 2022Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- Making Espnet easier to use☆54Apr 9, 2021Updated 4 years ago
- Zerospeech Challenge 2021: validation and evaluation software☆12Jun 13, 2022Updated 3 years ago
- ☆11Jun 20, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- ☆16Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Oct 11, 2022Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- Speeech Recognition for Indic languages.☆13Apr 3, 2021Updated 4 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Mar 16, 2023Updated 3 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Apr 5, 2022Updated 3 years ago
- ☆12Feb 5, 2023Updated 3 years ago
- ☆15May 14, 2025Updated 10 months ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- My runthrough of karpathy's lectures (with notes), building NN's from scratch, simple autoregressive language models, GPT models and lear…☆10Sep 11, 2023Updated 2 years ago
- Using AI based approach to detect illegal parking of vehicles (Cars) from an image. The model will receive an image of parked car through…☆11Jun 2, 2020Updated 5 years ago
- The Gaming Zone is a web application that provides you with a collection of classic retro games, including puzzle games, trivia games, bo…☆10Feb 11, 2020Updated 6 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- Papers of ASR, Tools of ASR☆41Feb 14, 2025Updated last year
- Faster distil-whisper transcription with CTranslate2☆14Jan 23, 2024Updated 2 years ago
- Expressive TTS Dataset for Assamese, Bengali, and Tamil.☆15Mar 6, 2025Updated last year
- Things to help☆14Dec 11, 2024Updated last year
- 삼각형의 실전! Triton☆16Feb 15, 2024Updated 2 years ago
- ☆18Aug 26, 2019Updated 6 years ago
- ☆10Oct 16, 2025Updated 5 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- ☆19May 2, 2024Updated last year
- The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023☆13Nov 9, 2023Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Extract Unique Word Lists From Wikipedia Database☆13May 27, 2020Updated 5 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆18Nov 19, 2025Updated 4 months ago