A PyTorch implementation (with batch and channel support) of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆12 · Jul 24, 2024 · Updated last year
Alternatives and similar repositories for SpecAugmentPyTorch
Users who are interested in SpecAugmentPyTorch are comparing it to the libraries listed below:
- An unofficial PyTorch implementation of the paper Deep Lip Reading: A comparison of models and an online application. ☆10 · May 13, 2020 · Updated 5 years ago
- ☆31 · Oct 29, 2024 · Updated last year
- ☆31 · Aug 16, 2021 · Updated 4 years ago
- In this project, based on the idea of feature point matching, I used three methods to finish the image stitching assignment, which conta… ☆10 · Mar 31, 2017 · Updated 8 years ago
- Provides benchmarks for multiple QNNs. ☆11 · Nov 5, 2023 · Updated 2 years ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field. ☆12 · Sep 23, 2025 · Updated 5 months ago
- Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024) ☆11 · Nov 29, 2024 · Updated last year
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations" ☆11 · Sep 20, 2021 · Updated 4 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation” ☆15 · Jan 3, 2025 · Updated last year
- Modified version of the open source SDR LTE software suite from Software Radio Systems (SRS), allowing it to save certain channel cha… ☆10 · Aug 13, 2020 · Updated 5 years ago
- ☆42 · Oct 23, 2023 · Updated 2 years ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly ☆11 · Feb 17, 2025 · Updated last year
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025) ☆13 · Sep 6, 2024 · Updated last year
- A semi-agnostic ansatz with variable structure for variational quantum algorithms. Published in Quantum Machine Intelligence (2023). Opti… ☆12 · Jan 4, 2026 · Updated last month
- ☆10 · Feb 24, 2022 · Updated 4 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch ☆24 · Mar 23, 2021 · Updated 4 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 3 months ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking ☆14 · Jan 6, 2018 · Updated 8 years ago
- 🎉 TrustJudge is accepted to ICLR 2026! ☆38 · Sep 27, 2025 · Updated 5 months ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3. ☆12 · Aug 18, 2018 · Updated 7 years ago
- ☆10 · Jul 13, 2022 · Updated 3 years ago
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition… ☆10 · Mar 8, 2022 · Updated 3 years ago
- ☆13 · Dec 1, 2025 · Updated 3 months ago
- ☆11 · May 9, 2023 · Updated 2 years ago
- This repo hosts the code and model of MAViL. ☆45 · Jul 24, 2023 · Updated 2 years ago
- Personalized Image Generation with Large Multimodal Models ☆14 · May 13, 2025 · Updated 9 months ago
- 🔥 Speech synthesis (TTS) and voice cloning tutorial: https://dataxujing.github.io/TTS-paper/#/ ☆11 · Oct 29, 2024 · Updated last year
- ☆14 · Jan 5, 2022 · Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 4 years ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline. ☆17 · May 9, 2025 · Updated 9 months ago
- ☆11 · Jun 24, 2024 · Updated last year
- PyTorch implementation of WGAN with gradient penalty (WGAN-GP). ☆12 · Feb 7, 2022 · Updated 4 years ago
- ☆11 · Sep 1, 2024 · Updated last year
- ICASSP 2023 SPGC Challenge: Multilingual Alzheimer's Dementia Recognition through Spontaneous Speech ☆11 · Jun 4, 2023 · Updated 2 years ago
- [ICML 2023] QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark ☆11 · Mar 15, 2024 · Updated last year
- Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡ ☆11 · Jan 23, 2025 · Updated last year
- Scalable Quantum Neural Network builds and trains a large-scale QNN in a modular fashion. SQNN is evaluated with a binary classification … ☆12 · Oct 4, 2023 · Updated 2 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated last year
- ☆15 · Sep 24, 2022 · Updated 3 years ago