IMLHF/SpecAugmentPyTorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IMLHF/SpecAugmentPyTorch)

IMLHF / SpecAugmentPyTorch

A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

☆11

Alternatives and similar repositories for SpecAugmentPyTorch

Users that are interested in SpecAugmentPyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FighterNan / Image-Stitching-Three-Methods
View on GitHub
In this project, based on the idea of feature points matching, I used three methods to finish the image stitching assignment, which conta…
☆10Mar 31, 2017Updated 9 years ago
arxrean / LipRead-seq2seq
View on GitHub
An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.
☆10May 13, 2020Updated 6 years ago
MohammadYehya / Quantum_Edge_Detection
View on GitHub
Quantum Edge Detection using different Quantum Encoding strategies which include Flexible Representation of Quantum Images (FRQI), Novel …
☆16Feb 15, 2025Updated last year
ChristofHenkel / kaggle-birdclef24-3rd-place-solution-dieter
View on GitHub
☆13Jun 24, 2024Updated 2 years ago
ExileSaber / Industry-Keypoint-Detection
View on GitHub
基于自己标注的工业图像的关键点检测，每张图片标注了4个关键点，采用的U-net网络
☆12Jul 29, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
xiuyu0000 / new_papers_codes
View on GitHub
☆16Sep 24, 2022Updated 3 years ago
DataXujing / TTS-paper
View on GitHub
🔥 语音合成（TTS）,语音克隆教程: https://dataxujing.github.io/TTS-paper/#/
☆11Oct 29, 2024Updated last year
joonkeekim / Instructive-Decoding
View on GitHub
Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…
☆21Mar 7, 2024Updated 2 years ago
rahul-t-p / ASVspoof-2019
View on GitHub
☆10Oct 25, 2019Updated 6 years ago
ChenKaiXuSan / WGAN-GP-PyTorch
View on GitHub
Pytorch implementation of WGAN with gradient penalty (WGAN-GP),
☆12Feb 7, 2022Updated 4 years ago
zdyshine / beat_track_mgtv_baseline
View on GitHub
☆16Jul 20, 2021Updated 5 years ago
ychennay / attention-facial-recognition
View on GitHub
Scripts that utilize class activation maps and self-attention layers within Keras models to classify faces from FEI Faces Dataset
☆11Oct 12, 2021Updated 4 years ago
fangfm / lcnn
View on GitHub
A TensorFlow implementation of light convolutional neural network (LCNN)
☆12Dec 27, 2018Updated 7 years ago
WisdomShell / FreeEval
View on GitHub
☆19Aug 3, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ohmydroid / AugShuffleNet-Plus
View on GitHub
Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly
☆10Feb 17, 2025Updated last year
alanjeffares / TANGOS
View on GitHub
Implementation of Tabular Neural Gradient Orthogonalization and Specialization (TANGOS). A regularizer for neural networks described in o…
☆19Feb 23, 2023Updated 3 years ago
mbzuai-nlp / sttatts
View on GitHub
☆31Oct 29, 2024Updated last year
halsay / MFCC_tutorial
View on GitHub
MFCC implementation with detailed comments.
☆17Nov 26, 2020Updated 5 years ago
ZhouYuxuanYX / SP-ViT-Learning-2D-Spatial-Priors-for-Vision-Transformers
View on GitHub
This is the official implementation of our BMVC 2022 paper "SP-ViT: Learning 2D Spatial Priors for Vision Transformers"
☆14Mar 27, 2023Updated 3 years ago
hustcc / PrimeTable
View on GitHub
Primes Table which Less then Ten million (10000000) 一千万以内的素数表（质数表）。
☆13May 9, 2016Updated 10 years ago
QQQYang / QNN-Baseline
View on GitHub
provide benchmarks for multiple QNNs
☆11Nov 5, 2023Updated 2 years ago
WillBrennan / MotionDetector
View on GitHub
a motion detector for video; written with OpenCV
☆12Nov 3, 2022Updated 3 years ago
AssemblyAI / kaldi-asr-tutorial
View on GitHub
Repo for hosting tutorial code associated with the Kaldi Speech Recognition for Beginners - A Simple Tutorial blog by AssemblyAI
☆13May 20, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
somaonishi / MTR
View on GitHub
The official implementation of the paper "Rethinking Data Augmentation for Tabular Data in Deep Learning"
☆14Oct 8, 2023Updated 2 years ago
tstafylakis / Speaker-Embeddings-Correlation-Pooling
View on GitHub
Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"
☆11Sep 20, 2021Updated 4 years ago
AmandineBtto / Batvision-Dataset
View on GitHub
A large-scale real-world audio-visual dataset for research on 3D scene understanding and echolocation.
☆22Oct 21, 2025Updated 9 months ago
Jindi0 / SQNN
View on GitHub
Scalable Quantum Neural Network builds and trains a large-scale QNN in a modular fashion. SQNN is evaluated with a binary classification …
☆12Oct 4, 2023Updated 2 years ago
dylanliu2 / BirdCLEF2025-4th-place-solution
View on GitHub
☆15Jun 11, 2025Updated last year
frednam93 / FilterAugSED
View on GitHub
☆68Sep 13, 2024Updated last year
bioidiap / bob.bio.spear
View on GitHub
Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear
☆19Jun 24, 2023Updated 3 years ago
epang-ucas / Evaluate_LLMs_to_Genes
View on GitHub
☆19May 25, 2024Updated 2 years ago
izlandman / iVector
View on GitHub
introduction to iVectors with available speech data
☆11Mar 4, 2016Updated 10 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JiJiJiang / ASV-Anti-Spoofing-DADA
View on GitHub
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
☆19Updated this week
dulucas / Material_Segmentation_PyTorch
View on GitHub
Unofficial Re-Implementation of paper "Material Recognition in the Wild with the Materials in Context Database"(CVPR15)
☆13Dec 8, 2020Updated 5 years ago
CjangCjengh / japanese_g2p
View on GitHub
A Japanese G2P tool based on pyopenjtalk
☆25Aug 6, 2022Updated 3 years ago
vincenzodentamaro / aucoresnet
View on GitHub
AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath
☆13Mar 18, 2022Updated 4 years ago
kevinz8866 / MobileFormer
View on GitHub
Code and models for mobile-former
☆11Jan 10, 2023Updated 3 years ago
tcmyxc / FocalLoss
View on GitHub
分类任务的 Focal Loss，PyTorch 实现
☆10Jun 13, 2023Updated 3 years ago
matibilkis / qvans
View on GitHub
A semi-agnostic ansatz with variable structure for variational quantum algorithms. Published in Quantum Machine Intelligence (2023). Opti…
☆12Jan 4, 2026Updated 6 months ago