PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition
☆18Apr 25, 2021Updated 5 years ago
Alternatives and similar repositories for conformer
Users that are interested in conformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆29May 1, 2024Updated 2 years ago
- Official codes for "DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision"☆13Nov 29, 2023Updated 2 years ago
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆17Nov 22, 2020Updated 5 years ago
- Code implementation of LFI-CAM with PyTorch☆10Jun 9, 2021Updated 5 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆220Jun 22, 2023Updated 2 years ago
- Script to download corpora from the Linguistic Data Consortium (LDC)☆34Aug 6, 2024Updated last year
- Robust Principal Component Analysis☆10Apr 1, 2014Updated 12 years ago
- Patch-Diffusion Code (AAAI2022)☆13Mar 3, 2022Updated 4 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers☆438May 17, 2023Updated 3 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- ☆12Dec 14, 2024Updated last year
- atexit replacement that supports multiprocessing☆15Oct 22, 2019Updated 6 years ago
- ☆27Apr 18, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆23Jul 4, 2020Updated 5 years ago
- Image-source method for room acoustics☆15Feb 5, 2020Updated 6 years ago
- Visually-Aware Audio Captioning☆43Mar 3, 2023Updated 3 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆16Nov 9, 2021Updated 4 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Conformer: Convolution-augmented Transformer for Speech Recognition☆15Sep 4, 2025Updated 9 months ago
- Variational Auto-Encoder implementation in Tensorflow☆10Jan 22, 2017Updated 9 years ago
- 数据挖掘(实战代码/欢迎讨论/大量注释/机器学习). 你将习得,如:数据的处理、LightGBM、GridSearchCV寻找最优参、StratifiedKFold分层5折切分、画AUC图、输出预测名单等。☆19Feb 16, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- ☆12May 12, 2021Updated 5 years ago
- Synthesizes a room impulse response using a ray tracing simulation engine.☆13Mar 22, 2017Updated 9 years ago
- ☆48Jan 8, 2021Updated 5 years ago
- Implementation of Learning Bandwidth Expansion Using Perceptually-Motivated Loss (ICASSP 2019)☆11May 18, 2022Updated 4 years ago
- This project is used in pytorch framework to implement high precision semantic segmentation of RGB images.☆17Nov 7, 2023Updated 2 years ago
- Source data, scripts and makefiles of the experiment for the Speex codec quality evaluation☆23Aug 29, 2011Updated 14 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- ☆31Aug 9, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 6 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- Examples for TensorFlow Weight Normalization☆14Apr 19, 2019Updated 7 years ago
- "Recipes" for doing various setup and config tasks☆12Nov 15, 2017Updated 8 years ago
- ☆30Jul 21, 2022Updated 3 years ago
- multichannel linear filters based on mask estimation neural networks for CHiME4☆38May 14, 2018Updated 8 years ago
- ☆10Jul 12, 2019Updated 6 years ago