PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition
☆18Apr 25, 2021Updated 4 years ago
Alternatives and similar repositories for conformer
Users that are interested in conformer are comparing it to the libraries listed below.
- PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.☆29May 1, 2024Updated last year
- Room Impulse Response Generator☆19May 20, 2018Updated 7 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- A curated collection of prompts for Grok Imagine by xAI☆26Oct 19, 2025Updated 5 months ago
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆17Nov 22, 2020Updated 5 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 11 months ago
- The repository for "Sensing-aided CSI Feedback with Deep Learning for Massive MIMO Systems", which has been submitted to IEEE for possibl…☆13Aug 23, 2024Updated last year
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Script to download corpora from the Linguistic Data Consortium (LDC)☆34Aug 6, 2024Updated last year
- ☆11Sep 25, 2022Updated 3 years ago
- A PyTorch implementation of Graph Neural Networks-Based User Pairing in Wireless Communication Systems☆12Sep 16, 2024Updated last year
- Robust Principal Component Analysis☆10Apr 1, 2014Updated 12 years ago
- Patch-Diffusion Code (AAAI2022)☆13Mar 3, 2022Updated 4 years ago
- The code used to create the ARCA23K and ARCA23K-FSD datasets☆15Nov 9, 2021Updated 4 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- ☆12Dec 14, 2024Updated last year
- ☆27Apr 18, 2019Updated 6 years ago
- ☆23Jul 4, 2020Updated 5 years ago
- atexit replacement that supports multiprocessing☆15Oct 22, 2019Updated 6 years ago
- Visually-Aware Audio Captioning☆43Mar 3, 2023Updated 3 years ago
- Image-source method for room acoustics☆14Feb 5, 2020Updated 6 years ago
- ☆11Nov 1, 2023Updated 2 years ago
- "Make-A-Video", new SOTA text to video by Meta-FAIR - Tensorflow☆14Oct 22, 2022Updated 3 years ago
- The official implementation of the paper "Free Fine-tuning: A Plug-and-Play Watermarking Scheme for Deep Neural Networks".☆19Apr 19, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Variational Auto-Encoder implementation in Tensorflow☆10Jan 22, 2017Updated 9 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- ☆12May 12, 2021Updated 4 years ago
- ☆11Sep 1, 2024Updated last year
- Synthesizes a room impulse response using a ray tracing simulation engine.☆13Mar 22, 2017Updated 9 years ago
- ☆48Jan 8, 2021Updated 5 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- Implementation of Learning Bandwidth Expansion Using Perceptually-Motivated Loss (ICASSP 2019)☆11May 18, 2022Updated 3 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆18Sep 13, 2024Updated last year
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- ☆31Aug 9, 2022Updated 3 years ago