[EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
☆153May 18, 2025Updated last year
Alternatives and similar repositories for LiteASR
Users that are interested in LiteASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Dec 4, 2025Updated 6 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- ☆12Nov 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated 2 years ago
- A Streaming-Native Serving Engine for TTS/STS Models☆68Updated this week
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated last year
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆25Oct 8, 2025Updated 8 months ago
- ☆19Mar 22, 2024Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated last year
- ☆23Jun 24, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆180Mar 18, 2024Updated 2 years ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆99Oct 8, 2025Updated 8 months ago
- ☆37Mar 26, 2024Updated 2 years ago
- ☆15Nov 11, 2024Updated last year
- Official Code for ParrotTTS☆58Oct 13, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 10 months ago
- ☆32Feb 4, 2025Updated last year
- Extract phoneme-level timestamps from speeh audio.☆142Updated this week
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆14Jun 16, 2023Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆94Jul 23, 2025Updated 10 months ago
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆78Oct 22, 2024Updated last year
- ☆18Jul 22, 2024Updated last year
- ☆13Dec 9, 2024Updated last year
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆26Oct 10, 2023Updated 2 years ago
- An agentic system for autonomously generating explainable and reproducible time-series anomaly detection rules using LLMs.☆33May 20, 2026Updated 3 weeks ago
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆196Sep 24, 2025Updated 8 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆38May 7, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆112Apr 1, 2024Updated 2 years ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆70Nov 1, 2024Updated last year
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆55Sep 25, 2023Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆34Apr 22, 2026Updated last month
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated 2 years ago
- ☆81Jan 22, 2025Updated last year
- ☆41May 12, 2026Updated 3 weeks ago