☆43Sep 3, 2025Updated 6 months ago
Alternatives and similar repositories for VietASR
Users that are interested in VietASR are comparing it to the libraries listed below
Sorting:
- Add n-gram and large language model (LLM) support to Whisper models.☆41May 6, 2025Updated 9 months ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆33Feb 23, 2026Updated last week
- Official implementation of paper: "SwinTExCo: Exemplar-based Video Colorization using Swin Transformer"☆13Oct 6, 2024Updated last year
- Kokoro Language Model Training Script for Russian (Ruslan Corpus)☆37Updated this week
- ☆14Jul 24, 2025Updated 7 months ago
- ☆25Jun 18, 2025Updated 8 months ago
- Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach. This repository includes the implementation of…☆17Jun 1, 2024Updated last year
- Code for DeSTA2.5-Audio, general-purpose LALM☆128Feb 4, 2026Updated last month
- Memory Agent monorepo☆83Oct 9, 2025Updated 4 months ago
- Universal text classifier for generative models☆24Jul 25, 2024Updated last year
- ☆139Apr 23, 2025Updated 10 months ago
- ☆41May 27, 2025Updated 9 months ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.☆37Apr 7, 2025Updated 10 months ago
- ☆114Oct 21, 2025Updated 4 months ago
- ☆60Jan 12, 2026Updated last month
- A Text2SQL benchmark for evaluation of Large Language Models☆41Feb 24, 2026Updated last week
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆50Jan 5, 2026Updated last month
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- The official implementation of the DIFFA series for dLLM-based large audio language model☆59Feb 2, 2026Updated last month
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- ☆11Apr 25, 2025Updated 10 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆52May 22, 2025Updated 9 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆42Feb 13, 2025Updated last year
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation☆72Nov 11, 2025Updated 3 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆11Jun 22, 2025Updated 8 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆29Oct 23, 2025Updated 4 months ago
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆42Updated this week
- End to End Speech to Speech with Emotion System☆15Feb 6, 2025Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Feb 9, 2026Updated 3 weeks ago
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Sampling☆51Jul 15, 2025Updated 7 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- ☆16Sep 17, 2024Updated last year
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Updated this week
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago