Natural-Language-Processing-Elm / open_universal_arabic_asr_leaderboard
☆59 Updated 2 months ago
Alternatives and similar repositories for open_universal_arabic_asr_leaderboard
Users who are interested in open_universal_arabic_asr_leaderboard compare it to the libraries listed below.
- The official implementation of CATT Arabic diacritization models. ☆58 Updated 6 months ago
- ☆63 Updated 6 months ago
- 🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python package for offline speech synthesis 🚀📦 ☆36 Updated last month
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul… ☆17 Updated last year
- Code-Switched translations with Large Language models ☆24 Updated last year
- ViStreamASR - Real-Time Vietnamese Speech Recognition ☆52 Updated 6 months ago
- [ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding ☆60 Updated 7 months ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion ☆13 Updated 10 months ago
- Official Repository of the Deep Diacritization Paper ☆17 Updated 5 years ago
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec. ☆143 Updated 3 months ago
- Several deep learning models for restoring Arabic diacritics using Pytorch. ☆36 Updated 3 years ago
- ☆245 Updated last month
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ☆228 Updated 8 months ago
- A simple system for 2-way interruptible voice interactions between human and LLM ☆30 Updated last year
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency ☆181 Updated 2 months ago
- EraX Text to Speech based on F5-TTS Base V1 ☆79 Updated 8 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆20 Updated 8 months ago
- Open TTS models, built for streaming on the edge ☆44 Updated 10 months ago
- ☆13 Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling ☆21 Updated last year
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch ☆14 Updated 2 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions. ☆43 Updated 9 months ago
- Multilingual Speech Recognition for Indonesian Languages ☆69 Updated 3 years ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification. ☆59 Updated last year
- This repo contains 3 hours of audio speech recordings in Yoruba language collected for research purposes. ☆18 Updated 5 years ago
- ☆37 Updated 11 months ago
- ☆86 Updated last year
- ☆158 Updated last month
- Audio tokenization, in the fastest way possible! ☆53 Updated last year
- ☆16 Updated 8 months ago