ahmedheakl / arazn-llmLinks
Code-Switched translations with Large Language models
☆21Updated 7 months ago
Alternatives and similar repositories for arazn-llm
Users that are interested in arazn-llm are comparing it to the libraries listed below
Sorting:
- The official implementation of CATT Arabic diacritization models.☆46Updated last month
- ☆49Updated last week
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.☆16Updated 10 months ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated 11 months ago
- ☆42Updated 2 years ago
- Several deep learning models for restoring Arabic diacritics using Pytorch.☆35Updated 3 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated 11 months ago
- ☆124Updated last year
- A comprehensive list of Arabic NLP resources.☆33Updated last month
- Code for Arabic Nougat☆42Updated 7 months ago
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆42Updated last year
- TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format☆25Updated 3 months ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆16Updated 5 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆59Updated last month
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆13Updated 3 years ago
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch☆14Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models☆32Updated 4 months ago
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆40Updated 3 months ago
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- Neural Arabic text diacritization☆92Updated 2 years ago
- This repository contains code for fine-tuning the Whisper speech-to-text model.☆12Updated this week
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- TTS models for Arabic (Tacotron2, FastPitch)☆119Updated 8 months ago
- Finetune VITS and MMS using HuggingFace's tools☆159Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆115Updated last month
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆25Updated 7 months ago
- Benchmark Arabic text diacritization dataset☆75Updated 5 years ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆61Updated 8 years ago