MohamedAliRashad / youtube-audio-collectorLinks
Simple script to collect code switching audio and captions data
☆8Updated 11 months ago
Alternatives and similar repositories for youtube-audio-collector
Users that are interested in youtube-audio-collector are comparing it to the libraries listed below
Sorting:
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆25Updated 6 months ago
- Code-Switched translations with Large Language models☆21Updated 6 months ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated 10 months ago
- The official implementation of CATT Arabic diacritization models.☆46Updated 3 weeks ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 10 months ago
- ☆44Updated 2 weeks ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆40Updated 2 months ago
- ☆44Updated 11 months ago
- Comprehensive Machine Learning Roadmap☆23Updated 8 months ago
- ☆124Updated last year
- ☆42Updated 2 years ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 8 months ago
- Code for Arabic Nougat☆42Updated 6 months ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆13Updated 3 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated 11 months ago
- Composition of Multimodal Language Models From Scratch☆14Updated 10 months ago
- ☆39Updated last month
- ☆10Updated last year
- 🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.☆27Updated 4 months ago
- ☆26Updated 4 months ago
- This repo is for semantic search app to search over Quran tafsir books☆24Updated 11 months ago
- A streaming whisper server for on-prem transcription☆20Updated 10 months ago
- This repository contains all the work I have done (and I'm doing) in developing a web app for speech-to-text, based on OpenAI Whisper☆9Updated 2 years ago
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- Making of cuda kernel☆16Updated last month
- Dynamic batching for Document Layout and OCR, suitable for RAG, with extra tools.☆11Updated 7 months ago
- Eye exploration☆28Updated 4 months ago
- أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs☆16Updated 5 months ago
- In this tutorial, you will perform inference across 10 well-known pre-trained object detectors and fine-tune on a custom dataset. Design …☆70Updated 3 years ago