MohamedAliRashad / youtube-audio-collector

Simple script to collect code-switching audio and caption data

☆8

Alternatives and similar repositories for youtube-audio-collector

Users who are interested in youtube-audio-collector are comparing it to the libraries listed below.

UBC-NLP / peacock
This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.
☆25 Updated 6 months ago
ahmedheakl / arazn-llm
Code-Switched translations with Large Language models
☆21 Updated 6 months ago
ARBML / Taqyim
Python interface for evaluation on ChatGPT models
☆19 Updated last year
riotu-lab / aranizer
Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling
☆20 Updated 10 months ago
abjadai / catt
The official implementation of CATT Arabic diacritization models.
☆46 Updated 3 weeks ago
sanchit-gandhi / notebooks
A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).
☆45 Updated 10 months ago
mbzuai-nlp / ArTST
☆44 Updated 2 weeks ago
ARBML / CIDAR
Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.
☆40 Updated 2 months ago
h9-tect / Arabic_nlp_preprocessing
☆44 Updated 11 months ago
Mohammed-Majid / ML_Roadmap
Comprehensive Machine Learning Roadmap
☆23 Updated 8 months ago
FreedomIntelligence / AceGPT
☆124 Updated last year
ARBML / whisperar
☆42 Updated 2 years ago
ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38 Updated 8 months ago
MohamedAliRashad / arabic-nougat
Code for Arabic Nougat
☆42 Updated 6 months ago
OmarMohammed88 / AR-Emotion-Recognition
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…
☆13 Updated 3 years ago
msalhab96 / AraSpell
A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs
☆22 Updated 11 months ago
alexander-moore / vlm
Composition of Multimodal Language Models From Scratch
☆14 Updated 10 months ago
hkproj / multi-latent-attention
☆39 Updated last month
sanchit-gandhi / codesnippets
☆10 Updated last year
KevKibe / African-Whisper
🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.
☆27 Updated 4 months ago
joejoe03 / Egyptian-Text-To-Speech
☆26 Updated 4 months ago
misbahsy / tafsir_semantic_search
This repo is for a semantic search app that searches over Quran tafsir books
☆24 Updated 11 months ago
voxos-ai / streaming-whisper-server
A streaming whisper server for on-prem transcription
☆20 Updated 10 months ago
luigisaetta / whisper-app
This repository contains all the work I have done (and I'm doing) in developing a web app for speech-to-text, based on OpenAI Whisper
☆9 Updated 2 years ago
BKHMSI / deep-diacritization
Official Repository of the Deep Diacritization Paper
☆16 Updated 4 years ago
ThinamXx / cuda-mode
Making of cuda kernel
☆16 Updated last month
mesolitica / dynamic-batch-RAG-pipeline
Dynamic batching for Document Layout and OCR, suitable for RAG, with extra tools.
☆11 Updated 7 months ago
VK-Ant / ComputerVision-Upgrade-Project
Eye exploration
☆28 Updated 4 months ago
mznmel / Pico-Saudi-LLMs-Benchmark
Arabic-language questions focused on Saudi culture, tested on a number of large language models (LLMs)
☆16 Updated 5 months ago
IbrahimSobh / Object-Detection
In this tutorial, you will perform inference across 10 well-known pre-trained object detectors and fine-tune on a custom dataset. Design …
☆70 Updated 3 years ago