Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling
☆22Aug 4, 2024Updated last year
Alternatives and similar repositories for aranizer
Users that are interested in aranizer are comparing it to the libraries listed below.
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- A comprehensive list of Arabic NLP resources.☆46Sep 7, 2025Updated 8 months ago
- ☆56Jul 21, 2024Updated last year
- Intuitive graphical representation of source code☆14Mar 15, 2023Updated 3 years ago
- Scripts to finetune the official implementation of OpenAI's Whisper model☆25Apr 14, 2026Updated last month
- End-to-End Arabic ASR using DeepSpeech engine☆14Nov 2, 2021Updated 4 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆117Sep 2, 2021Updated 4 years ago
- SQL Tutorials using Jupyter Notebook☆17Apr 9, 2023Updated 3 years ago
- ☆11May 11, 2024Updated 2 years ago
- ☆40Feb 1, 2025Updated last year
- ☆10Feb 2, 2024Updated 2 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- Sakhi, a mobile-first app tailored for women, encompasses daily journals, safety features, community, and holistic health tools. Elevate …☆12Mar 7, 2024Updated 2 years ago
- ☆12Jun 6, 2020Updated 5 years ago
- Code for the ACL conference paper "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering"☆17Apr 22, 2021Updated 5 years ago
- A Question Generation Application leveraging RAG and Weaviate vector store to be able to retrieve relative contexts and generate a more u…☆17Feb 3, 2025Updated last year
- Python implementation of PayNow QR Code Generator☆19Dec 2, 2022Updated 3 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 5 years ago
- The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egy…☆18Mar 9, 2026Updated 2 months ago
- A neural and statistical engine for accurately adding diacritics (Tashkeel) to Arabic text. First-place winner on Kaggle 🥇☆18May 29, 2025Updated 11 months ago
- The system enables sophisticated coordination of multiple drones through natural language commands, visual inputs, and real-time environm…☆17Dec 15, 2025Updated 5 months ago
- ☆10Sep 19, 2022Updated 3 years ago
- ☆20May 25, 2024Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆20Nov 11, 2024Updated last year
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
- An app that extends the bluetooth comms☆11May 31, 2023Updated 2 years ago
- ☆128Mar 3, 2024Updated 2 years ago
- This is the RobEn AI's team home made Discord bot. Custom made for the AI team Discord server, to serve.☆16May 26, 2021Updated 5 years ago
- Replace Arabic numbers with English ones.☆17Feb 3, 2016Updated 10 years ago
- ☆11Apr 26, 2023Updated 3 years ago
- Arabic-language questions focused on Saudi culture, tested on a number of large language models (LLMs)☆18Jan 22, 2025Updated last year
- Using reinforcement learning to minimize fuel consumption when landing a rover on Mars☆12Mar 21, 2022Updated 4 years ago
- [Lab] lab website☆11May 18, 2026Updated last week
- Implementation of DVH prediction from the (contoured) anatomical scans ...☆11Jun 20, 2016Updated 9 years ago
- Personal coach to help you obtain desired AI decisions!☆20Oct 3, 2023Updated 2 years ago
- Arabic speech recognition, classification and text-to-speech.☆428Sep 30, 2023Updated 2 years ago
- Implementation of our paper in EMNLP 2022, focused on the relationship between parent and child in transfer learning for low-resourc…☆17Dec 7, 2022Updated 3 years ago
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding…☆54Mar 13, 2025Updated last year