riotu-lab / aranizerView external linksLinks
Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling
☆21Aug 4, 2024Updated last year
Alternatives and similar repositories for aranizer
Users that are interested in aranizer are comparing it to the libraries listed below
Sorting:
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- This code belongs to ACL conference paper entitled as "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering"☆17Apr 22, 2021Updated 4 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 4 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆113Sep 2, 2021Updated 4 years ago
- This is a repository of the Multi-dialect Arabic BERT model.☆38Jul 14, 2020Updated 5 years ago
- ☆55Jul 21, 2024Updated last year
- ☆25Jun 25, 2019Updated 6 years ago
- ☆128Mar 3, 2024Updated last year
- This is the second part of the Deep Learning Course for the Master in High-Performance Computing (SISSA/ICTP).)☆33Sep 15, 2020Updated 5 years ago
- ☆34Dec 14, 2023Updated 2 years ago
- Official code for PLoP☆17Jun 30, 2025Updated 7 months ago
- الذكاء الاصطناعي التوليدي باللغة العربية☆38Aug 7, 2024Updated last year
- ☆18Jun 25, 2025Updated 7 months ago
- ☆11Feb 26, 2024Updated last year
- ☆12Sep 27, 2024Updated last year
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI☆10Apr 13, 2023Updated 2 years ago
- Sakhi, a mobile-first app tailored for women, encompasses daily journals, safety features, community, and holistic health tools. Elevate …☆11Mar 7, 2024Updated last year
- ☆36Jul 16, 2021Updated 4 years ago
- a blog starter project☆11Oct 29, 2018Updated 7 years ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆40Aug 2, 2021Updated 4 years ago
- A comprehensive list of Arabic NLP resources.☆43Sep 7, 2025Updated 5 months ago
- News clustering algorithm. Implementation of the "Multilingual Clustering of Streaming News" paper submitted to EMNLP 2018☆38May 2, 2022Updated 3 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆164Aug 4, 2023Updated 2 years ago
- ☆11Jul 19, 2018Updated 7 years ago
- Autopilot for DJI Tello Drone using deep learning and image processing in python☆11Nov 3, 2019Updated 6 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- ☆12Mar 3, 2023Updated 2 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implement in Pytorch.☆12Oct 4, 2020Updated 5 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- Kaggle | 22nd place solution for RANZCR CLiP - Catheter and Line Position Challenge.☆11Apr 19, 2021Updated 4 years ago
- A free and online Python book in Persian☆13Apr 10, 2023Updated 2 years ago
- This Python script retrieves and analyzes scientific literature from PubMed related to specific genes, creates a word cloud visualization…☆11Nov 15, 2023Updated 2 years ago
- Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams.☆10Jan 27, 2025Updated last year
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- Using Pytorch to solve the famous titanic dataset problem, or in other words, killing a fly with a tank.☆11Apr 8, 2019Updated 6 years ago