A simple semi-supervised approach for creating Hugging Face dataset loading scripts and uploading them to the Hub.
☆11Jun 23, 2024Updated last year
Alternatives and similar repositories for dar
Users who are interested in dar are comparing it to the libraries listed below:
- COMET for African languages☆10Jan 24, 2025Updated last year
- ☆20May 25, 2024Updated last year
- A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple desig…☆22Jan 27, 2024Updated 2 years ago
- ☆24Jun 21, 2024Updated last year
- Library for pruning experts per language pair in NLLB-200☆34Jul 7, 2023Updated 2 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 7 months ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆42Oct 13, 2022Updated 3 years ago
- مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (AN…☆85Updated this week
- One of the problems faced concerning Arabic fake news detection is the scarcity of Arabic datasets. We believe it is important to availab…☆10Jun 13, 2022Updated 3 years ago
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles☆11Dec 13, 2018Updated 7 years ago
- ☆40Dec 25, 2022Updated 3 years ago
- Transliteration for languages and dialects☆44Jun 21, 2022Updated 3 years ago
- All code and content for my blog.☆15Sep 23, 2018Updated 7 years ago
- Named Entity Recognition in Nepali Language☆10Jan 12, 2023Updated 3 years ago
- ChatGPT app template using Pomerium, OpenAI Apps SDK and Model Context Protocol (MCP), with a Node.js server and React widgets.☆14Updated this week
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆35Oct 16, 2025Updated 4 months ago
- Time Series Explanation in Arabic | شرح السلاسل الزمنية باللغة العربية☆13May 24, 2022Updated 3 years ago
- Transformer Implementation for NMT using PyTorch Lightning (Korean to English)☆10Oct 19, 2020Updated 5 years ago
- A tiny wrapper for Arabic WordCloud plots☆10May 24, 2020Updated 5 years ago
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- ☆10Feb 2, 2024Updated 2 years ago
- ☆17Dec 2, 2025Updated 2 months ago
- An effort to benchmark Arabic legal reasoning in foundation models.☆17May 21, 2025Updated 9 months ago
- A Python client for Belqis System☆43Feb 28, 2023Updated 3 years ago
- Customs Import Declaration Datasets☆10Feb 6, 2026Updated 3 weeks ago
- ☆31Dec 7, 2025Updated 2 months ago
- A simple library to display images in Jupyter notebooks☆15Dec 31, 2018Updated 7 years ago
- ☆11Apr 2, 2024Updated last year
- A realtime JavaScript boilerplate based on the Meteor.js framework☆13May 25, 2015Updated 10 years ago
- Code and data for "Heterogeneous Supervised Topic Models"☆11Jun 27, 2022Updated 3 years ago
- Demo of knowledge graph creation and Graph RAG with Dspy and Kuzu☆22Jun 30, 2025Updated 8 months ago
- Repo & Project for the Imminent Research Grant code & tasks☆12May 20, 2024Updated last year
- Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)☆10Jun 18, 2019Updated 6 years ago
- A Python package for testing whether a dataset of numbers follows Benford's law☆13Jan 20, 2021Updated 5 years ago
- The Open Multilingual Wordnet Project Page☆14May 29, 2023Updated 2 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆110Jan 4, 2024Updated 2 years ago
- Experiments for XLM-V Transformers Integration☆13Feb 8, 2023Updated 3 years ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- Business website☆10Feb 15, 2022Updated 4 years ago