Elma-dev / TODaLinks
TODa: Tamazight Open Dataset
☆16Updated 6 months ago
Alternatives and similar repositories for TODa
Users that are interested in TODa are comparing it to the libraries listed below
Sorting:
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆88Updated last year
- A list of Moroccan Darija Datasets grouped by name, data source, region and size.☆53Updated last year
- ☆74Updated last year
- Awesome Darija Arabic NLP Resources☆16Updated 3 months ago
- ☆18Updated 3 weeks ago
- ☆43Updated 2 months ago
- darija <-> english dataset☆337Updated 4 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated 8 months ago
- Code for Arabic Nougat☆44Updated 8 months ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms.☆107Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- The Multilayer Perceptron Language Model☆558Updated last year
- A Hands on series on developing LLM applications☆65Updated 10 months ago
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf☆200Updated last year
- The Tensor (or Array)☆441Updated 11 months ago
- How to install CUDA & cuDNN for Machine Learning☆20Updated last year
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆45Updated 11 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆34Updated 2 weeks ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- ☆123Updated last year
- AI research lab🔬: implementations of AI papers and theoretical research: InstructGPT, llama, transformers, diffusion models, RLHF, etc..…☆19Updated 4 months ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆335Updated 7 months ago
- A repository to group reports & presentations of end of studies projects (PFE) of students in the software engineering field in Morocco �…☆30Updated last year
- NanoTorch is Deep Learning Library from scratch using Numpy and Math.☆21Updated last year
- GPU Kernels☆191Updated 3 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- Building GPT ...☆18Updated 8 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆107Updated 10 months ago
- ☆677Updated 3 months ago