leobitz / amharic_word_embeddingsLinks
☆12Updated 3 years ago
Alternatives and similar repositories for amharic_word_embeddings
Users that are interested in amharic_word_embeddings are comparing it to the libraries listed below
Sorting:
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆103Updated 5 months ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆60Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆130Updated last year
- Open source speech to text models for Indic Languages☆306Updated 3 years ago
- State of the Art Language models and Classifier for Tamil language (spoken in India, and few other South Asian countries)☆53Updated 5 years ago
- Awesome List of Tamil NLP & AI Resources☆113Updated 2 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆60Updated 11 months ago
- Infographic about the inner computations of a transformer model, training and inference☆86Updated last year
- ☆111Updated last year
- ☆17Updated 2 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- SemEval 2024 Task 1 : Textual Semantic Relatedness☆26Updated last year
- An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way p…☆300Updated last year
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated last year
- MAFAND-MT☆58Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- ☆32Updated last year
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆290Updated 2 years ago
- A collaborative catalog of NLP resources for Indic languages☆615Updated 9 months ago
- Transliteration models for 21 Indic languages☆98Updated last year
- Translation models for 22 scheduled languages of India☆370Updated 4 months ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆200Updated 5 years ago
- A Continually LoRA PreTrained and FineTuned 7B Llama-2 Indic model for Malayalam Language.☆60Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆111Updated last year
- Language Identification for Indian languages☆23Updated last year
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆36Updated last year
- A collection of paper implementations using the PyTorch framework☆28Updated 4 years ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆89Updated 8 months ago
- Edge Inference in Browser with Transformer NLP model☆314Updated 3 years ago
- A curated list of NLP Resources for the Nepali Language☆25Updated 2 years ago