leobitz / amharic_word_embeddings
☆12Updated 3 years ago
Alternatives and similar repositories for amharic_word_embeddings
Users who are interested in amharic_word_embeddings are comparing it to the repositories listed below
- Different semantic models for Amharic☆21Updated last year
- All our community docs! Start here! Let's put Africa on the NLP Map☆60Updated last year
- An Amharic News Text classification Dataset☆38Updated last year
- Notebooks to fine-tune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆10Updated last year
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆100Updated 3 months ago
- AmQA - The first Amharic Open Domain Question Answering Dataset☆12Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- SemEval 2024 Task 1 : Textual Semantic Relatedness☆25Updated last year
- Lexical Data of Ge'ez Languages☆54Updated 2 years ago
- TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT wa…☆118Updated 2 years ago
- Scripts to finetune the official implementation of OpenAI's Whisper model☆22Updated 2 weeks ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated 11 months ago
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model☆58Updated 4 months ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆170Updated 2 years ago
- Infographic about the inner computations of a transformer model, training and inference☆86Updated last year
- Open source speech to text models for Indic Languages☆306Updated 2 years ago
- Arabic cleaning, normalization and segmentation library.☆70Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated last year
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆469Updated 3 months ago
- A collaborative catalog of NLP resources for Indic languages☆602Updated 7 months ago
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆174Updated last month
- Arabic Tokenization Library. It provides many tokenization algorithms.☆107Updated last year
- Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.☆31Updated 2 years ago
- A Python implementation of Farasa toolkit☆132Updated last month
- A curated list of NLP Resources for the Nepali Language☆25Updated 2 years ago
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆11Updated 2 years ago