masakhane-io / masakhane-posLinks
POS for African languages
☆19Updated 6 months ago
Alternatives and similar repositories for masakhane-pos
Users that are interested in masakhane-pos are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- MAFAND-MT☆60Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 3 months ago
- Crosslingual Question Answering for African Languages☆30Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- ☆12Updated last year
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆46Updated 2 years ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆74Updated 2 months ago
- ☆116Updated 2 months ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆35Updated 3 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆132Updated 5 months ago
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆12Updated 3 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated 3 months ago
- ☆53Updated 10 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- ☆125Updated last year
- ☆10Updated last year
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆16Updated last year
- NLP Examples using the 🤗 libraries☆40Updated 4 years ago
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Finding of EMNLP 2022).☆22Updated 2 years ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated 2 years ago
- ☆32Updated 3 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- COMET for African languages☆10Updated 11 months ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 3 years ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 9 months ago
- ☆24Updated 2 years ago