castorini / afriberta
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
☆74 Updated 3 years ago
Alternatives and similar repositories for afriberta
Users who are interested in afriberta are comparing it to the repositories listed below
- MAFAND-MT ☆55 Updated 10 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆105 Updated last year
- Crosslingual Question Answering for African Languages ☆30 Updated 8 months ago
- ☆110 Updated last year
- MasakhaNEWS: News Topic Classification for African Languages ☆23 Updated last year
- ☆17 Updated 2 years ago
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 Updated 2 years ago
- COMET for African languages ☆10 Updated 4 months ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text … ☆13 Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆81 Updated 8 months ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ☆31 Updated 2 months ago
- ☆43 Updated 2 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases. ☆29 Updated 3 years ago
- This is a neural spell checker ☆65 Updated 2 years ago
- A Python package to augment text data using NLP. ☆39 Updated 3 months ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆87 Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆79 Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/ ☆48 Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 3 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently. ☆36 Updated 2 years ago
- A simple semi-supervised approach for creating Hugging Face data script loaders and uploading them to the hub. ☆11 Updated 11 months ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H… ☆32 Updated last year
- Some notebooks for NLP ☆204 Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆36 Updated 3 years ago
- Using short models to classify long texts ☆21 Updated 2 years ago
- Hinglish Text Classification ☆30 Updated last year