castorini / afribertaLinks
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
☆74Updated 3 years ago
Alternatives and similar repositories for afriberta
Users that are interested in afriberta are comparing it to the libraries listed below
Sorting:
- MAFAND-MT☆56Updated 11 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- ☆109Updated last year
- Crosslingual Question Answering for African Languages☆30Updated 8 months ago
- ☆17Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆94Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated last year
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆45Updated 2 years ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆48Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- Information extraction from English and German texts based on predicate logic☆137Updated 2 years ago
- HF's ML for Audio study group☆192Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Some notebooks for NLP☆204Updated last year
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 2 months ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- Generate large textual corpora for almost any language by crawling the web☆12Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆31Updated 3 months ago