masakhane-io / masakhanePreprocessor
Building an effective preprocessing tool for African languages
☆13Updated last year
Alternatives and similar repositories for masakhanePreprocessor
Users who are interested in masakhanePreprocessor are comparing it to the libraries listed below
- Crosslingual Question Answering for African Languages☆31Updated 10 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 2 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆32Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆74Updated 3 years ago
- MAFAND-MT☆57Updated last year
- POS for African languages☆17Updated last month
- COMET for African languages☆10Updated 6 months ago
- ☆17Updated 2 years ago
- ☆110Updated last year
- Transforming textual descriptions into process models using deep learning☆15Updated 6 years ago
- Masakhane Web is a translation web application solely for African languages.☆37Updated 2 years ago
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆45Updated 2 years ago
- Streamlit app to Translate text to or between 50 languages with mBART-50 from Huggingface and Facebook☆25Updated 4 years ago
- ☆64Updated 2 years ago
- Tool to take your ML model from local to production with one line of code.☆25Updated last year
- NLP Examples using the 🤗 libraries☆41Updated 4 years ago
- Text simplification for a better world: Deep-Martin Transformer 🤗☆22Updated last year
- Learning PyTorch through the D2L book. A series of notebooks for the same☆27Updated 3 years ago
- Using short models to classify long texts☆21Updated 2 years ago
- A personal knowledge base that I can dump information to and help me learn☆24Updated 2 months ago
- ☆40Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- NeatText: a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- Pinecone text client library☆65Updated this week
- Comparing M2M and mT5 on rare language pairs, blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-re…☆16Updated 4 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year