masakhane-io / masakhanePreprocessorLinks
Building an effective preprocessing tool for African languages
☆13Updated last year
Alternatives and similar repositories for masakhanePreprocessor
Users that are interested in masakhanePreprocessor are comparing it to the libraries listed below
Sorting:
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆77Updated 3 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆33Updated last year
- Crosslingual Question Answering for African Languages☆31Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- Transforming textual descriptions into process models using deep learning☆15Updated 6 years ago
- Streamlit app to Translate text to or between 50 languages with mBART-50 from Huggingface and Facebook☆25Updated 4 years ago
- ☆12Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆16Updated 2 years ago
- MAFAND-MT☆59Updated last year
- Text simplification for a better world: Deep-Martin Transformer 🤗☆22Updated 2 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Updated 4 years ago
- PyLate efficient inference engine☆65Updated last month
- POS for African languages☆19Updated 3 months ago
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆223Updated 2 years ago
- A Streamlit app to extract keywords using KeyBert☆36Updated 4 years ago
- meta_llama_2finetuned_text_generation_summarization☆21Updated 2 years ago
- Hinglish Text Classification☆30Updated 2 years ago
- A research buddy which helps you asks questions on certain research papers, get insights on top research papers.☆21Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 weeks ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated last month
- This repository will guide you to create ChatGPT like chatbot using OpenAI's GPT 3.5 model☆42Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Learning PyTorch through the D2L book. A series of notebooks for the same☆27Updated 3 years ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated 2 years ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 4 years ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects☆23Updated 2 years ago