EthioNLP / Ethiopian-Language-SurveyView external linksLinks
Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities
☆17Jun 4, 2025Updated 8 months ago
Alternatives and similar repositories for Ethiopian-Language-Survey
Users that are interested in Ethiopian-Language-Survey are comparing it to the libraries listed below
Sorting:
- Different semantic models for Amharic☆22Jan 15, 2024Updated 2 years ago
- ☆16Dec 11, 2019Updated 6 years ago
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus i…☆37May 27, 2023Updated 2 years ago
- Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.☆43Aug 2, 2018Updated 7 years ago
- Lexical Data of Ge'ez Languages☆55Sep 14, 2022Updated 3 years ago
- simple bs4 based web crawl for a corpus in need of statistical machine translation☆13Aug 17, 2021Updated 4 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆41Oct 13, 2022Updated 3 years ago
- An Amharic News Text classification Dataset☆38May 17, 2024Updated last year
- notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆11May 10, 2024Updated last year
- Python application, generating parallel corpus for any language pairs, can be used for training nmt (Neural Machine Translation) systems☆12Dec 8, 2022Updated 3 years ago
- AmQA - The first Amharic Open Domain Question Answering Dataset☆13May 27, 2024Updated last year
- Binaural beats brain-wave app.☆16Nov 28, 2023Updated 2 years ago
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.☆18Oct 3, 2024Updated last year
- This is the repository for my Advanced Computing courses at the Computer Science and Technology department of Tsinghua University.☆25Jun 14, 2023Updated 2 years ago
- huggingface-based implementation of an open question answering model trained on the newsqa dataset.☆23Feb 6, 2023Updated 3 years ago
- ☆27Jan 12, 2023Updated 3 years ago
- Gemini CLI wrapper to serve Gemini models through an OpenAI-compatible API☆51Aug 7, 2025Updated 6 months ago
- MCP tool to allow multiple chains of thought☆43Jan 7, 2026Updated last month
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆45Aug 10, 2025Updated 6 months ago
- Morphological processing for languages of the Horn of Africa☆54Dec 27, 2025Updated last month
- RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.☆70Nov 26, 2025Updated 2 months ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Jan 10, 2024Updated 2 years ago
- Codebase for probing and visualizing multilingual models.☆49May 13, 2020Updated 5 years ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Feb 19, 2022Updated 3 years ago
- ☆47Jan 23, 2020Updated 6 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Dec 8, 2022Updated 3 years ago
- Treating UI Libraries as first class citizen and making sure they are headless :)☆485Dec 15, 2025Updated 2 months ago
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval☆120Aug 7, 2024Updated last year
- Generalist and Lightweight Model for Text Classification☆170Feb 8, 2026Updated last week
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆317May 28, 2020Updated 5 years ago
- Advanced fetch wrapper for typescript☆952Dec 22, 2025Updated last month
- Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.☆1,241Jan 31, 2026Updated 2 weeks ago
- ☆1,296Dec 15, 2022Updated 3 years ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,070Oct 16, 2025Updated 4 months ago
- Machine Learning Journal for Intermediate to Advanced Topics.☆2,272Sep 8, 2025Updated 5 months ago
- Handle roles and permissions in your Laravel application☆2,282Jan 6, 2026Updated last month
- A library for mechanistic interpretability of GPT-style language models☆3,073Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,885Updated this week
- SQL Database Explorer [SQLite, libSQL, PostgreSQL, MySQL/MariaDB, ClickHouse, DuckDB, Microsoft SQL Server]☆3,461Updated this week