UMxYTL-AI-Labs / MalayMMLU
[MalayMMLU] This is the first-ever Bahasa Melayu multitask benchmark designed to elevate the performance of Large Language Models (LLMs) and Large Vision Language Models (LVLMs).
☆50 Updated last month
Alternatives and similar repositories for MalayMMLU
Users who are interested in MalayMMLU are comparing it to the libraries listed below.
- Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/ ☆507 Updated this week
- We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/ ☆325 Updated last month
- South-East Asia Large Language Models ☆360 Updated this week
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆404 Updated 2 weeks ago
- ☆207 Updated 2 weeks ago
- Build datasets using natural language ☆532 Updated last month
- Implement RED metrics in FastAPI, integrated with Prometheus and Grafana ☆40 Updated 7 months ago
- Official repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching" ☆160 Updated 3 months ago
- Lightweight, Python-based chat UI ☆336 Updated 6 months ago
- This project focuses on fine-tuning a BERT model for question answering using a limited dataset for illustration purposes. ☆30 Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation ☆86 Updated 3 months ago
- Efficiently find the best-suited language model (LM) for your NLP task ☆127 Updated 2 months ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia ☆173 Updated last year
- Fast Semantic Text Deduplication & Filtering ☆816 Updated 2 weeks ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures. ☆92 Updated 8 months ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts ☆16 Updated last year
- Translate large datasets to any language with the Google Translation API and multithreaded processing, no key required! ☆72 Updated last year
- ☆157 Updated 6 months ago
- Train LLM on Hugging Face infra ☆64 Updated last month
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training ☆67 Updated 8 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆175 Updated last year
- ☆207 Updated 4 months ago
- A repository containing general tutorials I'd like to share with the world. ☆46 Updated 3 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023 ☆162 Updated 4 months ago
- ☆207 Updated last year
- Scripts for text classification with llama and bert ☆27 Updated 2 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆336 Updated 4 months ago
- Enhancing Translation with RAG-Powered Large Language Models ☆83 Updated 3 weeks ago
- ☆119 Updated last year
- From data to vector database effortlessly ☆82 Updated 5 months ago