UMxYTL-AI-Labs / MalayMMLULinks
[MalayMMLU] This is the first-ever Bahasa Melayu multitask benchmark designed to elevate the performance of Large Language Models (LLMs) and Large Vision Language Models (LVLMs).
☆54Updated 3 months ago
Alternatives and similar repositories for MalayMMLU
Users that are interested in MalayMMLU are comparing it to the libraries listed below
Sorting:
- Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/☆514Updated this week
- We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/☆324Updated 3 months ago
- South-East Asia Large Language Models☆379Updated 2 weeks ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆94Updated 10 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆418Updated last week
- Sarjana is an open source desktop application which is used to assist in reading information materials, be it research papers or technica…☆24Updated last year
- Efficiently find the best-suited language model (LM) for your NLP task☆132Updated 4 months ago
- Build datasets using natural language☆552Updated 2 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆74Updated 2 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆338Updated last year
- Scripts for text classification with llama and bert☆29Updated 4 months ago
- Recipes to prepare datasets!☆15Updated last week
- Model Activity Visualiser☆520Updated 8 months ago
- ☆235Updated 3 weeks ago
- Low memory full parameter finetuning of LLMs☆53Updated 5 months ago
- CLIP (Contrastive Language–Image Pre-training) trained on Indonesian data☆19Updated 4 years ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆271Updated last month
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆176Updated 3 weeks ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆89Updated 5 months ago
- Tool for generating high quality Synthetic datasets☆1,427Updated last month
- implement RED metrics in fastapi integrate with Prometheus and Grafana☆40Updated 9 months ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆173Updated last year
- An open-source tool for LLM prompt optimization.☆728Updated 3 weeks ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 8 months ago
- Simple UI for debugging correlations of text embeddings☆302Updated 6 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆481Updated 3 months ago
- Fast Semantic Text Deduplication & Filtering☆854Updated last month
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.☆446Updated this week
- ☆57Updated last year