aisingapore / sealion
South-East Asia Large Language Models
☆270Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for sealion
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆149Updated 3 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆68Updated 2 months ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆314Updated 3 months ago
- ☆59Updated 6 months ago
- ☆46Updated 3 months ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆25Updated 6 months ago
- ☆13Updated 11 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆40Updated 3 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆91Updated last year
- Machine Reading Comprehension special for the Vietnamese language☆38Updated 2 years ago
- Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)☆33Updated 11 months ago
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆112Updated 2 months ago
- Bud500: A Comprehensive Vietnamese ASR Dataset☆64Updated 8 months ago
- Translate large dataset to any language with google translation api and multithreads processing, no key required!☆57Updated last month
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆54Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆32Updated 8 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆24Updated 6 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆246Updated 2 weeks ago
- Build English-Vietnamese machine translation with ProtonX Transformer. :D☆64Updated 3 years ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆118Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆261Updated 4 months ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆67Updated 10 months ago
- ☆60Updated last year
- Best practices for distilling large language models.☆402Updated 9 months ago
- Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT☆34Updated 11 months ago
- ☆25Updated 9 months ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆111Updated last year
- A compact LLM pretrained in 9 days by using high quality data☆264Updated 2 months ago
- Automatically evaluate your LLMs in Google Colab☆559Updated 6 months ago