aisingapore / sealion
South-East Asia Large Language Models
☆242Updated last week
Related projects: ⓘ
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆140Updated last month
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆63Updated 2 weeks ago
- ☆57Updated 4 months ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆311Updated last month
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆23Updated 4 months ago
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆29Updated 6 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆19Updated 2 months ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆25Updated 4 months ago
- ☆12Updated 9 months ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆22Updated last year
- RAG for Vietnamese Wikipedia corpus.☆23Updated 9 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆89Updated last year
- ☆60Updated last year
- Machine Reading Comprehension special for the Vietnamese language☆37Updated 2 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆53Updated 10 months ago
- Best practices for distilling large language models.☆371Updated 7 months ago
- Translate large dataset to any language with google translation api and multithreads processing, no key required!☆53Updated this week
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆109Updated last year
- ⚡ LLaMA-2 model experiment☆13Updated 9 months ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆67Updated 7 months ago
- The code used to train and run inference with the ColPali architecture.☆512Updated this week
- Generative Representational Instruction Tuning☆527Updated 2 weeks ago
- Sample solution for MLOps Marathon 2023☆29Updated last year
- ☆419Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆118Updated last week
- Banishing LLM Hallucinations Requires Rethinking Generalization☆253Updated 2 months ago
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆25Updated last year
- Bud500: A Comprehensive Vietnamese ASR Dataset☆65Updated 6 months ago
- Official repository for ORPO☆409Updated 3 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆37Updated last month