aisingapore / sealion
South-East Asia Large Language Models
☆297Updated this week
Alternatives and similar repositories for sealion:
Users that are interested in sealion are comparing it to the libraries listed below
- ☆66Updated 10 months ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆163Updated 8 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 10 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆78Updated 2 months ago
- ☆67Updated 2 years ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 8 months ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆56Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated 10 months ago
- Bud500: A Comprehensive Vietnamese ASR Dataset☆65Updated last year
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆25Updated last year
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆321Updated 3 months ago
- Machine Reading Comprehension special for the Vietnamese language☆40Updated 3 years ago
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated 2 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆94Updated last year
- FSDS Webinar 1: Real-Time Machine Learning Inference with Spark Streaming and Kafka☆10Updated last month
- Build English-Vietnamese machine translation with ProtonX Transformer. :D☆66Updated 3 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆208Updated 5 months ago
- Phần mềm nguồn mở giúp mỗi cá nhân trực tiếp sử dụng ChatGPT và hơn thế nữa ngay trên máy tính của mình.☆35Updated last year
- Sample solution for MLOps Marathon 2023☆29Updated last year
- Translate large dataset to any language with google translation api and multithreads processing, no key required!☆68Updated 5 months ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆111Updated 2 years ago
- Convert LLaMA3.1-8B to DeepSeek R1 MLA & MoE (raw)☆19Updated 2 weeks ago
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆18Updated 3 years ago
- Let's build better datasets, together!☆257Updated 3 months ago
- ☆46Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆119Updated 7 months ago
- ☆26Updated last year
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆240Updated last year