aisingapore / sealion
South-East Asia Large Language Models
☆287Updated last month
Alternatives and similar repositories for sealion:
Users that are interested in sealion are comparing it to the libraries listed below
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆157Updated 6 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆74Updated last week
- ☆64Updated 8 months ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated 8 months ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆69Updated last year
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆125Updated last month
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆111Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆33Updated 11 months ago
- Machine Reading Comprehension special for the Vietnamese language☆39Updated 2 years ago
- Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT☆36Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- ☆63Updated last year
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆25Updated last year
- ⚡ LLaMA-2 model experiment☆13Updated last year
- ☆14Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆92Updated last year
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated last week
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆31Updated 6 months ago
- ☆50Updated 5 months ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆54Updated last year
- ☆26Updated last year
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆125Updated 2 months ago
- Bud500: A Comprehensive Vietnamese ASR Dataset☆65Updated 11 months ago
- Sample solution for MLOps Marathon 2023☆29Updated last year
- Vietnamese self-supervised Wav2vec2 model☆61Updated 2 years ago
- ViHealthBERT: Pre-trained Language Models for Vietnamese in Health Text Mining (LREC 2022)☆14Updated last month
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 8 months ago
- ☆46Updated last year
- RAG for Vietnamese Wikipedia corpus.☆26Updated last year
- ☆23Updated last year