sail-sg / sailor2
π± Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
β51Updated last month
Alternatives and similar repositories for sailor2:
Users that are interested in sailor2 are comparing it to the libraries listed below
- Long Context Extension and Generalization in LLMsβ50Updated 5 months ago
- Codebase for Instruction Following without Instruction Tuningβ33Updated 5 months ago
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β80Updated 5 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewardsβ43Updated 7 months ago
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.β72Updated 3 weeks ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"β44Updated 3 weeks ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modelingβ44Updated 2 months ago
- β64Updated 4 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"β70Updated last year
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Modelsβ45Updated last month
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).β25Updated 6 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"β51Updated last month
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or reβ¦β26Updated 5 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception oβ¦β22Updated 3 months ago
- β76Updated 2 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Modelsβ78Updated 8 months ago
- Large Language Models Can Self-Improve in Long-context Reasoningβ62Updated 3 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?β22Updated this week
- List of papers on Self-Correction of LLMs.β71Updated 2 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Modelsβ56Updated 3 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"β72Updated 9 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"β46Updated last year
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"β163Updated last week