sail-sg / sailor2
π± Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
β60Updated last month
Alternatives and similar repositories for sailor2
Users that are interested in sailor2 are comparing it to the libraries listed below
Sorting:
- General Reasoner: Advancing LLM Reasoning Across All Domainsβ82Updated last week
- β63Updated last week
- Replicating O1 inference-time scaling lawsβ85Updated 5 months ago
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.β83Updated 2 months ago
- Exploration of automated dataset selection approaches at large scales.β40Updated 2 months ago
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β84Updated 7 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curationβ47Updated 2 weeks ago
- β65Updated 2 months ago
- Long Context Extension and Generalization in LLMsβ55Updated 7 months ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Procesβ¦β47Updated 4 months ago
- Revisiting Mid-training in the Era of RL Scalingβ37Updated 3 weeks ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Schedulingβ29Updated last month
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systemsβ33Updated 7 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningβ95Updated last week
- Large Language Models Can Self-Improve in Long-context Reasoningβ69Updated 5 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"β42Updated 7 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Modelsβ52Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"β146Updated 3 weeks ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]β134Updated 7 months ago
- β73Updated 6 months ago
- β64Updated last year
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"β38Updated last week
- β97Updated this week
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understandingβ51Updated 5 months ago
- Code for "Reasoning to Learn from Latent Thoughts"β94Updated last month
- β92Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuningβ34Updated 7 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".β55Updated 2 months ago
- β38Updated last month
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"β179Updated 2 months ago