sail-sg / sailor2Links
🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
☆67Updated 6 months ago
Alternatives and similar repositories for sailor2
Users that are interested in sailor2 are comparing it to the libraries listed below
Sorting:
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆64Updated 4 months ago
- ☆65Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆43Updated 5 months ago
- Exploration of automated dataset selection approaches at large scales.☆47Updated 6 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆64Updated 5 months ago
- Long Context Extension and Generalization in LLMs☆60Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated last year
- ☆73Updated 6 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆86Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 7 months ago
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems☆45Updated 11 months ago
- [EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia☆139Updated 9 months ago
- ☆94Updated last month
- ☆98Updated 10 months ago
- Benchmarking Benchmark Leakage in Large Language Models☆55Updated last year
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆172Updated 2 months ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆39Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆30Updated last year
- Codebase for Instruction Following without Instruction Tuning☆35Updated last year
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆102Updated 7 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning☆73Updated 10 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆88Updated 5 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆171Updated 3 months ago
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: …☆86Updated 2 weeks ago
- PASTA: Post-hoc Attention Steering for LLMs☆122Updated 10 months ago
- List of papers on Self-Correction of LLMs.☆76Updated 8 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆27Updated 2 months ago
- Replicating O1 inference-time scaling laws☆90Updated 9 months ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆80Updated 3 weeks ago
- ☆100Updated last year