sail-sg / sailor-llm
[EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia
☆125Updated last month
Alternatives and similar repositories for sailor-llm:
Users that are interested in sailor-llm are comparing it to the libraries listed below
- 🚢 Data Toolkit for Sailor Language Models☆85Updated last month
- Reformatted Alignment☆113Updated 4 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆201Updated last month
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆157Updated 6 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆209Updated 3 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- FuseAI Project☆80Updated this week
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆48Updated 3 months ago
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆71Updated 7 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆76Updated 4 months ago
- ☆64Updated last month
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆125Updated 6 months ago
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"☆91Updated last month
- ☆72Updated 2 weeks ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆98Updated 6 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆62Updated last year
- The HELMET Benchmark☆109Updated last week
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆138Updated 4 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆71Updated 7 months ago
- ☆45Updated 7 months ago
- Code for KaLM-Embedding models☆68Updated 2 weeks ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆204Updated 8 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆156Updated 7 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆128Updated 2 months ago
- ☆36Updated 4 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆93Updated last month
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆47Updated 5 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 7 months ago
- List of papers on Self-Correction of LLMs.☆70Updated last month
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆154Updated 7 months ago