π± Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
β71Mar 21, 2025Updated last year
Alternatives and similar repositories for sailor2
Users that are interested in sailor2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP-2024] βοΈ Sailor: Open Language Models for South-East Asiaβ138Dec 21, 2024Updated last year
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)β85Oct 23, 2024Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewardsβ47Apr 15, 2025Updated last year
- [ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Modelsβ23Oct 22, 2024Updated last year
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)β65Jan 11, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'β24May 20, 2025Updated 11 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimizationβ54Jul 15, 2025Updated 9 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.β136Mar 21, 2025Updated last year
- V1: Toward Multimodal Reasoning by Designing Auxiliary Taskβ36Apr 14, 2025Updated last year
- π’ Data Toolkit for Sailor Language Modelsβ96Feb 24, 2025Updated last year
- β17Dec 12, 2024Updated last year
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β652Jan 29, 2026Updated 3 months ago
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Modelsβ48Dec 4, 2024Updated last year
- Reinforcing General Reasoning without Verifiersβ99Jun 24, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [TMLR 2025] On Memorization in Diffusion Modelsβ31Oct 5, 2023Updated 2 years ago
- β13Jul 25, 2023Updated 2 years ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)β159Jul 8, 2025Updated 9 months ago
- A Self-Consistent Robust Error (ICML 2022)β68Jun 25, 2023Updated 2 years ago
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)β39Jan 23, 2024Updated 2 years ago
- [ICLR 2025] 𧬠RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)β190Feb 17, 2025Updated last year
- Graph Diffusion Policy Optimizationβ42Mar 17, 2024Updated 2 years ago
- Collections of RLxLM experiments using minimal codesβ14Feb 17, 2025Updated last year
- β16Jul 23, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- β22Dec 18, 2024Updated last year
- Official homepage for Tab-CoT: Zero-shot Tabular Chain of Thought (Findings of ACL 2023)β33May 31, 2023Updated 2 years ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)β27Feb 25, 2025Updated last year
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"β22Sep 21, 2025Updated 7 months ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.β13Apr 9, 2024Updated 2 years ago
- Trending projects & awesome papers about data-centric llm studies.β40May 20, 2025Updated 11 months ago
- A Gym for Agentic LLMsβ478Jan 21, 2026Updated 3 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.β23Jul 1, 2024Updated last year
- β13Feb 7, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β17Apr 10, 2024Updated 2 years ago
- an environment based on XLA for deep learning compiler optimization research.β24Mar 7, 2023Updated 3 years ago
- Simple and scalable tools for data-driven pretraining data selection.β29Jun 9, 2025Updated 10 months ago
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.β17Jul 18, 2024Updated last year
- β13Feb 2, 2022Updated 4 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)β17Aug 22, 2025Updated 8 months ago
- Official code for "On Calibrating Diffusion Probabilistic Models"β30Feb 22, 2023Updated 3 years ago