π’ Data Toolkit for Sailor Language Models
β95Feb 24, 2025Updated last year
Alternatives and similar repositories for sailcraft
Users that are interested in sailcraft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP-2024] βοΈ Sailor: Open Language Models for South-East Asiaβ139Dec 21, 2024Updated last year
- β21Apr 16, 2025Updated last year
- π± Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMsβ73Mar 21, 2025Updated last year
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoningβ¦β22Nov 2, 2021Updated 4 years ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"β89May 11, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Aug 15, 2023Updated 2 years ago
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification.β18Dec 27, 2022Updated 3 years ago
- A lightweight script for processing HTML page to markdown format with support for code blocksβ82Apr 14, 2024Updated 2 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?β11Apr 18, 2023Updated 3 years ago
- β13Sep 6, 2022Updated 3 years ago
- β15Mar 12, 2024Updated 2 years ago
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)β65Jan 11, 2025Updated last year
- An Empirical Study of Memorization in NLP (ACL 2022)β13Jun 22, 2022Updated 3 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]β148Oct 27, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asiaβ174Jul 30, 2024Updated last year
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"β20Jul 23, 2021Updated 4 years ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)β85Oct 23, 2024Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Schedulingβ43Dec 29, 2025Updated 5 months ago
- Difference-based Contrastive Learning for Korean Sentence Embeddingsβ23Mar 11, 2026Updated 3 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"β24Apr 30, 2025Updated last year
- Tiny evaluation of leading LLMs on competitive programming problemsβ14Apr 10, 2026Updated 2 months ago
- [ICLR 2025] 𧬠RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)β192Feb 17, 2025Updated last year
- β40Oct 10, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β14Sep 30, 2021Updated 4 years ago
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented anβ¦β28Sep 27, 2024Updated last year
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that β¦β13Nov 20, 2024Updated last year
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representationsβ19Oct 18, 2025Updated 8 months ago
- An automated tool for discovering insights from research papaer corporaβ137Jun 8, 2024Updated 2 years ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS β¦β60Oct 11, 2024Updated last year
- Waffer-thin FlaskGPT on Vercel.β12Jun 1, 2023Updated 3 years ago
- Aioli: A unified optimization framework for language model data mixingβ32Jan 17, 2025Updated last year
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignmentβ83Jun 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance anβ¦β14May 6, 2023Updated 3 years ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoningβ26Mar 3, 2025Updated last year
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learningβ14Jun 3, 2025Updated last year
- Learning to Rewrite for Non-Autoregressive Neural Machine Translationβ21Dec 23, 2021Updated 4 years ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notationβ14Jan 2, 2026Updated 5 months ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesisβ11Feb 17, 2023Updated 3 years ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".β65Apr 18, 2023Updated 3 years ago