sail-sg / sailcraft
Data Toolkit for Sailor Language Models
☆95 · Updated 10 months ago
Alternatives and similar repositories for sailcraft
Users that are interested in sailcraft are comparing it to the libraries listed below
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆145 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Updated last year
- ☆129 · Updated last year
- ☆161 · Updated last year
- Reformatted Alignment ☆112 · Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆81 · Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆80 · Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆136 · Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆49 · Updated last year
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: … ☆89 · Updated last month
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024) ☆205 · Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users ☆246 · Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆31 · Updated last year
- ☆69 · Updated 2 years ago
- ☆75 · Updated last year
- ☆62 · Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ☆73 · Updated 7 months ago
- The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models" ☆55 · Updated 2 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆43 · Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆102 · Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆55 · Updated last year
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper… ☆128 · Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆100 · Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆141 · Updated 2 years ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆212 · Updated 6 months ago
- ☆59 · Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆76 · Updated 7 months ago
- Complex Function Calling Benchmark. ☆159 · Updated 11 months ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆113 · Updated 6 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆95 · Updated 2 years ago