BaichuanSEED / BaichuanSEED.github.io
Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline"
☆15Updated 3 weeks ago
Related projects: ⓘ
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆27Updated this week
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆54Updated 6 months ago
- ☆13Updated last month
- Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head☆13Updated last year
- ☆34Updated 2 weeks ago
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆60Updated 3 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆34Updated 2 months ago
- An Experiment on Dynamic NTK Scaling RoPE☆59Updated 9 months ago
- Code implementation of synthetic continued pretraining☆13Updated this week
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆48Updated last week
- ☆31Updated 5 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆31Updated last month
- The paper list of multilingual pre-trained models (Continual Updated).☆15Updated 3 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆33Updated 6 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆44Updated 8 months ago
- Code for Suri: Multi-constraint instruction following for long-form text generation☆15Updated last week
- ☆87Updated 4 months ago
- InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising☆32Updated 2 months ago
- ☆12Updated 7 months ago
- ☆14Updated last week
- ☆22Updated 3 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆73Updated 7 months ago
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆17Updated last month
- Cascade Speculative Drafting☆23Updated 5 months ago
- ☆60Updated 5 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…☆34Updated 2 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models☆42Updated last week
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting☆60Updated 6 months ago
- ☆16Updated 6 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆39Updated 6 months ago