BaichuanSEED / BaichuanSEED.github.io

Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline"

☆15

Related projects: ⓘ

kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆27Updated this week
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆54Updated 6 months ago
yale-nlp / refdpo
☆13Updated last month
THUDM / Efficient-Head-Finetuning
Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head
☆13Updated last year
Zheng0428 / COIG-Kun
☆34Updated 2 weeks ago
QwenLM / online_merging_optimizers
Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
☆60Updated 3 months ago
meowpass / FollowComplexInstruction
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆34Updated 2 months ago
NormXU / Consistent-DynamicNTKRoPE
An Experiment on Dynamic NTK Scaling RoPE
☆59Updated 9 months ago
ZitongYang / Synthetic_Continued_Pretraining
Code implementation of synthetic continued pretraining
☆13Updated this week
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆48Updated last week
RUCAIBox / BAMBOO
☆31Updated 5 months ago
fairyshine / Seal-Tools
The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…
☆31Updated last month
kaiyuhwang / MLLM-Survey
The paper list of multilingual pre-trained models (Continual Updated).
☆15Updated 3 months ago
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆33Updated 6 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆44Updated 8 months ago
chtmp223 / suri
Code for Suri: Multi-constraint instruction following for long-form text generation
☆15Updated last week
yegcjs / mixinglaws
☆87Updated 4 months ago
weizhepei / InstructRAG
InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising
☆32Updated 2 months ago
dheeraj7596 / Small2Large
☆12Updated 7 months ago
jdf-prog / LLM-Engines
☆14Updated last week
Zyphra / Zyda_processing
☆22Updated 3 months ago
Re-Align / just-eval
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
☆73Updated 7 months ago
TsinghuaC3I / Intuitive-Fine-Tuning
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆17Updated last month
lfsszd / CS-Drafting
Cascade Speculative Drafting
☆23Updated 5 months ago
locuslab / scaling_laws_data_filtering
☆60Updated 5 months ago
cambridgeltl / PairS
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…
☆34Updated 2 months ago
cxcscmu / MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models
☆42Updated last week
thunlp / Ouroboros
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting
☆60Updated 6 months ago
ChiyuSONG / dynamics-of-instruction-tuning
☆16Updated 6 months ago
liyucheng09 / Contamination_Detector
Lightweight tool to identify Data Contamination in LLMs evaluation
☆39Updated 6 months ago