dheeraj7596 / Small2Large

☆17

Alternatives and similar repositories for Small2Large

Users who are interested in Small2Large are comparing it to the repositories listed below.

sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94 Updated 5 months ago
abhika-m / FAVA
☆73 Updated last year
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146 Updated 9 months ago
awslabs / rag-qa-arena
☆49 Updated 11 months ago
cambridgeltl / PairS
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)
☆47 Updated 6 months ago
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆45 Updated last year
fe1ixxu / CPO_SIMPO
This repository combines the CPO and SimPO methods for better reference-free preference learning.
☆55 Updated 11 months ago
liuqi6777 / pe_rank
Leveraging passage embeddings for efficient listwise reranking with large language models.
☆46 Updated 8 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58 Updated last year
gangiswag / llm-reranker
☆50 Updated 6 months ago
nlp-uoregon / ullme
☆20 Updated 3 months ago
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
☆85 Updated last year
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆78 Updated last year
shizhediao / Post-Training-Data-Flywheel
We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.
☆58 Updated 10 months ago
LuLuLuyi / LongHeads
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆29 Updated last year
GAIR-NLP / ReAlign
Reformatted Alignment
☆113 Updated 10 months ago
AIR-Bench / AIR-Bench
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆150 Updated last week
princeton-nlp / LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆93 Updated 8 months ago
qhjqhj00 / WebBrain
☆68 Updated 2 years ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33 Updated last year
OFA-Sys / DiverseEvol
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
☆83 Updated last year
Hannibal046 / SelfMemory
[NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory
☆61 Updated 2 years ago
yifanzhang-pro / AutoMathText
Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Text…
☆84 Updated 2 weeks ago
WadeYin9712 / Dynosaur
Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)
☆64 Updated last year
tianyi-lab / Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆166 Updated last month
yueyu1030 / AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
☆152 Updated last year
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆114 Updated last year
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27 Updated 6 months ago
meowpass / FollowComplexInstruction
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆51 Updated last year
yongchao98 / PROMST
Automatic prompt optimization framework for multi-step agent tasks.
☆32 Updated 8 months ago