Kipok / NeMo-Skills

A pipeline to improve skills of large language models

☆191

Related projects ⓘ

Alternatives and complementary repositories for NeMo-Skills

HKUNLP / ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆358Updated last month
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆199Updated 6 months ago
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆195Updated 2 weeks ago
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆438Updated 8 months ago
QwenLM / AutoIF
☆217Updated 3 months ago
OpenBMB / InfiniteBench
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆285Updated last month
Psycoy / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆224Updated last week
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆293Updated 11 months ago
for-ai / parameter-efficient-moe
☆247Updated last year
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆307Updated 7 months ago
MARIO-Math-Reasoning / Super_MARIO
☆252Updated last month
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆124Updated 3 weeks ago
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆129Updated 2 months ago
imoneoi / multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
☆178Updated 3 months ago
p-lambda / dsir
DSIR large-scale data selection framework for language model training
☆230Updated 7 months ago
GAIR-NLP / ProX
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
☆191Updated last month
TIGER-AI-Lab / LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning"
☆91Updated 4 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆236Updated 4 months ago
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆145Updated 2 weeks ago
GAIR-NLP / ReAlign
Reformatted Alignment
☆112Updated last month
jshuadvd / LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆124Updated 4 months ago
huggingface / cosmopedia
☆451Updated 3 weeks ago
SALT-NLP / demonstrated-feedback
☆112Updated last month
deepseek-ai / ESFT
Expert Specialized Fine-Tuning
☆145Updated last month
princeton-nlp / AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆277Updated 2 months ago
google-deepmind / loft
LOFT: A 1 Million+ Token Long-Context Benchmark
☆146Updated 3 weeks ago
MozerWang / Loong
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
☆92Updated last week
OpenBMB / Eurus
☆287Updated 2 months ago
NVIDIA / NeMo-Aligner
Scalable toolkit for efficient model alignment
☆620Updated this week
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆219Updated 2 months ago