NVIDIA / NeMo-SkillsLinks

A project to improve skills of large language models

☆568

Alternatives and similar repositories for NeMo-Skills

Users that are interested in NeMo-Skills are comparing it to the libraries listed below

Sorting:

NVIDIA / NeMo-Aligner
Scalable toolkit for efficient model alignment
☆842Updated 2 months ago
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆472Updated 2 weeks ago
allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆301Updated this week
allenai / olmes
Reproducible, flexible LLM evaluations
☆251Updated 2 months ago
mlfoundations / evalchemy
Automatic evals for LLMs
☆533Updated 3 months ago
huggingface / Math-Verify
☆948Updated 3 months ago
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆343Updated 9 months ago
NVIDIA-NeMo / RL
Scalable toolkit for efficient model reinforcement
☆910Updated this week
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆906Updated this week
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆875Updated last week
huggingface / cosmopedia
☆541Updated 10 months ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,109Updated 4 months ago
HKUNLP / ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆439Updated 11 months ago
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆747Updated last year
knoveleng / open-rs
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
☆261Updated 4 months ago
tianyi-lab / Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆362Updated last year
xfactlab / orpo
Official repository for ORPO
☆464Updated last year
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆805Updated 10 months ago
eddycmu / demystify-long-cot
☆318Updated 4 months ago
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆473Updated last year
magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆775Updated 6 months ago
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆284Updated last week
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆640Updated 3 months ago
project-numina / aimo-progress-prize
☆466Updated last year
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆310Updated last year
facebookresearch / swe-rl
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
☆602Updated 6 months ago
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆195Updated 6 months ago
NVlabs / Minitron
A family of compressed models obtained via pruning and knowledge distillation
☆352Updated 10 months ago
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆443Updated 4 months ago
thinking-machines-lab / batch_invariant_ops
☆773Updated 3 weeks ago