bigcode-project / Megatron-LMLinks

Ongoing research training transformer models at scale

☆390

Alternatives and similar repositories for Megatron-LM

Users that are interested in Megatron-LM are comparing it to the libraries listed below

Sorting:

nlpxucan / evol-instruct
☆270Updated 2 years ago
loubnabnl / santacoder-finetuning
Fine-tune SantaCoder for Code/Text Generation.
☆192Updated 2 years ago
salesforce / CodeGen2
CodeGen2 models for program synthesis
☆272Updated 2 years ago
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
abacaj / code-eval
Run evaluation on LLMs using human-eval benchmark
☆417Updated last year
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆524Updated last year
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆721Updated 6 months ago
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆591Updated last year
bigcode-project / octopack
🐙 OctoPack: Instruction Tuning Code Large Language Models
☆472Updated 6 months ago
bigscience-workshop / xmtf
Crosslingual Generalization through Multitask Finetuning
☆537Updated 10 months ago
microsoft / CodeT
☆661Updated 9 months ago
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
conceptofmind / toolformer
☆366Updated 2 years ago
declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆352Updated 2 years ago
OpenLemur / Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆553Updated last year
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆821Updated 2 years ago
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117Updated 2 years ago
salesforce / jaxformer
Minimal library to train LLMs on TPU in JAX with pjit().
☆292Updated last year
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆304Updated 11 months ago
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆956Updated 9 months ago
IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,148Updated 2 months ago
kaistAI / SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
☆227Updated 2 years ago
conceptofmind / PaLM
An open-source implementation of Google's PaLM models
☆820Updated last year
reasoning-machines / pal
PaL: Program-Aided Language Models (ICML 2023)
☆503Updated 2 years ago
bigcode-project / starcoder.cpp
C++ implementation for 💫StarCoder
☆456Updated last year
nexusflowai / NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…
☆316Updated last year
huggingface / transformers-bloom-inference
Fast Inference Solutions for BLOOM
☆563Updated 9 months ago
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆371Updated last year
zphang / minimal-llama
☆458Updated last year
FSoft-AI4Code / CodeCapybara
Open-source Self-Instruction Tuning Code LLM
☆169Updated 2 years ago