bigcode-project / Megatron-LM
Ongoing research training transformer models at scale
☆384Updated 7 months ago
Alternatives and similar repositories for Megatron-LM:
Users that are interested in Megatron-LM are comparing it to the libraries listed below
- Fine-tune SantaCoder for Code/Text Generation.☆191Updated 2 years ago
- CodeGen2 models for program synthesis☆274Updated last year
- Official repository for LongChat and LongEval☆517Updated 10 months ago
- ☆268Updated last year
- ☆428Updated 7 months ago
- Run evaluation on LLMs using human-eval benchmark☆404Updated last year
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆462Updated 2 months ago
- Fast Inference Solutions for BLOOM☆561Updated 6 months ago
- Dromedary: towards helpful, ethical and reliable LLMs.☆1,140Updated last year
- Crosslingual Generalization through Multitask Finetuning☆530Updated 6 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Updated last year
- Salesforce open-source LLMs with 8k sequence length.☆716Updated 2 months ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆226Updated last year
- Minimal library to train LLMs on TPU in JAX with pjit().☆283Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆821Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆313Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆544Updated last year
- ☆356Updated 2 years ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆585Updated last year
- A framework for the evaluation of autoregressive code generation language models.☆921Updated 5 months ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆351Updated last year
- ☆745Updated 9 months ago
- Inference code for Persimmon-8B☆415Updated last year
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆223Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆301Updated last year
- Open Source WizardCoder Dataset☆157Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated 2 years ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆804Updated 9 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆930Updated 5 months ago