salesforce / xgenLinks

Salesforce open-source LLMs with 8k sequence length.

☆721

Alternatives and similar repositories for xgen

Users that are interested in xgen are comparing it to the libraries listed below

Sorting:

arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆591Updated last year
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆821Updated 2 years ago
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆304Updated 11 months ago
OpenLemur / Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆553Updated last year
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆952Updated 9 months ago
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆702Updated last year
IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,148Updated 2 months ago
bigcode-project / Megatron-LM
Ongoing research training transformer models at scale
☆390Updated 11 months ago
zeno-ml / zeno-build
Build, evaluate, understand, and fix LLM-based apps
☆489Updated last year
nlpxucan / evol-instruct
☆270Updated 2 years ago
salesforce / CodeGen2
CodeGen2 models for program synthesis
☆272Updated 2 years ago
declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆352Updated 2 years ago
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
☆727Updated last year
conceptofmind / PaLM
An open-source implementation of Google's PaLM models
☆820Updated last year
sabetAI / BLoRA
batched loras
☆344Updated last year
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆465Updated last year
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆524Updated last year
zphang / minimal-llama
☆458Updated last year
SkunkworksAI / hydra-moe
☆416Updated last year
manyoso / haltt4llm
This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…
☆222Updated 2 years ago
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,061Updated last year
conceptofmind / toolformer
☆366Updated 2 years ago
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆501Updated last year
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆989Updated last year
h2oai / h2o-wizardlm
Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning
☆312Updated 9 months ago
Victorwz / LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
☆802Updated last year
salesforce / DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
☆511Updated 6 months ago