salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆717 · Updated 2 months ago
Alternatives and similar repositories for xgen:
Users who are interested in xgen are comparing it to the repositories listed below.
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆821 · Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA ☆629 · Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆585 · Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆693 · Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆545 · Updated last year
- Dromedary: towards helpful, ethical and reliable LLMs. ☆1,141 · Updated last year
- A joint community effort to create one central leaderboard for LLMs. ☆294 · Updated 7 months ago
- Generate textbook-quality synthetic LLM pretraining data ☆498 · Updated last year
- ☆459 · Updated last year
- Ask Me Anything language model prompting ☆547 · Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data. ☆805 · Updated 9 months ago
- Inference code for Persimmon-8B ☆415 · Updated last year
- CodeGen2 models for program synthesis ☆274 · Updated last year
- Official repository for LongChat and LongEval ☆517 · Updated 10 months ago
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text. ☆925 · Updated 3 weeks ago
- Ongoing research training transformer models at scale ☆385 · Updated 7 months ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture. ☆760 · Updated 5 months ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning. ☆555 · Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. ☆44 · Updated last year
- Build, evaluate, understand, and fix LLM-based apps ☆488 · Updated last year
- ☆1,468 · Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆301 · Updated last year
- ☆268 · Updated last year
- Alpaca dataset from Stanford, cleaned and curated ☆1,546 · Updated 2 years ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆930 · Updated 5 months ago
- LOMO: LOw-Memory Optimization ☆984 · Updated 9 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆208 · Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆709 · Updated last year
- Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch ☆640 · Updated 3 months ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset. ☆323 · Updated 3 months ago