abacaj / awesome-transformers
A curated list of awesome transformer models.
☆666 · Updated 2 years ago
Alternatives and similar repositories for awesome-transformers
Users who are interested in awesome-transformers are comparing it to the repositories listed below.
- ☆587 · Updated 2 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture. ☆771 · Updated last year
- Large Language Models for All, 🦙 Cult and More, Stay in touch ! ☆444 · Updated 2 years ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆596 · Updated last year
- Ask Me Anything language model prompting ☆545 · Updated 2 years ago
- An open collection of implementation tips, tricks and resources for training large language models ☆482 · Updated 2 years ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ☆1,477 · Updated 6 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models ☆1,011 · Updated last year
- An open-source implementation of Google's PaLM models ☆816 · Updated last year
- Salesforce open-source LLMs with 8k sequence length. ☆722 · Updated 9 months ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆301 · Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,630 · Updated 2 years ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" ☆1,063 · Updated last year
- ☆268 · Updated 9 months ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆821 · Updated 2 years ago
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory". ☆814 · Updated last year
- A joint community effort to create one central leaderboard for LLMs. ☆304 · Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA ☆629 · Updated last year
- Finetuning Large Language Models on One Consumer GPU in 2 Bits ☆730 · Updated last year
- This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Da… ☆492 · Updated last year
- ☆443 · Updated 2 years ago
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆714 · Updated 2 years ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆356 · Updated 2 years ago
- Run inference on MPT-30B using CPU ☆575 · Updated 2 years ago
- A curated index to track AI-powered products. ☆775 · Updated last year
- A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca) ☆1,132 · Updated last year
- An open collection of methodologies to help with successful training of large language models. ☆536 · Updated last year
- Build, evaluate, understand, and fix LLM-based apps ☆491 · Updated last year
- Tune any FALCON in 4-bit ☆464 · Updated 2 years ago
- ☆457 · Updated 2 years ago