abacaj / awesome-transformersLinks
A curated list of awesome transformer models.
☆658Updated 2 years ago
Alternatives and similar repositories for awesome-transformers
Users that are interested in awesome-transformers are comparing it to the libraries listed below
Sorting:
- ☆590Updated 2 years ago
- An open-source implementation of Google's PaLM models☆820Updated last year
- Salesforce open-source LLMs with 8k sequence length.☆719Updated 5 months ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆591Updated last year
- Ask Me Anything language model prompting☆547Updated 2 years ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆819Updated 2 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆765Updated 8 months ago
- Run inference on MPT-30B using CPU☆575Updated 2 years ago
- A curated index to track AI-powered products.☆768Updated last year
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM☆1,479Updated 2 months ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits☆726Updated last year
- An open collection of implementation tips, tricks and resources for training large language models☆477Updated 2 years ago
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".☆1,308Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,637Updated last year
- A joint community effort to create one central leaderboard for LLMs.☆303Updated 10 months ago
- A tiny library for coding with large language models.☆1,233Updated last year
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,006Updated 10 months ago
- This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Da…☆488Updated last year
- An open collection of methodologies to help with successful training of large language models.☆503Updated last year
- Large Language Models for All, 🦙 Cult and More, Stay in touch !☆445Updated 2 years ago
- ☆1,031Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆501Updated last year
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,062Updated last year
- A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)☆1,125Updated last year
- A school for camelids☆1,208Updated 2 years ago
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆291Updated last year
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,042Updated 11 months ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆352Updated 2 years ago
- Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks☆605Updated 2 years ago