abacaj / awesome-transformers
A curated list of awesome transformer models.
☆637 · Updated last year
Alternatives and similar repositories for awesome-transformers:
Users who are interested in awesome-transformers are comparing it to the repositories listed below.
- A collection of modular datasets generated by GPT-4: General-Instruct, Roleplay-Instruct, Code-Instruct, and Toolformer ☆1,623 · Updated last year
- Ask Me Anything language model prompting ☆544 · Updated last year
- Salesforce open-source LLMs with 8k sequence length. ☆717 · Updated 2 weeks ago
- CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ☆1,466 · Updated 3 weeks ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" ☆1,060 · Updated 11 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF. ☆42 · Updated last year
- ☆1,026 · Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components ☆879 · Updated 10 months ago
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory". ☆781 · Updated 10 months ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture. ☆760 · Updated 3 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models ☆992 · Updated 6 months ago
- ☆456 · Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆818 · Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆301 · Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆584 · Updated last year
- ☆589 · Updated last year
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research … ☆936 · Updated 2 months ago
- An open collection of implementation tips, tricks and resources for training large language models ☆469 · Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆705 · Updated last year
- A school for camelids ☆1,210 · Updated last year
- C++ implementation for BLOOM ☆810 · Updated last year
- An open-source implementation of Google's PaLM models ☆818 · Updated 7 months ago
- A tiny library for coding with large language models. ☆1,223 · Updated 7 months ago
- Large Language Models for All, 🦙 Cult and More, Stay in touch! ☆436 · Updated last year
- Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models". ☆1,109 · Updated last year
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform… ☆1,449 · Updated last year
- Prompt programming with FMs. ☆440 · Updated 6 months ago
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,319 · Updated 8 months ago
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". ☆1,305 · Updated last year
- Language Modeling with the H3 State Space Model ☆516 · Updated last year