abacaj / awesome-transformersLinks

A curated list of awesome transformer models.

☆664

Alternatives and similar repositories for awesome-transformers

Users that are interested in awesome-transformers are comparing it to the libraries listed below

Sorting:

osanseviero / ml_timeline
☆587Updated 2 years ago
booydar / recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
☆770Updated 11 months ago
dair-ai / AI-Product-Index
A curated index to track AI-powered products.
☆775Updated last year
conceptofmind / PaLM
An open-source implementation of Google's PaLM models
☆820Updated last year
HazyResearch / ama_prompting
Ask Me Anything language model prompting
☆546Updated 2 years ago
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,010Updated last year
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
ctlllll / LLM-ToolMaker
☆1,042Updated 2 years ago
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆595Updated last year
huggingface / large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
☆481Updated 2 years ago
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,063Updated last year
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆721Updated 8 months ago
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆821Updated 2 years ago
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆505Updated last year
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆627Updated last year
lucidrains / PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆825Updated 2 years ago
zphang / minimal-llama
☆457Updated last year
SkunkworksAI / hydra-moe
☆415Updated last year
salesforce / CodeTF
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
☆1,477Updated 5 months ago
shm007g / LLaMA-Cult-and-More
Large Language Models for All, 🦙 Cult and More, Stay in touch !
☆444Updated 2 years ago
declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆354Updated 2 years ago
teknium1 / GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,636Updated 2 years ago
zeno-ml / zeno-build
Build, evaluate, understand, and fix LLM-based apps
☆490Updated last year
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆304Updated last year
radi-cho / datasetGPT
A command-line interface to generate textual and conversational datasets with LLMs.
☆297Updated 2 years ago
pengbaolin / LLM-Augmenter
☆443Updated 2 years ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆714Updated 2 years ago
mallorbc / Finetune_LLMs
Repo for fine-tuning Casual LLMs
☆456Updated last year
OpenBioLink / ThoughtSource
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research …
☆995Updated 9 months ago
danielgross / LlamaAcademy
A school for camelids
☆1,207Updated 2 years ago